Task 3: Match Paper (~20%)

Create a section in your notebook called Match Paper.


(Again this is a small section to break up the grading - so a few lines of code in each case.)

When reading the paper, I highlighted in green the data that I verified. The markup is visible on the local copy of the paper. I want you to wrote code to:

  1. Verify each of the data values marked in green in order given in the paper (ยง3 Data Collection).
  2. Reproduce the green marked data in Table 1.
  3. Reproduce Table 2.

Next we are going to following the step covered in the topic modelling of Trumps false claims practical and clean the text features. In this dataset we have three such features df_comments.body, or df_submissions.title and df_submissions.text.