Task 1: Setup, load and prepare datasets (~5%)

Setup

Create a level 2 section in your notebook called Setup.

Datasets

Create a level 2 section in your notebook called Dataset.

The dataset __details posted in slack___ consists of five files. For each import, and as feather format to subfolder orig.

df_authors from file authors.feather

List of 13,182 reddit users who commented/submitted to QAnon identified subreddits. Columns:

df_comments from file comments.feather

List of 10,831,922 comments with full text. Columns:

df_submissions from file submissions.feather

List of 2,099,875 posts with full text. Columns:

df_subreddits from file subreddits.feather

List of 12,987 subreddits where at least two QAnon-enthusiastic users have made a submission. Too many columns to list here.

df_paper from file paper.feather

List of 19 subreddits, identified in the paper [see paper, Appendix A], where QAnon users were more active.