Yes, we are going back in time to the Presidential Debate in the US 2020 - the time of lots of unhappy Tweeting. It’s just too good a dataset and case to let it go…
Political tweets: https://github.com/SDS-AAU/SDS-master/raw/master/M2/data/pol_tweets.gz
from https://github.com/alexlitel/congresstweets We’ve preprocessed a bit to make things easier. 0: Dems. 1: Rep.
Tweets around the time of the debate in oktober 20 (8000): https://github.com/SDS-AAU/SDS-master/raw/master/M2/data/pres_debate_2020.gz
Both datasets are in JSON format.
Time | Activity |
---|---|
9:10-9:30 | Indivitual/Groups work making sense of data, preprocess |
9:30-9:45 | Follow up in class |
10:00-11:00 | Train SML model for congress and classify the pres. debate tweets R/Py split |
11:10-11:30 | Discuss solutions R/Py split |
11:30-12:00 | Joint review, Hand out Peergrade assignment |
R team :::: HERE ::::
Python team :::: HERE ::::