I’m interested in real-time (near real-time, for now) analytics of social media discussion about education policy. I plan to share some of my results here on a weekly basis.
As a start, I used the Twitter API to retrieve 1,228 tweets containing the hashtag #EdPolicy that were posted over the past week, from May 2 to May 9, 2016.
The text of tweets is very noisy because of typos, abbreviated spellings, URLs, emoticons, etc. After a good deal of data processing and cleaning (removing URLs, stopwords, extra whitespace, etc.), and with help from David Fikis, a doctoral candidate at Georgia State University, I was able to extract the most frequently used hashtags in the 1,228 tweets.
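For readers who want to try this kind of cleaning themselves, here is a minimal sketch in Python. It is not the code I actually ran (my own workflow was in R), and the stopword list, the regular expressions, and the function names `clean_tweet`, `extract_hashtags`, and `top_hashtags` are all assumptions made up for illustration. It strips URLs and @mentions, lowercases the text, drops punctuation and stopwords, collapses whitespace, and counts hashtags.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not the list used for the post).
STOPWORDS = {"a", "an", "the", "and", "or", "of", "to", "in", "on", "for", "is", "are", "rt"}

URL_RE = re.compile(r"https?://\S+")   # links such as http://t.co/...
MENTION_RE = re.compile(r"@\w+")       # @username mentions
HASHTAG_RE = re.compile(r"#(\w+)")     # captures the tag without the '#'


def extract_hashtags(tweet):
    """Return the lowercased hashtags (without '#') found in one tweet."""
    # Remove URLs first so a '#fragment' inside a link is not counted as a hashtag.
    text = URL_RE.sub(" ", tweet)
    return [tag.lower() for tag in HASHTAG_RE.findall(text)]


def clean_tweet(tweet):
    """Strip URLs, mentions, punctuation, stopwords, and extra whitespace."""
    text = URL_RE.sub(" ", tweet)
    text = MENTION_RE.sub(" ", text)
    text = text.lower()
    # Keep letters, digits, '#', and whitespace; everything else becomes a space.
    text = re.sub(r"[^a-z0-9#\s]", " ", text)
    tokens = [tok for tok in text.split() if tok not in STOPWORDS]
    return " ".join(tokens)


def top_hashtags(tweets, n=10):
    """Return the n most frequent hashtags as (tag, count) pairs."""
    counts = Counter()
    for tweet in tweets:
        counts.update(extract_hashtags(tweet))
    return counts.most_common(n)
```

Calling `top_hashtags(tweets)` on a list of raw tweet strings gives the most common hashtags, and `clean_tweet` gives text that is closer to ready for a later modeling step. A real pipeline would want a fuller stopword list and some handling for emoticons and misspellings, which this sketch ignores.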
Using the R package topicmodels, I identified 10 topics in the #EdPolicy tweets. Here are the results.