diff --git a/README.rst b/README.rst index 351e8c8..53d9ad5 100644 --- a/README.rst +++ b/README.rst @@ -26,6 +26,8 @@ Tweet datasets * `3 million Russian troll tweets `_ {?} [3m] - Released by 538. +* `Lerman Twitter 2010 Dataset `_ [2.8m] - Contains tweets containing URLs that have been posted on Twitter during October 2010. In addition to tweets, links of tweeting users were followed, allowing the reconstruction the follower graph of active (tweeting) users. + * `MovieTweetings `_ {`MIT`_} [725k] - A live movie rating dataset collected from Twitter. * `350k MeToo tweets `_ {?} [350k] @@ -60,6 +62,8 @@ User datasets * `Twitter Social Graph `_ {?} [41m] - From the `"What is Twitter, a Social Network or a News Media?" paper `_. +* `Arizona State University Twitter Data Set `_ [11m] + * `Twitter User Sample (Tweets Loud and Quiet) `_ {`MPL 2.0`_} [400k] - Metadata of ~400,000 Twitter accounts, scraped between September 17, 2013, and October 19, 2013, as part of the work on the `"Tweets loud and quiet" article `_. * `Higgs Twitter Dataset `_ {?} [456k] - The Higgs dataset has been built after monitoring the spreading processes on Twitter before, during and after the announcement of the discovery of a new particle with the features of the elusive Higgs boson on 4th July 2012. @@ -88,8 +92,11 @@ Other Lists * `Twitter open datasets `_ - A question on `opendata.stackexchange `_. -Data Collection Tools -===================== +Tools +===== + +Data Collection +--------------- * `twitter-dataset-collector `_ {`Apache License 2.0`_} [Java] - Facilitates the distribution of Twitter datasets by downloading sets of tweets (if still available) using their ids as input. @@ -99,10 +106,7 @@ Data Collection Tools Analysis -======== - -Analysis Tools --------------- +-------- * `OSU Twitter NLP Tools `_ - A suite of Twitter NLP tools. @@ -115,11 +119,16 @@ Analysis Tools * `Tools by Alan Ritter `_ - Several Twitter-related tools by Alan Ritter. -Analysis Articles ------------------ +Academic Papers +=============== + + +Articles & blog posts +===================== `Twitter sentiment analysis using Python and NLTK `_ +`72 Hours of #Gamersgate