Twitter 2010 data set

The Twitter_2010 data set contains tweets with URLs that were posted on Twitter during October 2010. In addition to the tweets, we also collected the followee links of the tweeting users, allowing us to reconstruct the follower graph of active (tweeting) users.
URLs: 66,059
tweets: 2,859,764
users: 736,930
links: 36,743,448
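
As a rough illustration of how a follower graph can be rebuilt from followee links, the Python sketch below groups (user, followee) pairs by followee and keeps only links whose endpoints both tweeted. The function name build_follower_graph, the pair layout, and the sample values are assumptions made for this example; they are not the column layout or contents of the files described on this page.

```python
from collections import defaultdict


def build_follower_graph(links, active_users):
    """Map each active user to the set of active users who follow them.

    `links` is an iterable of (user, followee) pairs. This pair layout is
    an assumption for illustration, not the documented file format.
    """
    followers = defaultdict(set)
    for user, followee in links:
        # Keep a link only when both endpoints are active (tweeting) users.
        if user in active_users and followee in active_users:
            followers[followee].add(user)
    return dict(followers)


# Made-up sample values, not taken from the data set.
sample_links = [(1, 2), (3, 2), (2, 1), (4, 5)]
sample_active = {1, 2, 3}

graph = build_follower_graph(sample_links, sample_active)
```

Storing followers per followee makes it cheap to look up who would see a given user's tweet, which is the direction that matters when tracing how a URL spreads.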

Tweets

The table link_status_search_with_ordering_real_csv (in CSV format) contains tweets with the following information:

The table distinct_users_from_search_table_real_map (in CSV format) contains the names of tweeting users and the following information for each user:

Follower graph

The file active_follower_real_sql contains a zipped SQL dump of links between tweeting users in the form:

An empirical characterization of this data is described in
Kristina Lerman, Rumi Ghosh, and Tawan Surachawala (2012) "Social Contagion: An Empirical Study of Information Spread on Digg and Twitter Follower Graphs."

This data is made available to the community for research purposes only. If you use the data in a publication, please cite


Copyright 2000-2014 University of Southern California Information Sciences Institute