Additional resources related to social networks and computer science:

Related teaching


Unfortunately, although there are now several books on these topics (many of them very well written), none of them covers all of the material required for a graduate-level computer science class oriented towards the algorithmic aspects of social networks. Here are some that could be useful for your own study:
  • Networks, Crowds, and Markets, by D. Easley and J. Kleinberg, Cambridge (2010),
    electronic copy available for free
  • Epidemics and Rumours in Complex Networks, by M. Draief and L. Massoulié, Cambridge (2009),
  • Complex Social Networks, by F. Vega Redondo, Cambridge (2007),

  • Social and Economic Networks, by Matthew O. Jackson, Princeton (2008),
  • Connections: An Introduction to the Economics of Networks, by S. Goyal, Princeton (2009),

  • Networks, An Introduction, by M. Newman, Oxford (2010),

  • Random Graph Dynamics, by R. Durrett, Cambridge (2007).

List of Available Data sets online (quoted from different sources):

Twitter makes about 10% of its public tweets available through the garden-hose API. Check it out at:

The Max Planck Institute has made data from its IMC 2007 paper, WOSN 2008 papers, WWW 2009 paper, and WOSN 2009 paper, as well as from Alan Mislove's PhD thesis, publicly available. Details at:

The Stanford Large Network Dataset Collection makes several data sets (not limited to social networks) available at the following URL:

(a repository website)

(From J. Kleinberg's webpage)
Network Datasets
There are a number of interesting network datasets available on the Web; they form a valuable resource for trying out algorithms and models across a range of settings.
  • Collaboration and citation networks: For the 2003 KDD Cup competition, Johannes Gehrke, Paul Ginsparg, and I provided a dataset based on the arXiv pre-print database, which allows one to study the networks of co-authorships and citations among a large community of physicists. Here is the KDD Cup dataset and a paper describing the competition in more detail.

  • Internet topology: The network structure of the Internet can be studied at several levels of resolution. Here is a dataset at the autonomous system (AS) level.

  • Web subgraphs: There are many such datasets available for download. One set is maintained by Panayiotis Tsaparas; the experiments that used this data are described in his Ph.D. thesis, and in other papers linked from his home page.

  • Semantic networks: Free association datasets for words have been collected by cognitive scientists; these are constructed by compiling the free responses of test subjects when presented with cue words. (For example, a test subject presented with the cue word 'ice' might react with the word 'cold', 'cream', or 'water'.)

(Taken from MPI website)
Data from our IMC 2007 paper, our WOSN 2008 papers, our WWW 2009 paper, our WOSN 2009 paper, and Alan Mislove's PhD Thesis is publicly available by emailing Alan Mislove at amislove (at) mpi-sws (dot) org. Each of the data sets has been anonymized to protect the privacy of the social network users.
Alan Mislove, Massimiliano Marcon, Krishna P. Gummadi, Peter Druschel, Bobby Bhattacharjee. Measurement and Analysis of Online Social Networks. In Proceedings of the 5th ACM/USENIX Internet Measurement Conference (IMC'07), San Diego, CA, October 2007.
Meeyoung Cha, Alan Mislove, Ben Adams, Krishna P. Gummadi. Characterizing Social Cascades in Flickr. In Proceedings of the 1st Workshop on Online Social Networks (WOSN'08), Seattle, August 2008.
Meeyoung Cha, Alan Mislove, and Krishna P. Gummadi. A Measurement-driven Analysis of Information Propagation in the Flickr Social Network. In Proceedings of the 18th Annual World Wide Web Conference (WWW'09), Madrid, Spain, April 2009.
Fabrício Benevenuto, Tiago Rodrigues, Meeyoung Cha, and Virgílio Almeida. Characterizing User Behavior in Online Social Networks. In Proceedings of Usenix/ACM SIGCOMM Internet Measurement Conference (IMC), Chicago, Illinois, November 2009.

(Taken from J. Leskovec's course resource sections):
SNAP network datasets
Yahoo! Webscope Catalog of datasets
  • Note: Jure Leskovec will have to apply for any sets you want, and we must agree not to distribute them further.
    There may be a delay, so get requests in early.
Coauthorship and Citation Networks
Internet Topology
Movie Ratings
Who trusts whom data at Trustlet
Mark Newman's pointers