This repository contains the necessary files to work with the Twitter dataset. Due to its large file size, the actual dataset is not hosted on this repository. The twitter-2010.graph can be downloaded here, and should be placed inside the net subdirectory.
The dataset is a crawl presented by Haewoon Kwak, Changhyun Lee, Hosung Park, and Sue Moon in “What is Twitter, a Social Network or a News Media?”, Proceedings of the 19th International World Wide Web (WWW) Conference, pages 591−600, 2010, ACM press.
WebGraph is a framework for graph compression aimed at studying web graphs. It provides simple ways to manage very large graphs, exploiting modern compression techniques. WebGraph is developed by the Laboratory for Web Algorithmics.