Twitter Datasets

By Volkan TUNALI, June 22, 2010 11:14 pm

Twitter data gathered through the Twitter’s streaming API was published at http://demeter.inf.ed.ac.uk/index.php (about 5.5 GB) in February and April. Second release was a bit of cleaning the first one.

On June 14, Twitter event detection corpus was also released at the same address.

Actually I have no idea how those datasets can be made use of for clustering experiments. Worth having a look at, though. Especially, it may be used in large scale data mining research, as the authors say.

——————————————-
UPDATE April 12, 2011: I’m sorry to see that Twitter dataset is no longer available due to the request of Twitter. I hope an up-to-date Twitter dataset will be available soon.

I am unable to provide download links for this dataset. Please do not make requests for this dataset. I’m sorry.

2 Responses to “Twitter Datasets”

  1. sasan says:

    hi
    I need to this dataset.

    please help me!!

    thanks

  2. Sasan,

    I’ve sent you a mail. Please reply back.

    Volkan

Panorama Theme by Themocracy