Week Ending Sept 24
This week I worked on gathering useful datasets for the Healthcare and Twitter categories (follow up from previous weeks' work).
HealthCare Datasets
Twitter Datasets
Name | Location | Description | Size |
---|---|---|---|
Volume Time Series of Memetracker Phrases and Twitter Hashtags | http://snap.stanford.edu/data/TwtHtag.txt | Twitter Hashtags | 1000 |
Higgs Twitter Dataset | http://snap.stanford.edu/data/higgs-twitter.html | 5 Graph datasets | varies |
ICWSM Datasets | https://www.icwsm.org/2015/datasets/datasets/ | 11 Datasets available upon request/registration | |
Harvard Dataverse | https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/TQBLWZ | This dataset contains the tweet ids of 5,655,632 tweets that were collected from approximately 3000 Twitter accounts affiliated with the U.S. government. | 101.5MB |
TweetID Datasets | https://www.docnow.io/catalog/ | Collection of Tweet IDs based on Hashtags |