Annotated-directory-big-data
This is an annotated directory of public, freely available, "large" data sets. For now they are in no particular order.
Google ngrams
- URL - http://books.google.com/ngrams/datasets
- Description - The ngram data sets on which Google's Ngram Viewer is built. A variety of corpora are available, e.g. by language, the "Google Million", English fiction, etc. Each set lists ngrams together with their frequencies and the years in which they occur.
- Curator - CharlieP
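
As a rough illustration of working with the ngram, frequency, and date fields described above, here is a minimal Python sketch. It assumes each record is a tab-separated line of the form `ngram TAB year TAB match_count TAB volume_count`; check that layout against the documentation on the download page for the version you use, since older releases may carry an extra column. The function names and the sample records are made up for illustration and are not part of the data set.

```python
from collections import defaultdict
from typing import Iterable


def parse_ngram_line(line: str) -> tuple[str, int, int, int]:
    """Split one record into (ngram, year, match_count, volume_count).

    Assumes four tab-separated fields per line (an assumption, see above).
    """
    ngram, year, match_count, volume_count = line.rstrip("\n").split("\t")
    return ngram, int(year), int(match_count), int(volume_count)


def total_matches(lines: Iterable[str]) -> dict[str, int]:
    """Sum match_count over all years for each ngram."""
    totals: dict[str, int] = defaultdict(int)
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        ngram, _year, match_count, _volumes = parse_ngram_line(line)
        totals[ngram] += match_count
    return dict(totals)


# Made-up sample records, not real counts from the data set.
sample = [
    "big data\t2005\t120\t40\n",
    "big data\t2006\t150\t55\n",
    "data set\t2005\t900\t300\n",
]
totals = total_matches(sample)
print(totals)
```

Because `total_matches` accepts any iterable of lines, an open file handle can be passed in directly, so a large file is processed one line at a time rather than loaded into memory.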
MusicBrainz
- URL - http://musicbrainz.org/doc/MusicBrainz_Database
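- Description - An open, community-maintained database of music metadata (artists, releases, recordings, and so on); in a nutshell, the musical equivalent of IMDb.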
- Curator - Jahelton07
Another Data Set
- URL -
- Description -
- Curator -