Difference between revisions of "Annotated-directory-big-data"
Jump to navigation
Jump to search
Jahelton07 (talk | contribs) (→Another Data Set) |
|||
Line 7: | Line 7: | ||
* Curator - CharlieP | * Curator - CharlieP | ||
− | ==== | + | ==== MusicBrainz ==== |
− | * URL - | + | * URL - http://musicbrainz.org/doc/MusicBrainz_Database |
− | * Description - | + | * Description - In a nutshell, the musical equivalent of IMDb. |
− | * Curator - | + | * Curator - Jahelton07 |
==== Another Data Set ==== | ==== Another Data Set ==== |
Revision as of 14:16, 4 October 2011
This is an annotated directory of public, freely available, "large" data sets. For now they are in no particular order.
Google ngrams
- URL - http://books.google.com/ngrams/datasets
- Description - The ngram databases on which Google's ngram viewer is built. A variety of corpora are available, e.g. by language, the "Google Million", English fiction, etc. Each set contains a list of ngrams, frequency, and date information.
- Curator - CharlieP
MusicBrainz
- URL - http://musicbrainz.org/doc/MusicBrainz_Database
- Description - In a nutshell, the musical equivalent of IMDb.
- Curator - Jahelton07
Another Data Set
- URL -
- Description -
- Curator -
Another Data Set
- URL -
- Description -
- Curator -
Another Data Set
- URL -
- Description -
- Curator -
Another Data Set
- URL -
- Description -
- Curator -
Another Data Set
- URL -
- Description -
- Curator -