Difference between revisions of "Annotated-directory-big-data"
Jump to navigation
Jump to search
(→Another Data Set) |
(→Another Data Set) |
||
Line 18: | Line 18: | ||
* Curator - Twright09 | * Curator - Twright09 | ||
− | ==== | + | ==== Large Data Sets on AWS ==== |
− | * URL - | + | * URL - http://aws.amazon.com/publicdatasets/#1 |
− | * Description - | + | * Description - A list of large data sets on Amazon's AWS, more data sets within the four links in the list. |
− | * Curator - | + | * Curator - Twright09 |
==== Another Data Set ==== | ==== Another Data Set ==== |
Revision as of 21:27, 6 October 2011
This is an annotated directory of public, freely available, "large" data sets. For now they are in no particular order.
Google ngrams
- URL - http://books.google.com/ngrams/datasets
- Description - The ngram databases on which Google's ngram viewer is built. A variety of corpora are available, e.g. by language, the "Google Million", English fiction, etc. Each set contains a list of ngrams, frequency, and date information.
- Curator - CharlieP
MusicBrainz
- URL - http://musicbrainz.org/doc/MusicBrainz_Database
- Description - In a nutshell, the musical equivalent of IMDb.
- Curator - Jahelton07
World Cubing Association Database
- Browse - http://worldcubeassociation.org/results/
- Download Database - http://www.worldcubeassociation.org/results/misc/export.html
- Description - All times, competitions, competitors of WCA competitions from 1984 until now.
- Curator - Twright09
Large Data Sets on AWS
- URL - http://aws.amazon.com/publicdatasets/#1
- Description - A list of large data sets on Amazon's AWS, more data sets within the four links in the list.
- Curator - Twright09
Another Data Set
- URL -
- Description -
- Curator -
Another Data Set
- URL -
- Description -
- Curator -
Another Data Set
- URL -
- Description -
- Curator -