Annotated-directory-big-data

From Earlham CS Department
Revision as of 11:23, 4 October 2011 by Charliep (talk | contribs) (Google ngrams)
Jump to navigation Jump to search

This is an annotated directory of public, freely available, "large" data sets. For now they are in no particular order.

Google ngrams

  • URL - http://books.google.com/ngrams/datasets
  • Description - The ngram databases on which Google's ngram viewer is built. A variety of corpora are available, e.g. by language, the "Google Million", etc.
  • Curator - CharlieP

Another Data Set

Another Data Set

Another Data Set

Another Data Set

Another Data Set

Another Data Set

Another Data Set