Jump to navigation Jump to search
This is an annotated directory of public, freely available, "large" data sets. For now they are in no particular order.
- URL - http://books.google.com/ngrams/datasets
- Description - The ngram databases on which Google's ngram viewer is built. A variety of corpora are available, e.g. by language, the "Google Million", etc.
- Curator - CharlieP