Difference between revisions of "Annotated-directory-big-data"

From Earlham CS Department
Jump to navigation Jump to search
(Another Data Set)
(Another Data Set)
Line 12: Line 12:
 
* Curator - Jahelton07
 
* Curator - Jahelton07
  
==== Another Data Set ====
+
==== World Cubing Association Database ====
* URL -  
+
* Browse - http://worldcubeassociation.org/results/
* Description -  
+
* Download Database - http://www.worldcubeassociation.org/results/misc/export.html
* Curator -  
+
* Description - All times, competitions, competitors of WCA competitions from 1984 until now.
 +
* Curator - Twright09
  
 
==== Another Data Set ====
 
==== Another Data Set ====

Revision as of 21:13, 6 October 2011

This is an annotated directory of public, freely available, "large" data sets. For now they are in no particular order.

Google ngrams

  • URL - http://books.google.com/ngrams/datasets
  • Description - The ngram databases on which Google's ngram viewer is built. A variety of corpora are available, e.g. by language, the "Google Million", English fiction, etc. Each set contains a list of ngrams, frequency, and date information.
  • Curator - CharlieP

MusicBrainz

World Cubing Association Database

Another Data Set

  • URL -
  • Description -
  • Curator -

Another Data Set

  • URL -
  • Description -
  • Curator -

Another Data Set

  • URL -
  • Description -
  • Curator -

Another Data Set

  • URL -
  • Description -
  • Curator -

Another Data Set