Robbie-big-data

From Earlham CS Department
Revision as of 11:19, 9 December 2011 by Rdbean08 (talk | contribs)
Jump to navigation Jump to search

Snapshot of Youtube as of Feb. 22, 2008 749362 videos crawled

Project Tasks
  1. Identifying and downloading the target data set
  2. Data cleaning and pre-processing
  3. Load the data into your Postgres instance
  4. Develop queries to explore your ideas in the data
  5. Develop and document the model function you are exploring in the data
  6. Develop a visualization to show the model/patterns in the data
Tech Details
  • Used postgres instance on my personal computer
  • All data is in rdbean08/Public on the ACLs
Results
  • Visualization:
Most Prolific Uploaders
Total Comments of Top Uploaders
Most Popular Tags by Uploads
Most Popular Tags by Views
Number of Uploads Over Time
Number of Views Over Time
  • The story