Difference between revisions of "Robbie-big-data"

From Earlham CS Department
Jump to navigation Jump to search
 
Line 1: Line 1:
 +
 +
== Headline text ==
 
Snapshot of Youtube as of Feb. 22, 2008
 
Snapshot of Youtube as of Feb. 22, 2008
 
749362 videos crawled
 
749362 videos crawled
Line 22: Line 24:
 
  Number of Views Over Time
 
  Number of Views Over Time
 
* The story
 
* The story
 +
[[File:youtubedata.tar]]

Latest revision as of 23:47, 15 December 2011

Headline text

Snapshot of Youtube as of Feb. 22, 2008 749362 videos crawled

Project Tasks
  1. Identifying and downloading the target data set
  2. Data cleaning and pre-processing
  3. Load the data into your Postgres instance
  4. Develop queries to explore your ideas in the data
  5. Develop and document the model function you are exploring in the data
  6. Develop a visualization to show the model/patterns in the data
Tech Details
  • Used postgres instance on my personal computer
  • All data is in rdbean08/Public on the ACLs
Results
  • Visualization:
Most Prolific Uploaders
Total Comments of Top Uploaders
Most Popular Tags by Uploads
Most Popular Tags by Views
Number of Uploads Over Time
Number of Views Over Time
  • The story

File:Youtubedata.tar