Difference between revisions of "Robbie-big-data"
Jump to navigation
Jump to search
Line 1: | Line 1: | ||
+ | |||
+ | == Headline text == | ||
Snapshot of Youtube as of Feb. 22, 2008 | Snapshot of Youtube as of Feb. 22, 2008 | ||
749362 videos crawled | 749362 videos crawled | ||
Line 22: | Line 24: | ||
Number of Views Over Time | Number of Views Over Time | ||
* The story | * The story | ||
+ | [[File:youtubedata.tar]] |
Latest revision as of 23:47, 15 December 2011
Headline text
Snapshot of Youtube as of Feb. 22, 2008 749362 videos crawled
Project Tasks
- Identifying and downloading the target data set
- Data cleaning and pre-processing
- Load the data into your Postgres instance
- Develop queries to explore your ideas in the data
- Develop and document the model function you are exploring in the data
- Develop a visualization to show the model/patterns in the data
Tech Details
- Used postgres instance on my personal computer
- All data is in rdbean08/Public on the ACLs
Results
- Visualization:
Most Prolific Uploaders Total Comments of Top Uploaders Most Popular Tags by Uploads Most Popular Tags by Views Number of Uploads Over Time Number of Views Over Time
- The story