Difference between revisions of "Robbie-big-data"
Jump to navigation
Jump to search
(2 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
− | + | ||
− | + | == Headline text == | |
+ | Snapshot of Youtube as of Feb. 22, 2008 | ||
+ | 749362 videos crawled | ||
===== Project Tasks ===== | ===== Project Tasks ===== | ||
Line 11: | Line 13: | ||
===== Tech Details ===== | ===== Tech Details ===== | ||
− | * | + | *Used postgres instance on my personal computer |
− | * | + | *All data is in rdbean08/Public on the ACLs |
+ | ===== Results ===== | ||
+ | * Visualization: | ||
+ | Most Prolific Uploaders | ||
+ | Total Comments of Top Uploaders | ||
+ | Most Popular Tags by Uploads | ||
+ | Most Popular Tags by Views | ||
+ | Number of Uploads Over Time | ||
+ | Number of Views Over Time | ||
+ | * The story | ||
+ | [[File:youtubedata.tar]] |
Latest revision as of 23:47, 15 December 2011
Headline text
Snapshot of Youtube as of Feb. 22, 2008 749362 videos crawled
Project Tasks
- Identifying and downloading the target data set
- Data cleaning and pre-processing
- Load the data into your Postgres instance
- Develop queries to explore your ideas in the data
- Develop and document the model function you are exploring in the data
- Develop a visualization to show the model/patterns in the data
Tech Details
- Used postgres instance on my personal computer
- All data is in rdbean08/Public on the ACLs
Results
- Visualization:
Most Prolific Uploaders Total Comments of Top Uploaders Most Popular Tags by Uploads Most Popular Tags by Views Number of Uploads Over Time Number of Views Over Time
- The story