Difference between revisions of "Robbie-big-data"

From Earlham CS Department

Latest revision as of 23:47, 15 December 2011

Project data set

Snapshot of YouTube as of Feb. 22, 2008; 749,362 videos crawled.

Project Tasks
  1. Identifying and downloading the target data set
  2. Data cleaning and pre-processing
  3. Load the data into your Postgres instance
  4. Develop queries to explore your ideas in the data
  5. Develop and document the model function you are exploring in the data
  6. Develop a visualization to show the model/patterns in the data
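
The cleaning and loading steps above can be sketched roughly as follows. This is only an illustration under stated assumptions: the page does not describe the crawl file format, so the tab-separated layout, the column names, and the `videos` table are all hypothetical, and Python's built-in `sqlite3` stands in for the Postgres instance so the example runs without a server.

```python
import csv
import io
import sqlite3

# Hypothetical column layout; the real crawl file format is not described on this page.
COLUMNS = ["video_id", "uploader", "upload_date", "views", "comments", "tags"]


def clean_rows(lines):
    """Yield cleaned tuples from tab-separated lines, skipping malformed records."""
    for fields in csv.reader(lines, delimiter="\t"):
        if len(fields) != len(COLUMNS):
            continue  # drop records with missing or extra fields
        video_id, uploader, upload_date, views, comments, tags = (
            f.strip() for f in fields
        )
        if not video_id or not views.isdigit() or not comments.isdigit():
            continue  # drop records with an empty id or non-numeric counts
        yield (video_id, uploader, upload_date, int(views), int(comments), tags)


def load(conn, rows):
    """Create the (assumed) videos table and bulk-insert the cleaned rows."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS videos ("
        "video_id TEXT PRIMARY KEY, uploader TEXT, upload_date TEXT, "
        "views INTEGER, comments INTEGER, tags TEXT)"
    )
    # INSERT OR IGNORE is SQLite syntax; Postgres would use ON CONFLICT DO NOTHING.
    conn.executemany("INSERT OR IGNORE INTO videos VALUES (?, ?, ?, ?, ?, ?)", rows)
    conn.commit()


# Made-up sample records: two valid, one with too few fields, one with a bad view count.
SAMPLE = (
    "abc123\talice\t2007-05-01\t1500\t12\tmusic,live\n"
    "def456\tbob\t2008-01-15\t300\t4\tcomedy\n"
    "bad-row\tonly-two-fields\n"
    "ghi789\talice\t2008-02-02\tn/a\t0\tmusic\n"
)

conn = sqlite3.connect(":memory:")
load(conn, clean_rows(io.StringIO(SAMPLE)))
loaded = conn.execute("SELECT COUNT(*) FROM videos").fetchone()[0]
print(loaded)
```

Filtering before the insert keeps malformed records out of the table, so later queries do not have to guard against missing or non-numeric counts.
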
Tech Details
  • Used a Postgres instance on my personal computer
  • All data is in rdbean08/Public on the ACLs
Results
  • Visualization:
      • Most Prolific Uploaders
      • Total Comments of Top Uploaders
      • Most Popular Tags by Uploads
      • Most Popular Tags by Views
      • Number of Uploads Over Time
      • Number of Views Over Time
  • The story

File:Youtubedata.tar
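
The visualizations listed under Results could be backed by aggregate queries along the following lines. This is a hedged sketch, not the queries actually used for the project: the `videos` table, its columns, and the sample rows are hypothetical, tags are assumed to be stored as a comma-separated string, and `sqlite3` again stands in for Postgres.

```python
import sqlite3
from collections import Counter

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE videos (video_id TEXT, uploader TEXT, upload_date TEXT, "
    "views INTEGER, comments INTEGER, tags TEXT)"
)
# Made-up rows for illustration only.
conn.executemany(
    "INSERT INTO videos VALUES (?, ?, ?, ?, ?, ?)",
    [
        ("v1", "alice", "2007-05-01", 1500, 12, "music,live"),
        ("v2", "bob", "2008-01-15", 300, 4, "comedy"),
        ("v3", "alice", "2008-01-20", 700, 9, "music"),
    ],
)

# Most prolific uploaders, with the total comments on their videos.
top_uploaders = conn.execute(
    "SELECT uploader, COUNT(*) AS uploads, SUM(comments) AS total_comments "
    "FROM videos GROUP BY uploader ORDER BY uploads DESC, uploader LIMIT 10"
).fetchall()

# Uploads and views over time, bucketed by the YYYY-MM prefix of the upload date.
by_month = conn.execute(
    "SELECT substr(upload_date, 1, 7) AS month, COUNT(*) AS uploads, "
    "SUM(views) AS views FROM videos GROUP BY month ORDER BY month"
).fetchall()

# Tags are a comma-separated string in this sketch, so they are tallied in Python.
tag_uploads, tag_views = Counter(), Counter()
for tags, views in conn.execute("SELECT tags, views FROM videos"):
    for tag in filter(None, (t.strip() for t in tags.split(","))):
        tag_uploads[tag] += 1
        tag_views[tag] += views

print(top_uploaders)
print(by_month)
print(tag_uploads.most_common(3), tag_views.most_common(3))
```

Each result is a small list of rows that can be handed to whatever plotting tool is used for the charts.
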