Gus-big-data

From Earlham CS Department
Latest revision as of 11:23, 2 December 2011

  • Project title: NYC Subways
  • Project data set: New York MTA data (performance metrics)
Project Tasks
  1. Identifying and downloading the target data set
  2. Data cleaning and pre-processing
  3. Load the data into your Postgres instance
  4. Develop queries to explore your ideas in the data
  5. Develop and document the model function you are exploring in the data
  6. Develop a visualization to show the model/patterns in the data
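Tasks 2 through 4 above can be sketched end to end in a few lines of Python. This is only an illustration under stated assumptions: the column names (agency, indicator, period, actual, target), the sample rows, and the table name performance are hypothetical and are not taken from the real MTA export. The standard-library sqlite3 module stands in for the Postgres instance so the sketch runs without a server; with a Postgres driver the same steps would apply, but the connection call and parameter placeholders would differ.

```python
import csv
import io
import sqlite3

# Hypothetical sample rows; the real MTA performance file may use other columns.
RAW_CSV = """agency,indicator,period,actual,target
NYC Transit, On-Time Performance ,2011-01,85.2,91.9
NYC Transit,On-Time Performance,2011-02,,91.9
NYC Transit,On-Time Performance,2011-03,88.1,91.9
"""


def clean_rows(text):
    """Trim whitespace and drop rows whose numeric fields are missing or invalid."""
    cleaned = []
    for row in csv.DictReader(io.StringIO(text)):
        row = {key: value.strip() for key, value in row.items()}
        try:
            actual = float(row["actual"])
            target = float(row["target"])
        except ValueError:
            continue  # skip rows that cannot be parsed as numbers
        cleaned.append(
            (row["agency"], row["indicator"], row["period"], actual, target)
        )
    return cleaned


def load_rows(conn, rows):
    """Create the (hypothetical) performance table and insert the cleaned rows."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS performance ("
        "agency TEXT, indicator TEXT, period TEXT, actual REAL, target REAL)"
    )
    conn.executemany("INSERT INTO performance VALUES (?, ?, ?, ?, ?)", rows)
    conn.commit()


def average_gap(conn):
    """Return the average shortfall (target minus actual) per indicator."""
    return conn.execute(
        "SELECT indicator, AVG(target - actual) "
        "FROM performance GROUP BY indicator ORDER BY indicator"
    ).fetchall()


# sqlite3 in-memory database used as a stand-in for Postgres (assumption).
conn = sqlite3.connect(":memory:")
rows = clean_rows(RAW_CSV)
load_rows(conn, rows)
gaps = average_gap(conn)
```

Keeping cleaning, loading, and querying in separate functions means the cleaning step can be checked on a small sample before the full data set is loaded on the project node.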
Tech Details
  • Node: as8
  • Path to storage space: /scratch/big-data/gus
Results
  • The visualization(s)
  • The story