Gus-big-data
Latest revision as of 10:23, 2 December 2011
- NYC Subways
- New York MTA data (performance metrics)
Project Tasks
- Identify and download the target data set
- Clean and pre-process the data
- Load the data into your Postgres instance
- Develop queries to explore your ideas in the data
- Develop and document the model function you are exploring in the data
- Develop a visualization to show the model/patterns in the data
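
The cleaning and loading tasks above could be sketched roughly as follows. This is only a minimal illustration, not the project's actual code: it assumes the downloaded MTA performance data is a CSV file, and the column names (indicator, period, value) are hypothetical placeholders for whatever the real file contains. It uses only the Python standard library and produces a cleaned CSV that Postgres can bulk-load.

```python
import csv
import io


def clean_rows(raw_text):
    """Yield cleaned rows (as dicts) from raw CSV text.

    Assumption: the file has a numeric column named "value";
    that name is hypothetical and should match the real data.
    """
    reader = csv.DictReader(io.StringIO(raw_text))
    for row in reader:
        # Trim stray whitespace from header names and cell values.
        row = {k.strip(): (v or "").strip()
               for k, v in row.items() if k is not None}
        if not row.get("value"):
            continue  # skip rows with no measurement
        try:
            # Remove thousands separators before converting to a number.
            row["value"] = float(row["value"].replace(",", ""))
        except ValueError:
            continue  # skip rows whose value is not numeric
        yield row


def to_copy_csv(rows, fieldnames):
    """Serialize cleaned rows to CSV text with a header line."""
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fieldnames,
                            extrasaction="ignore", lineterminator="\n")
    writer.writeheader()
    for row in rows:
        writer.writerow(row)
    return out.getvalue()
```

The text returned by to_copy_csv can be written to a file under the storage path and then loaded from psql with something like \copy mta_metrics FROM 'cleaned.csv' WITH (FORMAT csv, HEADER true), where mta_metrics and cleaned.csv are placeholder names for a table and file you create yourself.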
Tech Details
- Node: as8
- Path to storage space: /scratch/big-data/gus
Results
- The visualization(s)
- The story