Difference between revisions of "Tristan-big-data"

Revision as of 12:30, 3 December 2011

Examining Trends in a Performance Sport
Data set: WCA Database

If you have multiple tries for a particular task in which you want to do well, the first will be the worst and the last will be the best.
The younger you start something the more you'll be able to improve overtime, whereas if you start something later in your life, it is less likely chance you will be able to improve as drastically, quickly, or as much.

The two tid-bits extrapolated above are by no means proven to be concrete, this are only some spotted information gathered from a cubing database. To examine this further, other performance sports should be investigated.

@@ Line 6: / Line 6: @@
 ===== Project Tasks =====
--Identifying and downloading the target data set
+*Identifying and downloading the target data set
 :The WCA Dataset was easily downloaded as a set of SQL inserts. The file can be downloaded from [http://worldcubeassociation.org/results/misc/export.html here].
--Data cleaning and pre-processing
+*Data cleaning and pre-processing
 :The issue was that the .sql file was in MS-SQL or OracleSQL, so some mass modifications to the file had to be made. Primarily it was with changing smallint(n) to int, and `tablename` without the `.
--Load the data into your Postgres instance
+*Load the data into your Postgres instance
 :It took a few times to get everything from the script all working, but the script was successfully run on my directory on BigFe.

Difference between revisions of "Tristan-big-data"

Revision as of 12:30, 3 December 2011

Contents

Question

Project Tasks

SQL Queries

Tech Details

Results and Discussion

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

websites

wiki

applied groups

Tools