https://wiki.cs.earlham.edu/index.php?title=Fk-vizscidat-notes&feed=atom&action=historyFk-vizscidat-notes - Revision history2024-03-28T12:59:24ZRevision history for this page on the wikiMediaWiki 1.32.1https://wiki.cs.earlham.edu/index.php?title=Fk-vizscidat-notes&diff=13366&oldid=prevAdmin: Created page with "== Course Schedule (DRAFT) == * Week 1 -- Visualization Basics # lab on data collection # begin work on course products ## guide -- do's and don'ts for good infographics ## trans..."2012-08-27T11:52:52Z<p>Created page with "== Course Schedule (DRAFT) == * Week 1 -- Visualization Basics # lab on data collection # begin work on course products ## guide -- do's and don'ts for good infographics ## trans..."</p>
<p><b>New page</b></p><div>== Course Schedule (DRAFT) ==<br />
* Week 1 -- Visualization Basics<br />
# lab on data collection<br />
# begin work on course products<br />
## guide -- do's and don'ts for good infographics<br />
## transferable vignettes<br />
## ??<br />
<br />
* Week 2 -- Visualization Basics<br />
# lab on turning reports into data into information<br />
# continue work on course products<br />
<br />
* Week 3 -- Exploratory Data Analysis<br />
# lab on EDA -- numerical and graphical summaries<br />
# continue work on course products<br />
<br />
* Week 4 -- Exploratory Data Analysis<br />
# lab on EDA<br />
# continue work on course products<br />
<br />
* Week 5 -- Visualization Tools (notice the links below)<br />
# Tools assignment -- low tech, high tech<br />
# continue work on course products<br />
<br />
* Week 6 -- Visualization Tools<br />
# Tools assignment -- critical reviews of existing visualizations<br />
# continue work on course products<br />
<br />
* Week 7 -- Visualization Tools<br />
# Tools assignment <br />
# continue work on course products<br />
<br />
* Week 8 -- Visualization Tools<br />
# Tools assignment<br />
# continue work on course products<br />
<br />
* Week 9 -- Projects<br />
# Projects assignment<br />
# continue work on course products<br />
<br />
* Week 10 -- Projects<br />
# Projects assignment -- documenting choices and assumptions<br />
# continue work on course products<br />
<br />
* Week 11 -- Projects<br />
# Projects assignment<br />
# continue work on course products<br />
<br />
* Week 12 -- Projects<br />
# Projects assignment<br />
# continue work on course products<br />
<br />
* Week 13 -- Projects<br />
# Projects assignment<br />
# continue work on course products<br />
<br />
* Week 14 -- Projects<br />
# Projects assignment<br />
# continue work on course products<br />
<br />
* Week 15 -- Projects<br />
# Projects presentation<br />
# complete work on course products<br />
<br />
== Short-term To Do List ==<br />
# Figure-out books for the library to purchase, probably put them on reserve through the fall (charlie)<br />
# Look at on-line courses in this area (mic)<br />
<br />
== Examples ==<br />
* Good and Bad Statistical Graphs -- http://www.datavis.ca/gallery/<br />
* Eurozone debt - http://www.bbc.co.uk/news/business-15748696<br />
* Wikileaks US embassy cables - http://datavisualization.ch/datasets/wikileaks-us-embassy-cables/<br />
* Stopping SOPA and PIPA - http://visual.ly/stop-sopa<br />
* Auto accident statistics in Britain - http://www.bbc.co.uk/news/magazine-16631597<br />
* A snapshot of the rapidly changing world of computing, communications and technology - http://www.nytimes.com/interactive/2011/12/06/science/1206-world.html?ref=science<br />
* Words by the millions - http://www.nytimes.com/2012/03/25/business/words-by-the-millions-sorted-by-software.html?_r=1&ref=technology<br />
* county health ratings - http://www.countyhealthrankings.org/app<br />
* live wind map - http://hint.fm/wind/index.html<br />
* Factual - http://www.nytimes.com/2012/03/25/business/factuals-gil-elbaz-wants-to-gather-the-data-universe.html?ref=technology<br />
* worldwide health data - http://www.youtube.com/watch?v=jbkSRLYSojo&feature=player_embedded<br />
* Obama's budget proposal - http://www.nytimes.com/interactive/2012/02/13/us/politics/2013-budget-proposal-graphic.html?emc=eta1<br />
* Interactive earthquake map - http://pnsn.org/tremor<br />
* http://visual.ly/education-vs-incarceration - and their tool for building vizs<br />
* shot analysis for NBA finals - http://www.nytimes.com/interactive/2012/06/11/sports/basketball/nba-shot-analysis.html<br />
* European debt -- http://www.aljazeera.com/indepth/interactive/2012/06/20126127221845926.html<br />
* Map of the Market (link behaves oddly, but you can get there) -- http://www.smartmoney.com/map-of-the-market/<br />
* Gallery of R Visualizations -- http://addictedtor.free.fr/graphiques/<br />
* nice quicktime example of the "starchart" Filmfinder -- http://hcil2.cs.umd.edu/video/1994/1994_visualinfo.mpg -- dated but very good<br />
* 2010 U.S. Election Visualizations -- http://www.csc.ncsu.edu/faculty/healey/US_election/<br />
* Gun-related deaths by US State -- http://www.aljazeera.com/indepth/interactive/2012/07/2012726141159587596.html<br />
* Minard's Map of French Wine -- http://en.wikipedia.org/wiki/File:Minard%E2%80%99s_map_of_French_wine_exports_for_1864.jpg#file<br />
* Minard's Map of Napoleon's Russian Invasion -- http://en.wikipedia.org/wiki/File:Minard.png#file<br />
* Krulwich - http://www.npr.org/blogs/krulwich/2012/03/21/149095154/mirror-mirror-on-the-wall-do-the-data-tell-it-all?sc=fb&cc=fp<br />
* Defections of Syrian Leaders -- http://www.aljazeera.com/indepth/interactive/syriadefections/2012730840348158.html<br />
<br />
== Press ==<br />
NPR did a couple of interesting segments on Big Data, visualizations, and the search of mathematicians and others who can do that stuff. (December, 2011)<br />
* Part 1 - http://www.npr.org/2011/11/29/142521910/the-digital-breadcrumbs-that-lead-to-big-data?ps=rs<br />
* Part 2 - http://www.npr.org/2011/11/30/142893065/the-search-for-analysts-to-make-sense-of-big-data<br />
<br />
New York Times article from December, 2011 on bioinformatics and visualization, MicJ<br />
<br />
== Other ==<br />
* http://www.r-bloggers.com/how-the-new-york-times-uses-r-for-data-visualization/<br />
* At some point nyt.com supported a "viz lab" where people could use their data sets to build their own visualizations. I can't find a current reference to this now. 20 January 2012<br />
* IBM's Many Eyes - <br />
* http://www.cc.gatech.edu/~stasko/7450/syllabus.html<br />
<br />
== Presentations ==<br />
* David McCandless: The beauty of data visualizations (TED) - http://www.ted.com/talks/david_mccandless_the_beauty_of_data_visualization.html<br />
** Military spending - http://www.guardian.co.uk/news/datablog/2010/apr/01/information-is-beautiful-military-spending<br />
* What we learned from 5 million books (TED) - http://www.ted.com/talks/what_we_learned_from_5_million_books.html<br />
** Google's ngram interface: http://books.google.com/ngrams/<br />
* Baby names -- NameVoyager (http://www.babynamewizard.com/voyager)<br />
* Wordle (http://www.wordle.net/ )<br />
* Raw Milk Laws in the US (http://farmtoconsumer.org/raw_milk_map.htm)<br />
* International Milk Production (http://chartsbin.com/view/1492)<br />
* Perception in Visualization -- http://www.csc.ncsu.edu/faculty/healey/PP/<br />
<br />
== Keywords ==<br />
* infographics<br />
* Big data<br />
* work flow(s)<br />
<br />
== The People ==<br />
* Mic Jackson, Mathematics & Environmental Science<br />
* Charlie Peck, Computer Science<br />
<br />
# Diana Ainembabazi<br />
# Ivan Babic<br />
# Leif DeJong<br />
# Ryan Lake<br />
# Mobeen Ludin<br />
# Emily Pavlovic<br />
# Mikel Qafa<br />
# Alex Reid<br />
# Elena Sergienko<br />
# Tristan Wright<br />
<br />
== Tools ==<br />
* GPlates - plate tectonics visualizations, multi-platform (http://www.gplates.org/)<br />
* open source visualization toolkits<br />
** Prefuse ( http://prefuse.org/ ), <br />
** Flare ( http://flare.prefuse.org/ )<br />
** Protovis ( http://vis.stanford.edu/protovis/ )<br />
<br />
* groundbreaking visualization projects<br />
** Many Eyes ( http://www.many-eyes.com )<br />
** IBM Visualization and Behavior Group (http://researcher.watson.ibm.com/researcher/view_project.php?id=3419)<br />
<br />
* a review of Tableau software (http://infosthetics.com/archives/2010/06/social_visualization_software_review_tableau_public.html)<br />
* another (http://bitools.org/tableau-software/)<br />
* a Tableau competitor (http://www.inetsoft.com/info/alternative_to_tableau_visualization_dashboards/?utm_vendor=google&utm_source=northamerica&utm_campaign=visual&utm_medium=search&utm_content=12577228682&utm_term=tableau%20software%20review&gclid=CKPZmvbyoLECFQ8CQAody2v2bg)<br />
* Polaris interactive database visualization (http://www.graphics.stanford.edu/projects/polaris/)<br />
* Spotfire (http://www.cs.umd.edu/hcil/spotfire/)<br />
<br />
== Topics ==<br />
# Long-term turtle size, sex, age, climate by year from Western Nebraska (JohnI)<br />
#* Von Bertalanthy (sp) growth model, special case of Fisher models? <br />
# Long-term iguana size, sex, age, climate (8 years only) from Bahamas (Exumas island) (JohnI)<br />
#* Von Bertalanthy (sp) growth model, special case of Fisher models? <br />
# Why do turtles lay the number, size, type and frequency of eggs that they do?<br />
#* What are the common patterns?<br />
#* Which dimensions aren't accounted for? <br />
#** Latitude and longitude? <br />
#** Habitat? <br />
#** Phylogeny?<br />
#** Climate?<br />
#** What other data sets are available?<br />
# How to distinguish between variations within a species vs different species <br />
#* Standardized morphometric data (AOT moristic data, e.g. counts of number of scales between body parts), size standardized<br />
#* Currently using multivariate statistics, about 25 variables<br />
#* Looking for one image with all populations and variables<br />
#* Looking for structure <br />
# Phylogenetic reconstruction, visualizing trees with multiple models (JohnI)<br />
<br />
== Techniques ==<br />
# Principle component analysis<br />
# Discriminate function analysis<br />
# Data conditioning and translation, CSV and XML<br />
# Gridded and non-gridded data <br />
# Ideas that Michael suggested<br />
<br />
== Sources ==<br />
# Mic's books<br />
# Charlie's books <br />
# Dave's viz workshop at Kean<br />
# Web sources<br />
* The Organisation for Economic Co-operation and Development (OECD) statistics -- http://www.oecd.org/statistics/<br />
<br />
== Schedule ==<br />
* Looking for 2-3 hours of meeting time, possibly one shorter and one longer<br />
* Noon on Monday, Thursday, or Friday<br />
* 4p-7p Monday, Wednesday, Thursday, Friday (modulo sport practice)<br />
<br />
== The Plan ==<br />
1) Planning items <br />
* Are there any field trip opportunities?<br />
* Figure-out what books to order<br />
* Figure-out what are the likely conference opportunities?<br />
* Are there any other tools besides R that we should be considering? <br />
** GRASS?<br />
** <br />
<br />
2) Things to learn<br />
* Is there a somewhat canonical process or technique that one can reliably apply to go from readings -> data -> information? At which stage(s) is/are a visualization helpful?<br />
* How to utilize geocoding attributes?<br />
* How to utilize timestamp attributes?<br />
<br />
3) Things to read <br />
* <br />
<br />
4) Things to do during the class<br />
* <br />
<br />
5) Questions<br />
* Which parts of statistics do people need to know? <br />
** correlation for PCA <br />
* What linear algebra do people need to know?<br />
** matrix operations for PCA<br />
<br />
6) Tools<br />
* R under Linux/OSX<br />
<br />
7) Possible sources for data sets<br />
* John Iverson<br />
** turtle birthing data<br />
** phylogenetic reconstruction <br />
* Mike Deibel<br />
* Kathy Milar<br />
* Meg Streepy<br />
** GPlates - visualizing plate tectonics</div>Admin