Fk-vizscidat-notes - Revision history
https://wiki.cs.earlham.edu/index.php?title=Fk-vizscidat-notes&diff=13366&oldid=prev
Admin: Created page, 2012-08-27T11:52:52Z
<p><b>New page</b></p><div>== Course Schedule (DRAFT) ==<br />
* Week 1 -- Visualization Basics<br />
# lab on data collection<br />
# begin work on course products<br />
## guide -- do's and don'ts for good infographics<br />
## transferable vignettes<br />
## ??<br />
<br />
* Week 2 -- Visualization Basics<br />
# lab on turning reports into data into information<br />
# continue work on course products<br />
<br />
* Week 3 -- Exploratory Data Analysis<br />
# lab on EDA -- numerical and graphical summaries<br />
# continue work on course products<br />
<br />
* Week 4 -- Exploratory Data Analysis<br />
# lab on EDA<br />
# continue work on course products<br />
<br />
* Week 5 -- Visualization Tools (notice the links below)<br />
# Tools assignment -- low tech, high tech<br />
# continue work on course products<br />
<br />
* Week 6 -- Visualization Tools<br />
# Tools assignment -- critical reviews of existing visualizations<br />
# continue work on course products<br />
<br />
* Week 7 -- Visualization Tools<br />
# Tools assignment<br />
# continue work on course products<br />
<br />
* Week 8 -- Visualization Tools<br />
# Tools assignment<br />
# continue work on course products<br />
<br />
* Week 9 -- Projects<br />
# Projects assignment<br />
# continue work on course products<br />
<br />
* Week 10 -- Projects<br />
# Projects assignment -- documenting choices and assumptions<br />
# continue work on course products<br />
<br />
* Week 11 -- Projects<br />
# Projects assignment<br />
# continue work on course products<br />
<br />
* Week 12 -- Projects<br />
# Projects assignment<br />
# continue work on course products<br />
<br />
* Week 13 -- Projects<br />
# Projects assignment<br />
# continue work on course products<br />
<br />
* Week 14 -- Projects<br />
# Projects assignment<br />
# continue work on course products<br />
<br />
* Week 15 -- Projects<br />
# Projects presentation<br />
# complete work on course products<br />
<br />
== Short-term To Do List ==<br />
# Figure out which books the library should purchase, and probably put them on reserve through the fall (Charlie)<br />
# Look at online courses in this area (Mic)<br />
<br />
== Examples ==<br />
* Good and Bad Statistical Graphs -- http://www.datavis.ca/gallery/<br />
* Eurozone debt - http://www.bbc.co.uk/news/business-15748696<br />
* Wikileaks US embassy cables - http://datavisualization.ch/datasets/wikileaks-us-embassy-cables/<br />
* Stopping SOPA and PIPA - http://visual.ly/stop-sopa<br />
* Auto accident statistics in Britain - http://www.bbc.co.uk/news/magazine-16631597<br />
* A snapshot of the rapidly changing world of computing, communications and technology - http://www.nytimes.com/interactive/2011/12/06/science/1206-world.html?ref=science<br />
* Words by the millions - http://www.nytimes.com/2012/03/25/business/words-by-the-millions-sorted-by-software.html?_r=1&amp;ref=technology<br />
* County health rankings - http://www.countyhealthrankings.org/app<br />
* Live wind map - http://hint.fm/wind/index.html<br />
* Factual - http://www.nytimes.com/2012/03/25/business/factuals-gil-elbaz-wants-to-gather-the-data-universe.html?ref=technology<br />
* Worldwide health data -
http://www.youtube.com/watch?v=jbkSRLYSojo&amp;feature=player_embedded<br />
* Obama's budget proposal - http://www.nytimes.com/interactive/2012/02/13/us/politics/2013-budget-proposal-graphic.html?emc=eta1<br />
* Interactive earthquake map - http://pnsn.org/tremor<br />
* http://visual.ly/education-vs-incarceration - and their tool for building vizs<br />
* Shot analysis for NBA finals - http://www.nytimes.com/interactive/2012/06/11/sports/basketball/nba-shot-analysis.html<br />
* European debt -- http://www.aljazeera.com/indepth/interactive/2012/06/20126127221845926.html<br />
* Map of the Market (link behaves oddly, but you can get there) -- http://www.smartmoney.com/map-of-the-market/<br />
* Gallery of R Visualizations -- http://addictedtor.free.fr/graphiques/<br />
* Nice QuickTime example of the &quot;starchart&quot; Filmfinder -- http://hcil2.cs.umd.edu/video/1994/1994_visualinfo.mpg -- dated but very good<br />
* 2010 U.S. Election Visualizations -- http://www.csc.ncsu.edu/faculty/healey/US_election/<br />
* Gun-related deaths by US State -- http://www.aljazeera.com/indepth/interactive/2012/07/2012726141159587596.html<br />
* Minard's Map of French Wine -- http://en.wikipedia.org/wiki/File:Minard%E2%80%99s_map_of_French_wine_exports_for_1864.jpg#file<br />
* Minard's Map of Napoleon's Russian Invasion -- http://en.wikipedia.org/wiki/File:Minard.png#file<br />
* Krulwich - http://www.npr.org/blogs/krulwich/2012/03/21/149095154/mirror-mirror-on-the-wall-do-the-data-tell-it-all?sc=fb&amp;cc=fp<br />
* Defections of Syrian Leaders -- http://www.aljazeera.com/indepth/interactive/syriadefections/2012730840348158.html<br />
<br />
== Press ==<br />
NPR did a couple of interesting segments on Big Data, visualizations, and the search for mathematicians and others who can do that stuff.
(December, 2011)<br />
* Part 1 - http://www.npr.org/2011/11/29/142521910/the-digital-breadcrumbs-that-lead-to-big-data?ps=rs<br />
* Part 2 - http://www.npr.org/2011/11/30/142893065/the-search-for-analysts-to-make-sense-of-big-data<br />
<br />
New York Times article from December, 2011 on bioinformatics and visualization, MicJ<br />
<br />
== Other ==<br />
* http://www.r-bloggers.com/how-the-new-york-times-uses-r-for-data-visualization/<br />
* At some point nyt.com supported a &quot;viz lab&quot; where people could use its data sets to build their own visualizations. I can't find a current reference to it (as of 20 January 2012).<br />
* IBM's Many Eyes - <br />
* http://www.cc.gatech.edu/~stasko/7450/syllabus.html<br />
<br />
== Presentations ==<br />
* David McCandless: The beauty of data visualization (TED) - http://www.ted.com/talks/david_mccandless_the_beauty_of_data_visualization.html<br />
** Military spending - http://www.guardian.co.uk/news/datablog/2010/apr/01/information-is-beautiful-military-spending<br />
* What we learned from 5 million books (TED) - http://www.ted.com/talks/what_we_learned_from_5_million_books.html<br />
** Google's ngram interface: http://books.google.com/ngrams/<br />
* Baby names -- NameVoyager (http://www.babynamewizard.com/voyager)<br />
* Wordle (http://www.wordle.net/)<br />
* Raw Milk Laws in the US (http://farmtoconsumer.org/raw_milk_map.htm)<br />
* International Milk Production (http://chartsbin.com/view/1492)<br />
* Perception in Visualization -- http://www.csc.ncsu.edu/faculty/healey/PP/<br />
<br />
== Keywords ==<br />
* infographics<br />
* Big data<br />
* workflow(s)<br />
<br />
== The People ==<br />
* Mic Jackson, Mathematics &amp; Environmental Science<br />
* Charlie Peck, Computer Science<br />
<br />
# Diana Ainembabazi<br />
# Ivan Babic<br />
# Leif DeJong<br />
# Ryan Lake<br />
# Mobeen Ludin<br />
# Emily Pavlovic<br />
# Mikel Qafa<br />
# Alex Reid<br />
# Elena Sergienko<br />
# Tristan
Wright<br />
<br />
== Tools ==<br />
* GPlates - plate tectonics visualizations, multi-platform (http://www.gplates.org/)<br />
* open source visualization toolkits<br />
** Prefuse (http://prefuse.org/)<br />
** Flare (http://flare.prefuse.org/)<br />
** Protovis (http://vis.stanford.edu/protovis/)<br />
<br />
* groundbreaking visualization projects<br />
** Many Eyes (http://www.many-eyes.com)<br />
** IBM Visualization and Behavior Group (http://researcher.watson.ibm.com/researcher/view_project.php?id=3419)<br />
<br />
* a review of Tableau software (http://infosthetics.com/archives/2010/06/social_visualization_software_review_tableau_public.html)<br />
* another (http://bitools.org/tableau-software/)<br />
* a Tableau competitor (http://www.inetsoft.com/info/alternative_to_tableau_visualization_dashboards/?utm_vendor=google&amp;utm_source=northamerica&amp;utm_campaign=visual&amp;utm_medium=search&amp;utm_content=12577228682&amp;utm_term=tableau%20software%20review&amp;gclid=CKPZmvbyoLECFQ8CQAody2v2bg)<br />
* Polaris interactive database visualization (http://www.graphics.stanford.edu/projects/polaris/)<br />
* Spotfire (http://www.cs.umd.edu/hcil/spotfire/)<br />
<br />
== Topics ==<br />
# Long-term turtle size, sex, age, climate by year from Western Nebraska (JohnI)<br />
#* von Bertalanffy growth model, a special case of Fisher models?<br />
# Long-term iguana size, sex, age, climate (8 years only) from the Bahamas (Exumas island) (JohnI)<br />
#* von Bertalanffy growth model, a special case of Fisher models?<br />
# Why do turtles lay the number, size, type and frequency of eggs that they do?<br />
#* What are the common patterns?<br />
#* Which dimensions aren't accounted for?<br />
#** Latitude and longitude?<br />
#** Habitat?
<br />
#** Phylogeny?<br />
#** Climate?<br />
#** What other data sets are available?<br />
# How to distinguish between variations within a species vs different species<br />
#* Standardized morphometric data (as opposed to meristic data, e.g. counts of the number of scales between body parts), size-standardized<br />
#* Currently using multivariate statistics, about 25 variables<br />
#* Looking for one image with all populations and variables<br />
#* Looking for structure<br />
# Phylogenetic reconstruction, visualizing trees with multiple models (JohnI)<br />
<br />
== Techniques ==<br />
# Principal component analysis<br />
# Discriminant function analysis<br />
# Data conditioning and translation, CSV and XML<br />
# Gridded and non-gridded data<br />
# Ideas that Michael suggested<br />
<br />
== Sources ==<br />
# Mic's books<br />
# Charlie's books<br />
# Dave's viz workshop at Kean<br />
# Web sources<br />
* The Organisation for Economic Co-operation and Development (OECD) statistics -- http://www.oecd.org/statistics/<br />
<br />
== Schedule ==<br />
* Looking for 2-3 hours of meeting time, possibly one shorter and one longer<br />
* Noon on Monday, Thursday, or Friday<br />
* 4p-7p Monday, Wednesday, Thursday, Friday (modulo sport practice)<br />
<br />
== The Plan ==<br />
1) Planning items<br />
* Are there any field trip opportunities?<br />
* Figure out which books to order<br />
* Figure out the likely conference opportunities<br />
* Are there any other tools besides R that we should be considering?<br />
** GRASS?<br />
** <br />
<br />
2) Things to learn<br />
* Is there a somewhat canonical process or technique that one can reliably apply to go from readings -&gt; data -&gt; information?
At which stage(s) is a visualization helpful?<br />
* How to utilize geocoding attributes?<br />
* How to utilize timestamp attributes?<br />
<br />
3) Things to read<br />
* <br />
<br />
4) Things to do during the class<br />
* <br />
<br />
5) Questions<br />
* Which parts of statistics do people need to know?<br />
** correlation for PCA<br />
* What linear algebra do people need to know?<br />
** matrix operations for PCA<br />
<br />
6) Tools<br />
* R under Linux/OSX<br />
<br />
7) Possible sources for data sets<br />
* John Iverson<br />
** turtle birthing data<br />
** phylogenetic reconstruction<br />
* Mike Deibel<br />
* Kathy Milar<br />
* Meg Streepy<br />
** GPlates - visualizing plate tectonics</div>