CS382:Unit-mashup

From Earlham CS Department
Revision as of 10:50, 18 February 2009 by Edlefma (talk | contribs)
Jump to navigation Jump to search

Data Visualization with Mashups

Background reading, Resources

Lecture notes - outline form

Introduction

  • Models and other datasets describing the world are very large and/or complex.
    • Example the US Censes is X rows and Y columns
  • In most cases with datasets this large you can't just stare at the raw data and get a feel for what it's saying.
  • Even to develop statistical methods to get useful information you still need a notion of what you're looking for.
  • Visualization is a way we can get a look at general trends or anomalies in an intuitive way.
  • Visualization works better with larger data sets that you can clump.

Types of visualizations

  • Overlays (geographical) - example
  • Semantic webs - example
  • Geometrical ( graphical environments where the size/shape/movement/etc of objects is tied to data ) - example
  • more

Process

  • Given a certain problem or question, determine what general catagories of information are needed.
    • What body of information is needed.
  • Data collection
    • finding good sources of data
    • methods of collecting ( online databases, field work, etc... )
    • coordinating multiple data sources
  • Determaning what tools/type of visualization is most appropriate
  • Encoding data to be useful (KML,etc..)
  • Drawing general conclusions from the visualization
  • Use more exact methods like statistics to show truthiness.

Classroom response questions - at least three

Lab activity - materials, process and software

Buliding a google mashup/KML document tying 2 or more datasets together. Datasets will be provided but each group would have distinct information to use. Tools for retrieving and inputting data would be provided but the students would still have to learn KML and interacting with Google Earth/maps. The databases are prefereably something local/personal that can provide interesting results when visualized geographically.

Possible Data sources:

Scheduling - early, late, dependencies on other units, length of unit

Timing

Doesn't matter

Length

One week.

Archived stuff