SE2006:group bar:todo

From Earlham CS Department
Revision as of 10:26, 30 March 2006 by Lemanal (talk | contribs)
Jump to navigation Jump to search
  • 5 data sources
  • scrAPI:
    • proper javadoc
    • testing suite
    • figure out group skipping
      • to skip a group, put a '?' as the first character in that group, e.g., (?.+)
      • see this awesome regexp tutorial (specific to Python but talks about Perl as a baseline too)
    • exception handling
      • (asserts, etc.)
    • auto (periodic) fetching of data
      • make ScrAPI run as a daemon?
    • single place for var name to sql type - done
    • thread support (scrape multiple sources simultaneously)
  • database
  • geocoding