Difference between revisions of "CS382:Scraper Recipes"

From Earlham CS Department
Jump to navigation Jump to search
(New page: =Scraper Recipes= == Getting ready == # get the code. bzr branch ~edlefma/mwscraper)
 
(Getting ready)
Line 1: Line 1:
 
=Scraper Recipes=
 
=Scraper Recipes=
  
== Getting ready ==
+
== Get the code ==
# get the code.
 
 
  bzr branch ~edlefma/mwscraper
 
  bzr branch ~edlefma/mwscraper
 +
 +
== Pick a templte ==
 +
Right now our templates are software.tt, to generate a software dependency page, geneds.tt, and scraper.tt, which creates its own doc page.
 +
 +
== Run & Check results ==
 +
The tool can either print its new wiki pages to STDOUT or upload them to the wiki. Check your results by running mwscraper without --upload:
 +
./mwscraper --host https://wiki.cs.earlham.edu --login --username <user> --password <pass> <template>
 +
 +
== Run and upload changes ==
 +
Now that we know we want to make this change, run again with --upload.
 +
./mwscraper --host https://wiki.cs.earlham.edu --login --username <user> --password <pass> --upload <template>

Revision as of 07:01, 10 April 2009

Scraper Recipes

Get the code

bzr branch ~edlefma/mwscraper

Pick a templte

Right now our templates are software.tt, to generate a software dependency page, geneds.tt, and scraper.tt, which creates its own doc page.

Run & Check results

The tool can either print its new wiki pages to STDOUT or upload them to the wiki. Check your results by running mwscraper without --upload:

./mwscraper --host https://wiki.cs.earlham.edu --login --username <user> --password <pass> <template>

Run and upload changes

Now that we know we want to make this change, run again with --upload.

./mwscraper --host https://wiki.cs.earlham.edu --login --username <user> --password <pass> --upload <template>