Difference between revisions of "Cluster:Todo"

From Earlham CS Department

Revision as of 00:04, 15 June 2005

(Need a notation for relative priority. Please don't delete anything unless we're updating this during a meeting.)


Plumbing

  • Get PBS working on bazaar, cairo, athena, and ACL (Skylar)
  • Set up WeatherDuck on hopper (Skylar)
    • Problem with some serial ports
  • Set up Amanda (Skylar)
  • Copy /cluster/old-hopper to tape, give it to Charlie (Skylar)
  • Fiber uplink for bazaar (Charlie)
  • Network lag, monitoring?
  • Put Athena in the display cabinet
  • Set up Povray on Athena
  • Protect F@C source and molecular systems, open http, ftp?, ssh? at cluster.earlham.edu
  • Update speedup and speedup/efficiency within DVT for endnodes (Alex)

LittleFe

Folding@Clusters

  • Work with Betsy Ward to get the plumbing for F@C set up on the D224 OSX machines. Use a local user, and document the setup with a Wiki entry. (Alex)
  • Console (JoshM)
    • Environment variable called $FATCHOME
    • Command line
    • Sockets to communicate with mother
    • Variable in mother.conf for console port
    • Mother listens and responds to commands on the console port
    • Command list: status [(running|paused|stopped), molecular system, x out of y steps completed, estimated time remaining, # of nodes started, # of nodes current], checkpoint, pause, resume, stop.
    • Command line option -nn interval for compact, refreshed display.
    • First version of console has to be supplied with a hostname and port number.
    • Future versions (possibly when we introduce the grandmother) can take a $FATCHOME environment variable that points to a mother.conf file (to get a port number) as a discovery mechanism.
  • Develop test canon (Alex and Charlie)
  • Document pval_report.pl and compare_walltime.pl in Wiki (headings for each are already under HowTos) (Alex)
  • Supervise test runs, non-nfs, a2.7, all molecules, 1-4 nodes, bazaar and cairo, separate table (Alex)
  • Rerun the following configurations and compare nfs/non-nfs (Alex)
    • bazaar proteasome
    • bazaar villin-urea
    • cairo methanol 1-8 nodes
    • cairo mixed
    • cairo proteasome
    • cairo water 1-8 nodes
    • bazaar water
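
The Console items above (a hostname and port supplied by hand, a socket to mother, and the commands status, checkpoint, pause, resume, stop) could be sketched roughly as follows. This is only a minimal sketch, not the F@C implementation: the notes do not specify a wire format, so the newline-terminated plain-text framing, the one-line reply, and the names send_command and serve_once are all assumptions made for illustration. serve_once is a stand-in for mother so the client can be tried locally.

```python
import socket
import threading

# Command names come from the Console list above; everything about how they
# travel over the socket is an assumption of this sketch.
COMMANDS = ("status", "checkpoint", "pause", "resume", "stop")


def send_command(host, port, command, timeout=5.0):
    """Send one command to mother's console port and return the reply line.

    Assumed (hypothetical) protocol: ASCII text, one newline-terminated
    command per connection, answered by one newline-terminated line.
    """
    if command not in COMMANDS:
        raise ValueError(f"unknown console command: {command!r}")
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(command.encode("ascii") + b"\n")
        chunks = []
        while True:
            data = sock.recv(4096)
            if not data:          # peer closed the connection
                break
            chunks.append(data)
            if b"\n" in data:     # end of the single reply line
                break
    return b"".join(chunks).decode("ascii").split("\n", 1)[0]


def serve_once(server_sock, reply):
    """Hypothetical stand-in for mother: answer one connection with `reply`."""
    conn, _ = server_sock.accept()
    with conn:
        conn.recv(4096)  # read (and ignore) the command line
        conn.sendall(reply.encode("ascii") + b"\n")


if __name__ == "__main__":
    # Local demonstration only: bind a throwaway port and query it once.
    server = socket.create_server(("127.0.0.1", 0))
    port = server.getsockname()[1]
    threading.Thread(
        target=serve_once, args=(server, "running"), daemon=True
    ).start()
    print(send_command("127.0.0.1", port, "status"))
    server.close()
```

A compact, refreshed display like the one the -nn interval option describes could be built by calling send_command(host, port, "status") in a loop and sleeping for the interval between calls. Reading the port from mother.conf via $FATCHOME is left out here because the file's format is not given in these notes.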

Curriculum Modules

  • Producing a cluster/distro specific set of modules out of one base unit
  • Generating a wiki entry and repository entry from one base unit

Recompute