Mrbayes-software

From Earlham CS Department
Jump to navigation Jump to search

The CCG supports MrBayes on the Cairo (PowerPC) and BobSCEd (quad core Xeon) clusters. The software is installed in /cluster/{cairo, bobsced}/bin/mb. Both versions are built with the Message Passing Interface (MPI) libraries enabling parallel runs using up to 4 cores. Scaling beyond 4 tasks is limited by the software's architecture. One way around this limitation is to break your large runs into groups of 4.

Both versions of MrBayes, Cairo and BobSCEd, are built with the LAM MPI libraries. While this improves the performance of MrBayes somewhat over other MPI bindings it does make the running of parallel jobs slightly more complicated.

The difference between LAM and other MPI bindings is that you need to start the LAM world before running the parallel program, and then shut it down when the parallel program is complete. The simplest way to accomplish this is with a small shell script.

Cairo

Copy and paste these commands into a file, replacing the name of the MrBayes input file with your own (Hybrid_Bayesian.txt in this example). The vi and emacs editors are both available on the Cairo nodes.

!#/bin/bash
lamwipe # stops LAM (if it's running) lamboot -v lambhosts.def # starts LAM with the hosts listed in lambhosts.def
mpirun -np 4 /cluster/cairo/bin/mb Hybrid_Bayesian.txt < /dev/null > Hybrid_Baye sian.log# starts mb on four cores
lamwipe # stops LAM

You will also need a lam-hosts.def file listing the cluster nodes you want to run your jobs on. Remember that the node you start the job on (c4 in this example) must be listed in the lam-hosts.def file.

c4 cpu=2
c5 cpu=2

Once these two files and your MrBayes input file are ready you can run job in the background with the command:

c4$ nohup ./this-run.sh &

There are sample scripts and MrBayes input files in /cluster/home/charliep/mb-test-cairo. Note near the end of the MrBayes input file Hybrid_Bayesian.txt, the mcmcp command has an option for nchains, this should be set to 4 in your input file per the information above.

You can use the top command to see if your program is running. 'q' exits top. The data set and analysis described in Hybrid_Bayesian.txt takes about N days to run on 4 Cairo cores.

BobSCEd

N.B. Currently MrBayes is only available on Cairo, the BobSCEd version is being upgraded.