Cluster: New BobSCEd Install Log
Jump to navigation
Jump to search
Scratch Space
Log
Green color indicates something that still needs to be done.
Head Node
Yum installed:
- gcc.x86_64, gcc-c++.x86_64
- for Ganglia:
- apr.x86_64 and apr-devel.x86_64
- libconfuse-2.5-4.el5.x86_64.rpm, libconfuse-devel-2.5-4.el5.x86_64.rpm (from Fedora repositories)
- expat-devel.x86_64
Install C3 tools from http://www.csm.ornl.gov/torc/C3/C3softwarepage.shtml
- Downloaded full install rpm on bs0, installed with
yum --nogpgcheck localinstall c3-4.0.1-1.noarch.rpm
- See C3 Tools README and C3 Tools INSTALL
- Put root's keys in the home directory and authorized itself, then copied that to the worker node image
Ganglia
- On hopper, added the data_source line for bs0 to
/usr/local/etc/gmetad.conf
and restarted it with/usr/local/etc/rc.d/gmetad restart
- Downloaded tar ball from http://sourceforge.net/projects/ganglia/
- See Ganglia README
./configure --prefix=/cluster
- The head node uses a different Ganglia gmond.conf in /etc/ganglia/gmond.conf and the workers just have theirs symlinked to /cluster/etc/gmond.conf
- By default, iptables is running on the CentOS install and blocks hopper's Ganglia requests
- Turned off by clearing it and then running
/sbin/service iptables save
- Turned off by clearing it and then running
Networking
- Shorewall, see
/etc/shorewall/params
for almost all of the important definitions- Natting is done through
/etc/shorewall/masq
- Natting is done through
- DHCP relay, added to boot with
chkconfig on
, set for hopper (installed as part of dhcp yum package)- See
/etc/sysconfig/dhcrelay
- This means that a dhcp server is also installed, but it is not set to run and is not configured, either
- Hopper needs to have a static route added in order to have the responses return, these are in
/etc/rc.conf
:
- See
static_routes="bs0"
route_bs0="192.168.0.1 159.28.234.200"
Modules
- Installed environment-modules from http://download.fedora.redhat.com/pub/epel/5/x86_64/repoview/environment-modules.html
- Important directories:
/usr/share/Modules/
- Important directories:
Torque
- Installed from source with
./configure --with-default-server=bs0.bobsced.loc --with-rc=scp --disable-mom --with-server-home=/var/spool/pbs
("clients" is what installs qmgr) - Installs to /usr/local/
- Set up according to [http://debianclusters.cs.uni.edu/index.php/Resource_Manager:_Torque Debian Clusters setup
- Reran the ./configure but without --disable-moms, then ran
make packages
, copied this to worker node