Difference between revisions of "Cluster: New BobSCEd Install Log"
Jump to navigation
Jump to search
(→Cloning) |
(→Cloning) |
||
Line 11: | Line 11: | ||
*** Enable tftpd in <code>/etc/inetd.conf</code> by removing the comments and restart inetd with <code>/etc/rc.d/inetd restart</code> | *** Enable tftpd in <code>/etc/inetd.conf</code> by removing the comments and restart inetd with <code>/etc/rc.d/inetd restart</code> | ||
*** The following lines were already in <code>/usr/local/etc/dhcpd.conf: allow booting; allow bootp;</code>, put the filename in the particular group (see [http://debianclusters.cs.uni.edu/index.php/Udpcast_Cloning:_Preparing_the_DHCP_Server_for_PXE_Booting Debian Clusters]) | *** The following lines were already in <code>/usr/local/etc/dhcpd.conf: allow booting; allow bootp;</code>, put the filename in the particular group (see [http://debianclusters.cs.uni.edu/index.php/Udpcast_Cloning:_Preparing_the_DHCP_Server_for_PXE_Booting Debian Clusters]) | ||
− | *** <code>cp /usr/local/share/syslinux/pxelinux.0 / | + | *** <code>cp /usr/local/share/syslinux/pxelinux.0 /tftpboot/</code> (the /tftpboot directory needs to be created) |
− | *** Download linux and | + | *** Download linux, initrd, and default from [http://udpcast.linux.lu/bootloader.html#pxe the udpcast site] into /tftpboot |
+ | *** Move default into /tftpboot/pxelinux.cfg | ||
+ | *** Restart dhcpd (<code>killall -KILL dhcpd</code> and <code>/usr/local/sbin/dhcpd -q -cf /usr/local/etc/dhcpd.conf -lf /var/db/dhcpd/dhcpd.leases -pf /var/run/dhcpd/dhcpd.pid -user dhcpd -group dhcpd</code> | ||
=== Head Node === | === Head Node === |
Revision as of 15:22, 26 August 2009
Contents
Scratch Space
Log
Green color indicates something that still needs to be done.
Cloning
- Download the udpcast rpm from http://udpcast.linux.lu/source.html
- Install with
yum --nogpgcheck localinstall udpcast-20081213-1.i386.rpm
- On hopper, installed the syslinux and tftpd-hpa ports
- Enable tftpd in
/etc/inetd.conf
by removing the comments and restart inetd with/etc/rc.d/inetd restart
- The following lines were already in
/usr/local/etc/dhcpd.conf: allow booting; allow bootp;
, put the filename in the particular group (see Debian Clusters) cp /usr/local/share/syslinux/pxelinux.0 /tftpboot/
(the /tftpboot directory needs to be created)- Download linux, initrd, and default from the udpcast site into /tftpboot
- Move default into /tftpboot/pxelinux.cfg
- Restart dhcpd (
killall -KILL dhcpd
and/usr/local/sbin/dhcpd -q -cf /usr/local/etc/dhcpd.conf -lf /var/db/dhcpd/dhcpd.leases -pf /var/run/dhcpd/dhcpd.pid -user dhcpd -group dhcpd
- Enable tftpd in
- Install with
Head Node
Yum installed:
- gcc.x86_64, gcc-c++.x86_64
- for Ganglia:
- apr.x86_64 and apr-devel.x86_64
- libconfuse-2.5-4.el5.x86_64.rpm, libconfuse-devel-2.5-4.el5.x86_64.rpm (from Fedora repositories)
- expat-devel.x86_64
- for Intel updates:
- compat-libstdc++-33.i386
Install C3 tools from http://www.csm.ornl.gov/torc/C3/C3softwarepage.shtml
- Downloaded full install rpm on bs0, installed with
yum --nogpgcheck localinstall c3-4.0.1-1.noarch.rpm
- See C3 Tools README and C3 Tools INSTALL
- Put root's keys in the home directory and authorized itself, then copied that to the worker node image
Ganglia
- On hopper, added the data_source line for bs0 to
/usr/local/etc/gmetad.conf
and restarted it with/usr/local/etc/rc.d/gmetad restart
- Downloaded tar ball from http://sourceforge.net/projects/ganglia/
- See Ganglia README
./configure --prefix=/cluster
- The head node uses a different Ganglia gmond.conf in /etc/ganglia/gmond.conf and the workers just have theirs symlinked to /cluster/etc/gmond.conf
- By default, iptables is running on the CentOS install and blocks hopper's Ganglia requests
- Turned off by clearing it and then running
/sbin/service iptables save
- Turned off by clearing it and then running
Networking
- Shorewall, see
/etc/shorewall/params
for almost all of the important definitions- Natting is done through
/etc/shorewall/masq
- Natting is done through
- DHCP relay, added to boot with
chkconfig on
, set for hopper (installed as part of dhcp yum package)- See
/etc/sysconfig/dhcrelay
- This means that a dhcp server is also installed, but it is not set to run and is not configured, either
- Hopper needs to have a static route added in order to have the responses return, these are in
/etc/rc.conf
:
- See
static_routes="bs0"
route_bs0="192.168.0.1 159.28.234.200"
Modules
- Installed environment-modules from http://download.fedora.redhat.com/pub/epel/5/x86_64/repoview/environment-modules.html
- Important directories:
/usr/share/Modules/
- Important directories:
Torque
- Installed from source with
./configure --with-default-server=bs0.bobsced.loc --with-rc=scp --disable-mom --with-server-home=/var/spool/pbs
("clients" is what installs qmgr) - Installs to /usr/local/
- Set up according to Debian Clusters setup
- Reran the ./configure but without --disable-moms, then ran
make packages
, copied this to worker node
Maui
- installed Maui according to same link as above
Intel Firmware Updates