Difference between revisions of "Cluster: New BobSCEd Install Log"

From Earlham CS Department
Jump to navigation Jump to search
(Cloning)
(Cloning)
Line 9: Line 9:
 
** Install with <code>yum --nogpgcheck localinstall udpcast-20081213-1.i386.rpm</code>
 
** Install with <code>yum --nogpgcheck localinstall udpcast-20081213-1.i386.rpm</code>
 
** On hopper, installed the syslinux and tftpd-hpa ports
 
** On hopper, installed the syslinux and tftpd-hpa ports
*** Enable tftpd in <code>/etc/inetd.conf</code> by removing the comments and restart inetd with <code>/etc/rc.d/inetd restart</code>
+
*** Enable tftpd in <code>/etc/inetd.conf</code> by removing the comments and restart inetd with <code>/etc/rc.d/inetd restart</code>, and then also run the command listed on that line to start tftpd
 
*** The following lines were already in <code>/usr/local/etc/dhcpd.conf: allow booting; allow bootp;</code>, put the filename in the particular group (see [http://debianclusters.cs.uni.edu/index.php/Udpcast_Cloning:_Preparing_the_DHCP_Server_for_PXE_Booting Debian Clusters])
 
*** The following lines were already in <code>/usr/local/etc/dhcpd.conf: allow booting; allow bootp;</code>, put the filename in the particular group (see [http://debianclusters.cs.uni.edu/index.php/Udpcast_Cloning:_Preparing_the_DHCP_Server_for_PXE_Booting Debian Clusters])
 
*** <code>cp /usr/local/share/syslinux/pxelinux.0 /tftpboot/</code> (the /tftpboot directory needs to be created)
 
*** <code>cp /usr/local/share/syslinux/pxelinux.0 /tftpboot/</code> (the /tftpboot directory needs to be created)

Revision as of 16:03, 26 August 2009

Scratch Space

Log

Green color indicates something that still needs to be done.

Cloning

  • Download the udpcast rpm from http://udpcast.linux.lu/source.html
    • Install with yum --nogpgcheck localinstall udpcast-20081213-1.i386.rpm
    • On hopper, installed the syslinux and tftpd-hpa ports
      • Enable tftpd in /etc/inetd.conf by removing the comments and restart inetd with /etc/rc.d/inetd restart, and then also run the command listed on that line to start tftpd
      • The following lines were already in /usr/local/etc/dhcpd.conf: allow booting; allow bootp;, put the filename in the particular group (see Debian Clusters)
      • cp /usr/local/share/syslinux/pxelinux.0 /tftpboot/ (the /tftpboot directory needs to be created)
      • Download linux, initrd, and default from the udpcast site into /tftpboot
      • Move default into /tftpboot/pxelinux.cfg
      • Restart dhcpd (killall -KILL dhcpd and /usr/local/sbin/dhcpd -q -cf /usr/local/etc/dhcpd.conf -lf /var/db/dhcpd/dhcpd.leases -pf /var/run/dhcpd/dhcpd.pid -user dhcpd -group dhcpd

Head Node

Yum installed:

  • gcc.x86_64, gcc-c++.x86_64
  • for Ganglia:
    • apr.x86_64 and apr-devel.x86_64
    • libconfuse-2.5-4.el5.x86_64.rpm, libconfuse-devel-2.5-4.el5.x86_64.rpm (from Fedora repositories)
    • expat-devel.x86_64
  • for Intel updates:
    • compat-libstdc++-33.i386

Install C3 tools from http://www.csm.ornl.gov/torc/C3/C3softwarepage.shtml

  • Downloaded full install rpm on bs0, installed with yum --nogpgcheck localinstall c3-4.0.1-1.noarch.rpm

Ganglia

  • On hopper, added the data_source line for bs0 to /usr/local/etc/gmetad.conf and restarted it with /usr/local/etc/rc.d/gmetad restart
  • Downloaded tar ball from http://sourceforge.net/projects/ganglia/
    • See Ganglia README
    • ./configure --prefix=/cluster
    • The head node uses a different Ganglia gmond.conf in /etc/ganglia/gmond.conf and the workers just have theirs symlinked to /cluster/etc/gmond.conf
    • By default, iptables is running on the CentOS install and blocks hopper's Ganglia requests
      • Turned off by clearing it and then running /sbin/service iptables save

Networking

  • Shorewall, see /etc/shorewall/params for almost all of the important definitions
    • Natting is done through /etc/shorewall/masq
  • DHCP relay, added to boot with chkconfig on, set for hopper (installed as part of dhcp yum package)
    • See /etc/sysconfig/dhcrelay
    • This means that a dhcp server is also installed, but it is not set to run and is not configured, either
    • Hopper needs to have a static route added in order to have the responses return, these are in /etc/rc.conf:
static_routes="bs0"
route_bs0="192.168.0.1 159.28.234.200"

Modules

Torque

  • Installed from source with ./configure --with-default-server=bs0.bobsced.loc --with-rc=scp --disable-mom --with-server-home=/var/spool/pbs ("clients" is what installs qmgr)
  • Installs to /usr/local/
  • Set up according to Debian Clusters setup
  • Reran the ./configure but without --disable-moms, then ran make packages, copied this to worker node

Maui

  • installed Maui according to same link as above

Intel Firmware Updates