Sysadmin

From Earlham CS Department
Revision as of 13:17, 23 February 2019 by Craigje (talk | contribs)
Jump to navigation Jump to search


Machines and Brief Descriptions of Services

CS Machines

Server layout as of May 2017
NET
(vm1)
LDAP Server
DNS
DHCP

Backup to Dali: etc, var
WEB
(vm2)
Mailman
Mail Stack
Apache2
PostgresQL
MySQL
Wiki

Backup to Dali: etc, var
TOOLS
(vm3)
SageNB Server
Jupyterhub Server
Software Modules
NginX
SSH
Users

Backup to Dali: etc, var, mnts, sage
BABBAGE
Firewall
PROTO
Weather Monitoring
GPS/NTP
Energy Monitoring

Backup to Dali: etc, var
BOWIE
PostgreSQL
Docker

Backup to Dali: etc, var
SMILEY
XenDocs
NET
WEB
NFS

Backup to Dali: etc, var
SHINKEN
Users
SSH
Add machines














Cluster Machines

HOPPER
Users
SSH
NFS server
LDAP server
Software Modules
PostgreSQL
Wiki
Apache2
DNS
DHCP

Backup to Indiana: etc, var, cluster
INDIANA
New Storage Server
DALI
Storage Server
Gitlab
Backups
NginX

Backup to Indiana (/media/r10_vol/backups/): etc, var/opt/gitlab/backups
AL-SALAM
WebMO
Software Modules
Apache2

Backup to Indiana: etc, var
WHEDON
Software Modules

Backups to Indiana: etc, var
LAYOUT
Jupyterhub Server
Software Modules
NginX
Apache2
WebMO

Backup to Indiana: etc, var
BRONTE
Software Modules

Backup to Indiana: etc, var, nbserver
POLLOCK
Software Modules
WebMO
NginX

Backup to Indiana: etc, var
KAHLO
Storage Server
Backups
NginX

Backup to Indiana: etc, var
BIGFE
Software Modules

Hosts BCCD related repositories and distributions.
T-VOC
Software Modules
ELWOOD
Software Modules

Used by BCCD to host www.bccd.net and www.littlefe.net. Will be deprecated when BCCD project offloads their sites onto cloud-based hosting platforms.
krasner
Docker platform on an old lovelace machine upgraded to have 16GB of RAM.











































Switches

SG538SF02J
  • Model: HP Procurve 3400cl
  • Ports: 24
  • Backplane bandwidth:
    • 88 Gbps
    • 64 million pps
  • Memory:
    • 2MB packet buffer
    • 16 MB dual flash
    • 128 MB SDRAM
  • Cut-through switching: No
  • Unused as of May 12, 2017
CN63FP762S
  • Model: HP 2530-24G
  • Ports: 24
  • Switching Capacity:
    • 56 Gbps
    • 41.6 million pps
  • Memory:
    • 1.5 MB packet buffer
    • 256 MB flash
    • 128 MB DDR3 DIMM
  • Cut-through switching: No
  • Connected to Al-Salam as of May 12, 2017
SG525SG025
  • Model: HP Procurve 3400cl
  • Ports: 24
  • Backplane bandwidth:
    • 88 Gbps
    • 64 million pps
  • Memory:
    • 2MB packet buffer
    • 16 MB dual flash
    • 128 MB SDRAM
  • Cut-through switching: No
  • Connected to layout and whedon as of May 12, 2017
Netgear JGS524
  • Current cluster head-node
  • Unmanaged (no console/configuration)
  • Ports: 24
  • Switching bandwidth:
    • 48 Gbps
    • 1.5 million pps
  • Memory:
    • 2MB packet buffer
  • Cut-through switching: No
  • Connected to Al-Salam, Hopper, Pollock, Nagios, Dali, Kahlo, Bronte as of May 12, 2017
cs-main
  • Model: HP 5920AF-24XG
  • Ports: 24
  • Backplane bandwidth:
    • 480 Gbps
    • 367 million pps
  • Memory:
    • 3.6 GB packet buffer
    • 256 MB dual flash
    • 2 GB SDRAM
  • Cut-through switching: Yes
  • IP Address: 159.28.31.66
  • Connected to layout, kahlo, and dali as of May 12, 2017
5500denniscs-sw1
  • Model: HP 5500 JG542A
  • Ports: 24
  • Backplane bandwidth:
    • 224 Gbps
    • 166.6 million pps
  • Memory:
    • 6 MB packet buffer
    • 512 MB dual flash
    • 1 GB SDRAM
  • Cut-through switching: No
  • IP Address: 159.28.31.67
  • Connected to Babbage, Bowie, Nagios, and the cluster's netgear switch (via port 14) as of May 12, 2017


































Systems Administration Documentation

For old documentation, see: Old Wiki Information

Current Projects

This is the list we will work from in addition to service requests.

Some important procedural pages:

Please update specific projects at their own page.

  • Documentation - a meta-project - please click here to update projects you've worked on but haven't documented yet
  • Web logins
  • Password management
  • Docker and WebODM on Bronte
  • Fix shinken server access
  • Backup in Lilly basement
  • Backup on all machines - includes backup.cs.e.e (indiana?)
  • Post shutdown
  • power map additions and updates - update the power map and get as much of our power data (how much we use, how much we could theoretically be using, etc.) all together
  • clean up system variables, e.g. verifying that all our systems have Python 2 as a default not Python 3, using Charlie’s earlier email about cexecs as a starting point (in your inbox or the mailing list, search for “A bit of cleaning on the cluster side”)
  • install and configure RT
  • install and configure bioinformatics software (a focus for Laurence but one that others may want to look at) - includes making sure qsub works on both bronte and pollock

Smaller projects

  • Layout Layout
  • Fix man pages (on each machine, check that man pages come up as expected - e.g. run `man ls` - and fix them if not)
  • Double-check our time servers.
  • Fix Bronte's IP address problem.
    • see nfstab; check the IP backups on hopper and dali (sysconfig,networks,scripts in the backup dir - also check ~charliep for a capture of ifconfig's eth*)
    • take notes for future CentOS upgrades
  • note the hardware specs for each server (including nodes and processors in each cluster)