Difference between revisions of "Sysadmin"

From Earlham CS Department

Revision as of 15:05, 11 February 2019


Machines and Brief Descriptions of Services

CS Machines

Server layout as of May 2017
NET
(vm1)
LDAP Server
DNS
DHCP

Backup to Dali: etc, var
WEB
(vm2)
Mailman
Mail Stack
Apache2
PostgreSQL
MySQL
Wiki

Backup to Dali: etc, var
TOOLS
(vm3)
SageNB Server
Jupyterhub Server
Software Modules
NginX
SSH
Users

Backup to Dali: etc, var, mnts, sage
BABBAGE
Firewall
PROTO
Weather Monitoring
GPS/NTP
Energy Monitoring

Backup to Dali: etc, var
BOWIE
PostgreSQL
Docker

Backup to Dali: etc, var
SMILEY
XenDocs
NET
WEB
NFS

Backup to Dali: etc, var
SHINKEN
Users
SSH
Add machines

Cluster Machines

HOPPER
Users
SSH
NFS server
LDAP server
Software Modules
PostgreSQL
Wiki
Apache2
DNS
DHCP

Backup to Indiana: etc, var, cluster
INDIANA
New Storage Server
DALI
Storage Server
Gitlab
Backups
NginX

Backup to Indiana (/media/r10_vol/backups/): etc, var/opt/gitlab/backups
AL-SALAM
WebMO
Software Modules
Apache2

Backup to Indiana: etc, var
WHEDON
Software Modules

Backup to Indiana: etc, var
LAYOUT
Jupyterhub Server
Software Modules
NginX
Apache2
WebMO

Backup to Indiana: etc, var
BRONTE
Software Modules

Backup to Indiana: etc, var, nbserver
POLLOCK
Software Modules
WebMO
NginX

Backup to Indiana: etc, var
KAHLO
Storage Server
Backups
NginX

Backup to Indiana: etc, var
BIGFE
Software Modules

Hosts BCCD-related repositories and distributions.
T-VOC
Software Modules
ELWOOD
Software Modules

Used by BCCD to host www.bccd.net and www.littlefe.net. Will be deprecated when the BCCD project moves its sites to cloud-based hosting platforms.
KRASNER
Docker platform on an old Lovelace machine, upgraded to 16 GB of RAM.

Switches

SG538SF02J
  • Model: HP Procurve 3400cl
  • Ports: 24
  • Backplane bandwidth:
    • 88 Gbps
    • 64 million pps
  • Memory:
    • 2 MB packet buffer
    • 16 MB dual flash
    • 128 MB SDRAM
  • Cut-through switching: No
  • Unused as of May 12, 2017
CN63FP762S
  • Model: HP 2530-24G
  • Ports: 24
  • Switching Capacity:
    • 56 Gbps
    • 41.6 million pps
  • Memory:
    • 1.5 MB packet buffer
    • 256 MB flash
    • 128 MB DDR3 DIMM
  • Cut-through switching: No
  • Connected to Al-Salam as of May 12, 2017
SG525SG025
  • Model: HP Procurve 3400cl
  • Ports: 24
  • Backplane bandwidth:
    • 88 Gbps
    • 64 million pps
  • Memory:
    • 2 MB packet buffer
    • 16 MB dual flash
    • 128 MB SDRAM
  • Cut-through switching: No
  • Connected to Layout and Whedon as of May 12, 2017
Netgear JGS524
  • Current cluster head-node
  • Unmanaged (no console/configuration)
  • Ports: 24
  • Switching bandwidth:
    • 48 Gbps
    • 1.5 million pps
  • Memory:
    • 2 MB packet buffer
  • Cut-through switching: No
  • Connected to Al-Salam, Hopper, Pollock, Nagios, Dali, Kahlo, Bronte as of May 12, 2017
cs-main
  • Model: HP 5920AF-24XG
  • Ports: 24
  • Backplane bandwidth:
    • 480 Gbps
    • 367 million pps
  • Memory:
    • 3.6 GB packet buffer
    • 256 MB dual flash
    • 2 GB SDRAM
  • Cut-through switching: Yes
  • IP Address: 159.28.31.66
  • Connected to Layout, Kahlo, and Dali as of May 12, 2017
5500denniscs-sw1
  • Model: HP 5500 JG542A
  • Ports: 24
  • Backplane bandwidth:
    • 224 Gbps
    • 166.6 million pps
  • Memory:
    • 6 MB packet buffer
    • 512 MB dual flash
    • 1 GB SDRAM
  • Cut-through switching: No
  • IP Address: 159.28.31.67
  • Connected to Babbage, Bowie, Nagios, and the cluster's Netgear switch (via port 14) as of May 12, 2017

Systems Administration Documentation

For old documentation, see: Old Wiki Information

Current Projects

This is the list we will work from in addition to service requests.

Some important procedural pages:

Please update specific projects at their own page.

  • Documentation - a meta-project - please click here to update projects you've worked on but haven't documented yet
  • Web logins
  • Password management
  • Docker and WebODM on Bronte
  • Fix shinken server access
  • Backup in Lilly basement
  • Backup on all machines - includes backup.cs.e.e (indiana?)
  • Post shutdown
  • Power map additions and updates - update the power map and gather our power data (how much we use, how much we could theoretically use, etc.) in one place
  • Clean up system variables, e.g. verify that all our systems default to Python 2 rather than Python 3, using Charlie’s earlier email about cexecs as a starting point (in your inbox or the mailing list, search for “A bit of cleaning on the cluster side”)
  • Install and configure RT
  • Install and configure bioinformatics software (a focus for Laurence, but one that others may want to look at) - includes making sure qsub works on both Bronte and Pollock
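
For the Python default check mentioned in the list above, a minimal sketch might look like the following. It is written for Python 3 and assumes that "the default" means whatever `python` resolves to on the PATH. The function names are illustrative only, and running it on every machine (for example through cexec) is left out.

```python
import re
import subprocess
from typing import Optional


def parse_python_major(version_output: str) -> Optional[int]:
    """Extract the major version from text such as 'Python 2.7.15'."""
    match = re.search(r"Python (\d+)\.\d+", version_output)
    return int(match.group(1)) if match else None


def default_python_major(command: str = "python") -> Optional[int]:
    """Run `<command> --version` and return its major version.

    Returns None if the command is missing or its output cannot be parsed.
    """
    try:
        result = subprocess.run(
            [command, "--version"],
            stdout=subprocess.PIPE,
            stderr=subprocess.PIPE,
            universal_newlines=True,
        )
    except OSError:
        return None
    # Python 2 prints its version to stderr; Python 3.4+ prints to stdout,
    # so both streams are searched.
    return parse_python_major(result.stdout + result.stderr)


if __name__ == "__main__":
    major = default_python_major()
    if major is None:
        print("No usable `python` command found on this machine.")
    elif major == 2:
        print("OK: default python is Python 2.")
    else:
        print("CHECK: default python is Python %d." % major)
```

Run on each machine, the script prints one line saying whether the default interpreter matches the expectation stated above.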

Smaller projects

  • Update our DNS files (and thus our store of reserved IP addresses) based on which machines are currently in use (a good chance to learn DNS, both generally and how we run it here) - cf. Verify Lovelace DNS
  • Layout Layout
  • Fix Lovelace machines
  • Fix man pages (on each machine, check that man pages come up as expected - e.g. run `man ls` - and fix them if not)
  • Double-check our time servers.
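
The man page check in the list above can be scripted per machine. The sketch below is one possible approach, not an established procedure here: it assumes that `man <topic>` exits with a non-zero status when the page is missing. The topics other than `ls` and all function names are hypothetical.

```python
import subprocess
from typing import Dict, Iterable, List


def man_page_available(topic: str, man_command: str = "man") -> bool:
    """Return True if `man <topic>` exits successfully on this machine."""
    try:
        result = subprocess.run(
            [man_command, topic],
            stdin=subprocess.DEVNULL,
            stdout=subprocess.DEVNULL,  # only the exit status matters here
            stderr=subprocess.DEVNULL,
        )
    except OSError:
        # The man command itself is missing or not executable.
        return False
    return result.returncode == 0


def check_topics(topics: Iterable[str], man_command: str = "man") -> Dict[str, bool]:
    """Map each topic to whether its man page came up."""
    return {topic: man_page_available(topic, man_command) for topic in topics}


def missing_topics(results: Dict[str, bool]) -> List[str]:
    """List, in sorted order, the topics whose man pages did not come up."""
    return sorted(topic for topic, ok in results.items() if not ok)


if __name__ == "__main__":
    # `ls` comes from the task description; the other topics are examples.
    missing = missing_topics(check_topics(["ls", "cp", "ssh"]))
    if missing:
        print("Missing man pages: " + ", ".join(missing))
    else:
        print("All sampled man pages came up.")
```

A machine that reports missing pages still needs the manual fix described in the task; the script only narrows down where to look.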