Difference between revisions of "Sysadmin"

From Earlham CS Department
Jump to navigation Jump to search
(Sysadmin 2014 to do list:)
(Compute (servers and clusters))
 
(625 intermediate revisions by 20 users not shown)
Line 1: Line 1:
__NOTOC__
+
This is the hub for the CS sysadmins on the wiki.
 +
 
 +
= Overview =
 +
 
 +
[https://docs.google.com/drawings/d/1XaULz5IxXV_BZQjrko3QJ8wV5aXsSTYcSWxxT49OyZk/edit If you're visually inclined, we have a colorful and easy-to-edit map of our servers here!]
 +
 
 +
== Server room ==
 +
 
 +
Our servers are in Noyes, the science building that predates the CST. For general information about the server room and how to use it, check out [[Sysadmin:Server Room|this page]].
 +
 
 +
Columns: machine name, IPs, type (virtual, metal), purpose, dies, cores, RAM
 +
 
 +
== Compute Resources ==
  
== Sysadmin Responsibilities ==
 
This is the basic list of tasks that Earlham CS system administrators are in charge of.
 
  
 
{| class="wikitable"
 
{| class="wikitable"
 +
|+ CS machines and VMs
 +
|-
 +
! Machine name !! 159 Ip Address !! 10Gb Ip address !! Operating System !! Metal or Virtual !! Description !! RAM
 +
|-
 +
| Bowie || 159.28.22.5 || 10.10.10.15 || Debian 9 || Metal || hosts and exports user files; Jupyterhub; landing server || 198 GB
 +
|-
 +
| Smiley || 159.28.22.251 || 10.10.10.252 || Ubuntu 18.04 || Metal || VM host, not accessible to regular users || 156 GB
 +
|-
 +
| Web || 159.28.22.2 || 10.10.10.200 || Ubuntu 18.04 || Virtual || Website host || 8 GB
 +
|-
 +
| Auth || 159.28.22.39 || No 10Gb internet|| CentOS 7 || Virtual || host of LDAP user database || 4 GB
 
|-
 
|-
! Responsibilities !! Wilson !! Eamon
+
| Code || 159.28.22.42 || 10.10.10.42 || Ubuntu 18.04 || Virtual || Gitlab host || 8 GB
 
|-
 
|-
| Install software on Debian (ACL) || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div>
+
| Net || 159.28.22.1 || 10.10.10.100 || Ubuntu 18.04 || Virtual || network administration host for CS || 4 GB
 
|-
 
|-
| Install software on FreeBSD (servers) || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div>
+
| Central || 159.28.22.177 || No 10Gb internet || Debian 9 || Virtual || ODK Central Host || 4 GB
 
|-
 
|-
| Make a CS user account || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> ||
+
| Urey || 159.28.22.139 || No 10Gb internet || XCP-ng || Metal || Sysadmin Sandbox Environment || 16 GB
 +
|}
 +
 
 +
{| class="wikitable"
 +
|+ Cluster machines
 
|-
 
|-
| Change users CS password || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> ||
+
! Machine name !! 159 Ip Address !! 10Gb Ip address !! Operating System !! Metal or Virtual !! Description !! RAM
 
|-
 
|-
| Add DNS & DHCP entry || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> ||
+
| Hopper || 159.28.23.1 || 10.10.10.1 || Debian 10 || Metal || landing server, NFS host for cluster || 64 GB
 
|-
 
|-
| Being able to edit wiki || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div>
+
| Lovelace || 159.28.23.35 || 10.10.10.35 || CentOS 7 || Metal || Large compute server || 96 GB
 
|-
 
|-
| Make a CS wiki account || ||
+
| Pollock || 159.28.23.8 || 10.10.10.8 || CentOS 7 || Metal || Large compute server || 131 GB
 +
|-
 +
| Bronte || 159.28.23.140 || No 10Gb internet || CentOS 7 || Metal || Large compute server || 115 GB
 
|-
 
|-
| Add people to different groups (ldap) || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> ||
+
| Sakurai || 159.23.23.3 || 10.10.10.3 || Debian 10 || Metal || Runs Backup || 12 GB
 
|-
 
|-
| Modification and maintenance of Nagios || ||
+
| Miyamoto || 159.28.23.45 || No 10Gb currently || Debian 10 || Metal || Runs Backup || 16 GB
 
|-
 
|-
| DD a new ACL image || ||
+
| HopperPrime || 159.28.23.142 || 10.10.10.142 || Debian 10 || Metal || Runs Backup || 16 GB
 
|-
 
|-
| Set up a new ACL || ||  
+
| Monitor || 159.28.23.250 || No 10Gb internet || Debian 11 || Metal || Server Monitoring || 8 GB
 
|-
 
|-
| Shut down / start up of the entire machine room || ||  
+
| Layout 0 || 159.28.23.2 || 10.10.10.2 || CentOS 7 || Metal || Head Node || 32 GB
 
|-
 
|-
| Creating and configuring mailing lists (electron) || ||  
+
| Layout 1 || None || None || CentOS 7 || Metal || Compute Node || 32 GB
 
|-
 
|-
| Admin list moderating || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> ||
+
| Layout 2 || None || None || CentOS 7 || Metal || Compute Node || 32 GB
 
|-
 
|-
| Backups and restore (bacula) || ||  
+
| Layout 3 || None || None || CentOS 7 || Metal || Compute Node || 32 GB
 
|-
 
|-
| Create and configure jails  || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> ||
+
| Layout 4 || None || None || CentOS 7 || Metal || Compute Node || 32 GB
 
|-
 
|-
| VMware || ||
+
| Whedon 0 || 159.28.23.4 || No 10Gb internet|| CentOS 7 || Metal || Head Node || 256 GB
|}
 
 
 
== Sysadmin basic Training ==
 
This is the list of skills that our System Administrators are trained during their orientation.
 
 
 
{| class="wikitable"
 
 
|-
 
|-
! Training Sections !! Wilson !! Eamon
+
| Whedon 1 || None || None || CentOS 7 || Metal || Compute Node || 256 GB
 
|-
 
|-
| | ||
+
| Whedon 2 || None || None || CentOS 7 || Metal || Compute Node || 256 GB
 
|-
 
|-
| Installing operating systems (Debian and FreeBSD), including single-user mode || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div>
+
| Whedon 3 || None || None || CentOS 7 || Metal || Compute Node || 256 GB
 
|-
 
|-
| Installing packages || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div>
+
| Whedon 4 || None || None || CentOS 7 || Metal || Compute Node || 256 GB
 
|-
 
|-
| *nix Filesystem layout || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div>
+
| Whedon 5 || None || None || CentOS 7 || Metal || Compute Node || 256 GB
 
|-
 
|-
| Command line tools including I/O redirections and pipes || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div>
+
| Whedon 6 || None || None || CentOS 7 || Metal || Compute Node || 256 GB
 
|-
 
|-
| TCP,  UDP and ICMP packets, including 3-way handshake || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div>
+
| Whedon 7 || None || None || CentOS 7 || Metal || Compute Node || 256 GB
 
|-
 
|-
| Ports || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div>
+
| Hamilton 0 || 159.28.23.5 || No 10Gb internet || Debian 11 || Metal || Head Node || 128 GB
 
|-
 
|-
| DNS || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div>
+
| Hamilton 1 || None || None || Debian 11 || Metal || Compute Node || 256 GB
 
|-
 
|-
| DHCP  || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div>
+
| Hamilton 2 || None || None || Debian 11 || Metal || Compute Node || 256 GB
 
|-
 
|-
| Network debugging tools (tcpdump, ping, traceroute, netstat) || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div>
+
| Hamilton 3 || None || None || Debian 11 || Metal || Compute Node || 256 GB
 
|-
 
|-
| Simple shell scripting || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div>
+
| Hamilton 4 || None || None || Debian 11 || Metal || Compute Node || 256 GB
 
|-
 
|-
| Jails || <div style="text-align: center;"> [[File:StarIconBronze.png|20px]] </div> ||
+
| Hamilton 5 || None || None || Debian 11 || Metal || Compute Node || 256 GB
 
|}
 
|}
  
== Sysadmin 2014 to do list: ==
+
{| class="wikitable"
 
+
|+ Lab machines
* () Spam Filter (CP, JR)  
+
|-
* () new-proto from the outside world
+
! Machine name !! 159 Ip Address !! Location !! Operating System !! RAM
* (I) check hydra with Charlie
+
|-
* Getting rid of Quark <br />
+
| Borg || 159.28.22.10 || Turing (CST 222) || Ubuntu 20 || 16 GB
* (?) On ACLs login disappears after pressing cancel (Reported JR)  
+
|-
* ACL screensaver, leave only lightweight? (JR)  
+
| Gao || 159.28.22.11 || Turing (CST 222) || Ubuntu 20 || 8 GB
* Removing mailman form quark <br />
+
|-
* Mailman Heather (not all of them accepted the changes electron to cs.earlham.edu) <br />
+
| Snyder || 159.28.22.12 || Turing (CST 222) || Ubuntu 20 || 8 GB
* (W) Script for changing users password
+
|-
* (W) Script for changing and adding groups
+
| Goldwasser || 159.28.22.13 || Lovelace (CST 219) || Ubuntu 20 || 8 GB
* (W) Brushing the add a user script
+
|-
* (W) Check machines chooser can choose from
+
| Bartik || 159.28.22.14 || Lovelace (CST 219) || Ubuntu 20 || 8 GB
* (E) Script that will send an e-mail to all people
+
|-
* (E) Improve nagios settings
+
| Wilson || 159.28.22.15 || Lovelace (CST 219) || Ubuntu 20 || 8 GB
* (W) Can we change Kristin Muterspaw CS username (from kmmuter11 to buzzlightyear)
+
|-
 +
| Bilas || 159.28.22.16 || Lovelace (CST 219) || Ubuntu 20 || 8 GB
 +
|-
 +
| Johnson || 159.28.22.17 || Lovelace (CST 219) || Ubuntu 20 || 8 GB
 +
|-
 +
| Graham || 159.28.22.14 || Lovelace (CST 219) || Ubuntu 20 || 8 GB
 +
|}
  
** Done
+
=== CS Machine Address List ===
* Wireshark should be run only be 410 students (Reported JR)
+
<pre>bowie.cs.earlham.edu smiley.cs.earlham.edu web.cs.earlham.edu auth.cs.earlham.edu code.cs.earlham.edu net.cs.earlham.edu central.cs.earlham.edu urey.cs.earlham.edu</pre>
* Re-imaging ENI machine (ACL21) <br />
 
* () DNS troubles (Reported CP)
 
* (I) fab lab list (HL)
 
* (I) Hassan, SSH Trouble to Electron (Hassan, JR)
 
  
 +
=== Cluster Machine Address List ===
 +
<pre>
 +
hopper.cluster.earlham.edu lovelace.cluster.earlham.edu pollock.cluster.earlham.edu bronte.cluster.earlham.edu sakurai.cluster.earlham.edu miyamoto.cluster.earlham.edu hopperprime.cluster.earlham.edu monitor.cluster.earlham.edu whedon.cluster.earlham.edu layout.cluster.earlham.edu hamilton.cluster.earlham.edu</pre>
  
 +
=== Lab Machine Address List ===
 +
<pre>borg.cs.earlham.edu gao.cs.earlham.edu snyder.cs.earlham.edu goldwasser.cs.earlham.edu bartik.cs.earlham.edu wilson.cs.earlham.edu bilas.cs.earlham.edu johnson.cs.earlham.edu graham.cs.earlham.edu</pre>
  
 +
=== Specialized resources ===
  
 +
Specialized computing applications are supported on the following machines:
  
 +
* [[Sysadmin:GPGPU|GPU’s for AI/ML/data science]]: layout cluster
 +
* virtualization: smiley
 +
* containers: bowie
  
 +
== Network ==
  
 +
We have two network fabrics linking the machines together. There are three subdomains.
  
 +
=== 10 Gb ===
  
'''Documentation:'''
+
We have 10Gb fabric to mount files over NFS. Machines with 10Gb support have an IP address in the class C range 10.10.10.0/24 and we want to add DNS to these addresses.
  
Wilson:
+
=== 1 Gb (cluster, cs) ===
* DNS & DHCP (done)  
 
* Sage (done)
 
* Add User
 
* Add/change group
 
* Password change
 
* Firewall
 
  
Eamon:
+
We have two class C subnets on the 1Gb fabric: 159.28.22.0/24 (CS) and 159.28.23.0/24 (cluster). This means we have double the IP addresses on the 1Gb fabric that we have on the 10Gb fabric.
* Cups
 
* PSSH
 
  
Ivan
+
Any user accessing *.cluster.earlham.edu and *.cs.earlham.edu is making calls on a 1Gb network.
* Cloning ACL box
 
  
== Systems Administration Documentation ==
+
=== Intra-cluster fabrics ===
  
{|
+
The layout cluster has an Infiniband infrastructure. Wachowski has only a 1Gb infrastructure.
|- valign="top"
 
|
 
<div style="border:10px solid #E3E0FA; padding:5px">
 
<div style="background-color:#D7D1F8; padding:5px;">
 
=== Works in Progress ===
 
</div>
 
  
* [[Sysadmin:todo13|To do before Fall 13 starts]]
+
== Power ==
* [[Sysadmin:handbook|Handbook (WIP)]]
 
* [[Sysadmin:Temporary Page | Temporary Page for Wiki Adjustment]]
 
* [[Sysadmin: Upgrading FreeBSD | Upgrading FreeBSD]]
 
* [[Sysadmin:Fail2Ban on FreeBSD | Fail2Ban on FreeBSD]]
 
* [[Sysadmin:Running Nessus | Running Nessus]]
 
* [[Sysadmin:SrvcCheck|Things to check when things go down]]
 
* [[Sysadmin:AaronsHowTo| Aaron's How-To Pages]]
 
* [[Sysadmin:Sonresources| Son's "Cook" Pages]]
 
* [[Sysadmin:Installing ACLs]]
 
  
<!-- This has to stay as part of the formatting -->
+
We have a backup power supply, with batteries last upgraded in 2019 (?). We’ve had a few outages since then and power has held up well.
</div>
 
| style="width:50px;" |
 
|
 
<div style="border:10px solid #E0EAF8; padding:5px;">
 
<div style="background-color:#CEDEF4; padding:5px;">
 
  
=== Admin Tasks ===
+
== HVAC ==
</div>
 
  
* [[Sysadmin: NEWcupssetup|NEW CUPS/Printer Adiministration]]
+
HVAC systems are static and are largely managed by Facilities.
* [[Sysadmin: NEWAddComputer|NEW Add a computer]]
 
* [[Sysadmin:NEWStart/Shutdown|NEW Shutdown/Start]]
 
* [[Sysadmin:NEWMailman|NEW Mailman]]
 
* [[Sysadmin:NEWNagios|NEW Nagios]]
 
* [[Sysadmin:Backup|Backup]] (needs to be updated after new setup)
 
* [[Sysadmin:Contacting all users|Contacting all users]]
 
* [[Sysadmin:New Sysadmins|Welcoming a new sysadmin to the fold]]
 
* [[Sysadmin:RT Ticketing|RT Ticketing]]
 
* [[Sysadmin:AddComputer|Add a computer]]
 
  
 +
[[Topology|See full topology diagrams here.]]
  
<!-- This has to stay as part of the formatting -->
+
[[Sysadmin:Layers of abstraction for filesystems|A word about what's happening between files and the drives they live on.]]
</div>
 
|}
 
  
  
{|
+
= New sysadmins =
|- valign="top"
 
|
 
  
<div style="border:10px solid #FFDFFF; padding:5px;">
+
These pages will be helpful for you if you're just starting in the group:
<div style="background-color:#FFCEFF; padding:5px;">
 
  
=== Services ===
+
* [[Sysadmin:New Sysadmins | Welcoming a new sysadmin ]]
</div>
+
* [[Sysadmin:Troubleshooting|General troubleshooting tips for admins]]
* [[Sysadmin:Services:Apache2|Apache2]]
+
* [[Sandbox Notes|Sandbox Notes]]
* [[Sysadmin:Services:Databases|Databases]]
+
* [[Password managers]]
* [[Sysadmin:Services:DNS and DHCP|NEW DNS and DHCP]]
+
* [[Server safety]]
* [[Sysadmin:Services:Email|Email]]
+
* [https://code.cs.earlham.edu/sysadmin/ticket-tracker Ticket tracking for current projects]
* [[Sysadmin:Services:LVM|LVM]]
 
* [[Sysadmin:User Management|User Management]]
 
* [[Sysadmin:positron|NFS]]
 
* [[Sysadmin:Services:Printers|Printers]]
 
* [[Sysadmin:services:Sage|NEW Sage]]
 
* [[Sysadmin:Services:SystemImager|System Imager]]
 
* [[Sysadmin:Services:TracSVN|Trac + svn]]
 
* [[Sysadmin:Services:Virtualization | Virtualization]]
 
* [[Sysadmin:Services:ZFS | ZFS]]
 
  
<!-- This has to stay as part of the formatting -->
+
Note: you'll need to log in with wiki credentials to see most Sysadmin pages.
</div>
 
| style="width:50px;" |
 
|
 
  
<div style="border:10px solid #DBF0F7; padding:5px;">
+
= Additional information =
<div style="background-color:#C9EAF3; padding:5px;">
 
  
=== Servers ===
+
These pages contain a lot of the most important information about our systems and how we operate.
</div>
 
* [[Sysadmin:PhysicalServers | Physical Servers]]
 
* [[Sysadmin:VirtualServersAndJails | Virtual Servers and Jails]]
 
* [[Sysadmin:SvcChart|Service Chart]]
 
* [[Sysadmin:Monitoring|Monitoring]]
 
* [[Sysadmin:Quark | Quark]]
 
* [[Sysadmin:Forty-Two | Forty-two]]
 
* [[Sysadmin:Lovelace | Lovelace]]
 
* [[Sysadmin:Proto | Proto]]
 
* [[Sysadmin:RetiredServers | Retired Servers]]
 
 
 
<!-- This has to stay as part of the formatting -->
 
</div>
 
| style="width:50px;" |
 
|
 
<div style="border:10px solid #FFFFC8; padding:5px;">
 
<div style="background-color:#FFFFB5; padding:5px;">
 
 
 
=== ACL Workstations ===
 
</div>
 
* [[Sysadmin:ACL:Installation|ACL Installation procedure]]
 
* [[Sysadmin:AclImage|ACL Package Information]]
 
* [[Sysadmin:Acl Locations|ACL Locations]]
 
* [[Sysadmin:Software for Chemistry ACLs|Software for Chemistry ACLs]]
 
* [[Sysadmin:ACL:UpProp|Proposed ACL Update policy]]
 
 
 
<!-- This has to stay as part of the formatting -->
 
</div>
 
|}
 
 
 
 
 
{|
 
|- valign="top"
 
|
 
<div style="border:10px solid #D6F8DE; padding:5px;">
 
<div style="background-color:#BDF4CB; padding:5px;">
 
=== Networking ===
 
</div>
 
* [[Sysadmin:Networking:NetworkLayout|Network Layout (as of 08/2006)]]
 
* [[Sysadmin:Networking:D224 cable plant|D224 cable plant]]
 
* [[Sysadmin:Networking:Fiber plans|Fiber plans]]
 
* [[Sysadmin:Networking:Switches|Switches]]
 
* [[Sysadmin:Networking:Rack notes|Rack notes]]
 
* [[Sysadmin:Networking:Public|Public Network]]
 
* [[Sysadmin:Networking:NetworkTopo|Old Network Topo Figures]]
 
* [[Sysadmin:Networking:NetworkDiagram|Network layout (May 2007)]]
 
* [[Sysadmin:Networking:Alternate Network Path|Alt Network path]]
 
* [[Sysadmin:UPS Setup]]
 
 
 
<!-- This has to stay as part of the formatting -->
 
</div>
 
| style="width:50px;" |
 
|
 
<div style="border:10px solid #F0DDD5; padding:5px;">
 
<div style="background-color:#E4C0B1; padding:5px;">
 
 
 
=== Miscellaneous ===
 
</div>
 
* [[SysadminContactInfo|Contact Information]]
 
* [[Sysadmin:ImportantInfo:PhoneNumbers|Phone Numbers]]
 
* [[Sysadmin:ImportantInfo:WebSites|Web Sites]]
 
* [[Sysadmin:ImportantInfo:AuthenticationInfo|Authentication Information]]
 
* [[Sysadmin:ImportantInfo:PowerFailure|Power Failure]]
 
* [[Sysadmin:ImportantInfo:UPS|UPS]]
 
* [[Sysadmin:ImportantInfo:SSLcerts|Generating SSL Certificates]]
 
* [[Sysadmin:Power draws|Power draws]]
 
* [[Sysadmin:ImportantInfo:SunHardware|Working with Sun Hardware]]
 
* [[Sysadmin:Passwords]]
 
* Patching
 
** [[LinuxKernelPatching|Linux Kernel Patching]]
 
** [[FreeBSDKernelPatching|FreeBSD Kernel Patching]]
 
* [[Sysadmin:SerialConsoleCableEnds|Cable Ends]]
 
 
 
<!-- This has to stay as part of the formatting -->
 
</div>
 
|}
 
  
 +
===Technical docs===
  
 +
* [https://code.cs.earlham.edu/sysadmin/ticket-tracker Ticket tracking for current projects]
 +
* [[Server safety]]
 +
* [[Sysadmin:Backup|Backup]]
 +
* [[Sysadmin:Monitoring | Monitoring ]]
 +
* [[Sysadmin:SSH|SSH info relevant to admins]]
 +
* [[Sysadmin:User Management | User Management]] and [[Sysadmin:LDAP|LDAP]] generally
 +
* [[Sysadmin:Jupyterhub Notebook Server|Jupyterhub]] and [[Nbgrader notes|NBGrader]]
 +
* [[Sysadmin:MailStack|Email service]]
 +
* [[Sysadmin:XenDocs | Xen Server]]
 +
* [[Sysadmin:NFS|Network File System (NFS)]]
 +
* [[Sysadmin:Web Servers|Web Servers and Websites]]
 +
* [[Sysadmin:Services:Databases|Databases]]
 +
* [[Sysadmin:DNS & DHCP|DNS and DHCP]]
 +
* [[Sysadmin:AWS|AWS]]
 +
* [[Bash_start_up_script|Bash startup scripts]]
 +
* [[Sysadmin:VirtualBox | VirtualBox]]
 +
* [[X Applications]]
 +
* [[Sysadmin:Services:ClusterOverview|Cluster Overview]] and [[Sysadmin:Ccg-admin|additional details]]
 +
* [[Sysadmin:Firewall|Firewall]] running on babbage.cs.e.e
 +
* [[Sysadmin:Setting_up_Lovelace_Lab_Machines|Setting up Lab Machines]]
  
 +
===Common tasks===
 +
* [[Sysadmin:Recurring Tasks | Recurring tasks - e.g. software updates, hardware replacements]]
 +
* [[Sysadmin:Contacting all users|Contacting all users]]
 +
* [[Reset password]]
 +
* [[Sysadmin:Software installation | Software installation]]
 +
* [[Modules | Installing software under modules ]]
 +
* [[Sysadmin:AddComputer|Add a computer to CS or cluster domains]]
 +
* [[Senior projects|Supporting senior projects]]
 +
* [[ShutdownProcedure|How to do a planned shutdown and reboot of the system]]
 +
** [[Sysadmin:TestingServices | Testing services]] (after a reboot, upgrade, change in the phase of the moon, etc.)
 +
* [[Sysadmin:Upgrading SSL Certificate | Upgrading SSL Certificates ]]
 +
* [[Sysadmin:Launch at startup|Launch a process at startup]]
 +
* [[Sysadmin:Psql-setup | setup psql for cs430 students]]
  
 
+
===Group and institution information===
=== Old ===
+
* [[Sysadmin:CS-ITS Interoperability|Working with ITS]]
 
+
* [[Sysadmin:Recurring spending | Recurring spending ]]
Important Notes:
+
* [[Sysadmin:SlackAndGitLab | Slack and GitLab integration]]
* '''''ALL of the admin '''''  '''CVS/SVN stuff has been centralized to trac.cs.earlham.edu/admin'''.  You'll need to create a username/password for yourself by running (from quark):
 
:<code>htpasswd /usr/local/trac/adminontrac.htpasswd <username></code>
 
* To check out the repository, run (from quark):
 
:<code>svn checkout file:///clients/users/svn/admin</code>
 
* [[Sysadmin:IRC|Chatting on IRC]]
 
 
 
'''Curent Sysadmins 2013:'''
 
{| class="wikitable"
 
|-
 
! SysAdmin Name !! Year !! Working time !! Progress notes
 
|-
 
| Wilson || SO || 100% || link to notes
 
|-
 
| Demise || SR || 100% || link to notes
 
|-
 
| Craig || FR || 100% || link to notes
 
|-
 
| Zane || SO || 100% || link to notes
 
|-
 
| Jordan || SO || 100% || link to notes
 
|-
 
| Sonny || JU || 100% || link to notes
 
|-
 
| Elena || SR || 40% || link to notes
 
|-
 
| Kristin || JU || 40% || link to notes
 
|-
 
| Aaron || SR || 20% || link to notes
 
|-
 
| Michael || SR || 0% || link to notes
 
|}
 

Latest revision as of 11:15, 1 June 2022

This is the hub for the CS sysadmins on the wiki.

Overview

If you're visually inclined, we have a colorful and easy-to-edit map of our servers here!

Server room

Our servers are in Noyes, the science building that predates the CST. For general information about the server room and how to use it, check out this page.

Columns: machine name, IPs, type (virtual, metal), purpose, dies, cores, RAM

Compute Resources

CS machines and VMs
Machine name 159 Ip Address 10Gb Ip address Operating System Metal or Virtual Description RAM
Bowie 159.28.22.5 10.10.10.15 Debian 9 Metal hosts and exports user files; Jupyterhub; landing server 198 GB
Smiley 159.28.22.251 10.10.10.252 Ubuntu 18.04 Metal VM host, not accessible to regular users 156 GB
Web 159.28.22.2 10.10.10.200 Ubuntu 18.04 Virtual Website host 8 GB
Auth 159.28.22.39 No 10Gb internet CentOS 7 Virtual host of LDAP user database 4 GB
Code 159.28.22.42 10.10.10.42 Ubuntu 18.04 Virtual Gitlab host 8 GB
Net 159.28.22.1 10.10.10.100 Ubuntu 18.04 Virtual network administration host for CS 4 GB
Central 159.28.22.177 No 10Gb internet Debian 9 Virtual ODK Central Host 4 GB
Urey 159.28.22.139 No 10Gb internet XCP-ng Metal Sysadmin Sandbox Environment 16 GB
Cluster machines
Machine name 159 Ip Address 10Gb Ip address Operating System Metal or Virtual Description RAM
Hopper 159.28.23.1 10.10.10.1 Debian 10 Metal landing server, NFS host for cluster 64 GB
Lovelace 159.28.23.35 10.10.10.35 CentOS 7 Metal Large compute server 96 GB
Pollock 159.28.23.8 10.10.10.8 CentOS 7 Metal Large compute server 131 GB
Bronte 159.28.23.140 No 10Gb internet CentOS 7 Metal Large compute server 115 GB
Sakurai 159.23.23.3 10.10.10.3 Debian 10 Metal Runs Backup 12 GB
Miyamoto 159.28.23.45 No 10Gb currently Debian 10 Metal Runs Backup 16 GB
HopperPrime 159.28.23.142 10.10.10.142 Debian 10 Metal Runs Backup 16 GB
Monitor 159.28.23.250 No 10Gb internet Debian 11 Metal Server Monitoring 8 GB
Layout 0 159.28.23.2 10.10.10.2 CentOS 7 Metal Head Node 32 GB
Layout 1 None None CentOS 7 Metal Compute Node 32 GB
Layout 2 None None CentOS 7 Metal Compute Node 32 GB
Layout 3 None None CentOS 7 Metal Compute Node 32 GB
Layout 4 None None CentOS 7 Metal Compute Node 32 GB
Whedon 0 159.28.23.4 No 10Gb internet CentOS 7 Metal Head Node 256 GB
Whedon 1 None None CentOS 7 Metal Compute Node 256 GB
Whedon 2 None None CentOS 7 Metal Compute Node 256 GB
Whedon 3 None None CentOS 7 Metal Compute Node 256 GB
Whedon 4 None None CentOS 7 Metal Compute Node 256 GB
Whedon 5 None None CentOS 7 Metal Compute Node 256 GB
Whedon 6 None None CentOS 7 Metal Compute Node 256 GB
Whedon 7 None None CentOS 7 Metal Compute Node 256 GB
Hamilton 0 159.28.23.5 No 10Gb internet Debian 11 Metal Head Node 128 GB
Hamilton 1 None None Debian 11 Metal Compute Node 256 GB
Hamilton 2 None None Debian 11 Metal Compute Node 256 GB
Hamilton 3 None None Debian 11 Metal Compute Node 256 GB
Hamilton 4 None None Debian 11 Metal Compute Node 256 GB
Hamilton 5 None None Debian 11 Metal Compute Node 256 GB
Lab machines
Machine name 159 Ip Address Location Operating System RAM
Borg 159.28.22.10 Turing (CST 222) Ubuntu 20 16 GB
Gao 159.28.22.11 Turing (CST 222) Ubuntu 20 8 GB
Snyder 159.28.22.12 Turing (CST 222) Ubuntu 20 8 GB
Goldwasser 159.28.22.13 Lovelace (CST 219) Ubuntu 20 8 GB
Bartik 159.28.22.14 Lovelace (CST 219) Ubuntu 20 8 GB
Wilson 159.28.22.15 Lovelace (CST 219) Ubuntu 20 8 GB
Bilas 159.28.22.16 Lovelace (CST 219) Ubuntu 20 8 GB
Johnson 159.28.22.17 Lovelace (CST 219) Ubuntu 20 8 GB
Graham 159.28.22.14 Lovelace (CST 219) Ubuntu 20 8 GB

CS Machine Address List

bowie.cs.earlham.edu smiley.cs.earlham.edu web.cs.earlham.edu auth.cs.earlham.edu code.cs.earlham.edu net.cs.earlham.edu central.cs.earlham.edu urey.cs.earlham.edu

Cluster Machine Address List

hopper.cluster.earlham.edu lovelace.cluster.earlham.edu pollock.cluster.earlham.edu bronte.cluster.earlham.edu sakurai.cluster.earlham.edu miyamoto.cluster.earlham.edu hopperprime.cluster.earlham.edu monitor.cluster.earlham.edu whedon.cluster.earlham.edu layout.cluster.earlham.edu hamilton.cluster.earlham.edu

Lab Machine Address List

borg.cs.earlham.edu gao.cs.earlham.edu snyder.cs.earlham.edu goldwasser.cs.earlham.edu bartik.cs.earlham.edu wilson.cs.earlham.edu bilas.cs.earlham.edu johnson.cs.earlham.edu graham.cs.earlham.edu

Specialized resources

Specialized computing applications are supported on the following machines:

Network

We have two network fabrics linking the machines together. There are three subdomains.

10 Gb

We have 10Gb fabric to mount files over NFS. Machines with 10Gb support have an IP address in the class C range 10.10.10.0/24 and we want to add DNS to these addresses.

1 Gb (cluster, cs)

We have two class C subnets on the 1Gb fabric: 159.28.22.0/24 (CS) and 159.28.23.0/24 (cluster). This means we have double the IP addresses on the 1Gb fabric that we have on the 10Gb fabric.

Any user accessing *.cluster.earlham.edu and *.cs.earlham.edu is making calls on a 1Gb network.

Intra-cluster fabrics

The layout cluster has an Infiniband infrastructure. Wachowski has only a 1Gb infrastructure.

Power

We have a backup power supply, with batteries last upgraded in 2019 (?). We’ve had a few outages since then and power has held up well.

HVAC

HVAC systems are static and are largely managed by Facilities.

See full topology diagrams here.

A word about what's happening between files and the drives they live on.


New sysadmins

These pages will be helpful for you if you're just starting in the group:

Note: you'll need to log in with wiki credentials to see most Sysadmin pages.

Additional information

These pages contain a lot of the most important information about our systems and how we operate.

Technical docs

Common tasks

Group and institution information