Difference between revisions of "Sysadmin"

From Earlham CS Department
Jump to navigation Jump to search
(Current Projects (updated 13 Oct 16))
 
(325 intermediate revisions by 18 users not shown)
Line 1: Line 1:
__NOTOC__
+
This is the hub for the CS sysadmins on the wiki.
  
= Machines and Brief Descriptions of Services =
+
= Overview =
{| style="float:left; margin-right:2px;"
 
| style="height:40px; width:150px; text-align:center; background-color:#ADDFFF; border-left:solid 5px #ADDFFF; border-top:solid 5px #ADDFFF; border-bottom:solid 1px white; border-right:solid 5px      #ADDFFF; font-size:120%;" | HOME <br> (vm0)
 
|-
 
| style="height:135px; width:150px; background-color:#ADDFFF; border-left:solid 5px #ADDFFF; border-bottom:solid 5px #ADDFFF; border-right:solid 5px #ADDFFF;" | Users <br> SSH <br> NFS
 
|}
 
  
{| style="float:left; margin-right:2px;"
+
[https://docs.google.com/drawings/d/1XaULz5IxXV_BZQjrko3QJ8wV5aXsSTYcSWxxT49OyZk/edit If you're visually inclined, we have a colorful and easy-to-edit map of our servers here!]
| style="height:40px; width:150px; text-align:center; background-color:#54C571; border-left:solid 5px #54C571; border-top:solid 5px #54C571; border-bottom:solid 1px white; border-right:solid 5px #54C571; font-size:120%;" | NET <br> (vm1)
 
|-
 
| style="height:135px; width:150px; background-color:#54C571; border-left:solid 5px #54C571; border-bottom:solid 5px #54C571; border-right:solid 5px #54C571;" | LDAP server <br> [[Sysadmin:DNS & DHCP | DNS]] <br> [[Sysadmin:DNS & DHCP | DHCP]]
 
|}
 
  
{| style="float:left; margin-right:2px;"
+
== Server room ==
| style="height:40px; width:150px; text-align:center; background-color:#E77471; border-left:solid 5px #E77471; border-top:solid 5px #E77471; border-bottom:solid 1px white; border-right:solid 5px #E77471; font-size:120%;" | WEB <br> (vm2)
 
|-
 
| style="height:135px; width:150px; background-color:#E77471; border-left:solid 5px #E77471; border-bottom:solid 5px #E77471; border-right:solid 5px #E77471;" | Mailman <br> [[Sysadmin:Mail Stack | Mail Stack]]<br> Apache2 <br> PostgresQL <br> MySQL <br> Wiki
 
|}
 
  
{| style="float:left; margin-right:2px;"
+
Our servers are in Noyes, the science building that predates the CST. For general information about the server room and how to use it, check out [[Sysadmin:Server Room|this page]].
| style="height:40px; width:150px; text-align:center; background-color:#C38EC7; border-left:solid 5px #C38EC7; border-top:solid 5px #C38EC7; border-bottom:solid 1px white; border-right:solid 5px #C38EC7; font-size:120%;" | TOOLS <br> (vm3)
 
|-
 
| style="height:135px; width:150px; background-color:#C38EC7; border-left:solid 5px #C38EC7; border-bottom:solid 5px #C38EC7; border-right:solid 5px #C38EC7;" | [[Sysadmin:SageNB Server | SageNB Server]] <br> [[Sysadmin:Jupyterhub Notebook Server | Jupyterhub Server]] <br> [[Sysadmin:Software Modules | Software Modules]] <br> NginX
 
|}
 
  
{| style="float:left; margin-right:2px;"
+
Columns: machine name, IPs, type (virtual, metal), purpose, dies, cores, RAM
| style="height:55px; width:150px; text-align:center; background-color:#E3A869; border-left:solid 5px #E3A869; border-top:solid 5px #E3A869; border-bottom:solid 1px white; border-right:solid 5px #E3A869; font-size:120%;" | BABBAGE
 
|-
 
| style="height:135px; width:150px; background-color: #E3A869; border-left:solid 5px #E3A869; border-bottom:solid 5px #E3A869; border-right:solid 5px #E3A869;" | [[Sysadmin:Firewall | Firewall]]
 
|}
 
  
{|
+
== Compute Resources ==
| style="height:55px; width:150px; text-align:center; background-color:#EEDC82; border-left:solid 5px #EEDC82; border-top:solid 5px #EEDC82; border-bottom:solid 1px white; border-right:solid 5px #EEDC82; font-size:120%;" | [[Sysadmin:Servers:Proto | PROTO]]
 
|-
 
| style="height:135px; width:150px; background-color: #EEDC82; border-left:solid 5px #EEDC82; border-bottom:solid 5px #EEDC82; border-right:solid 5px #EEDC82;" | Weather Monitoring <br> GPS/NTP <br> Energy Monitoring
 
|}
 
  
{| style="float:left; margin-right:2px;"
+
[https://wiki.cs.earlham.edu/index.php/Sysadmin:Computer_Resources Machines and VMs related information here!]
| style="height:55px; width:150px; text-align:center; background-color:#0099cc; border-left:solid 5px #0099cc; border-top:solid 5px #0099cc; border-bottom:solid 1px white; border-right:solid 5px      #0099cc; font-size:120%;" | HOPPER
 
|-
 
| style="height:200px; width:150px; background-color:#0099cc; border-left:solid 5px #0099cc; border-bottom:solid 5px #0099cc; border-right:solid 5px #0099cc;" | Users <br> SSH <br> NFS <br> [[Sysadmin:Software Modules | Software Modules]] <br> PostgresQL <br> Wiki <br> Apache2 <br> [[Sysadmin:DNS & DHCP | DNS]] <br> [[Sysadmin:DNS & DHCP | DHCP]]
 
|}
 
  
{| style="float:left; margin-right:2px;"
+
== Network ==
| style="height:55px; width:150px; text-align:center; background-color:#ffdb4d; border-left:solid 5px #ffdb4d; border-top:solid 5px #ffdb4d; border-bottom:solid 1px white; border-right:solid 5px #ffdb4d; font-size:120%;" | DALI
 
|-
 
| style="height:200px; width:150px; background-color:#ffdb4d; border-left:solid 5px #ffdb4d; border-bottom:solid 5px #ffdb4d; border-right:solid 5px #ffdb4d;" | [[Sysadmin:Gitlab | Gitlab]] <br> Backups <br> NginX
 
|}
 
  
{| style="float:left; margin-right:2px;"
+
We have two network fabrics linking the machines together. There are three subdomains.
| style="height:55px; width:150px; text-align:center; background-color:#ff4d94; border-left:solid 5px #ff4d94; border-top:solid 5px #ff4d94; border-bottom:solid 1px white; border-right:solid 5px #ff4d94; font-size:120%;" | AL-SALAM
 
|-
 
| style="height:200px; width:150px; background-color:#ff4d94; border-left:solid 5px #ff4d94; border-bottom:solid 5px #ff4d94; border-right:solid 5px #ff4d94;" | WebMO <br> [[Sysadmin:Software Modules | Software Modules]] <br> Apache2
 
|}
 
  
{| style="float:left; margin-right:2px;"
+
=== 10 Gb ===
| style="height:55px; width:150px; text-align:center; background-color:#39ad39; border-left:solid 5px #39ad39; border-top:solid 5px #39ad39; border-bottom:solid 1px white; border-right:solid 5px #39ad39; font-size:120%;" | LAYOUT
 
|-
 
| style="height:200px; width:150px; background-color:#39ad39; border-left:solid 5px #39ad39; border-bottom:solid 5px #39ad39; border-right:solid 5px #39ad39;" | [[Sysadmin:Jupyterhub Notebook Server | Jupyterhub Server]] <br> [[Sysadmin:Software Modules | Software Modules]] <br> NginX <br> Apache2 <br> WebMO
 
|}
 
  
{| style="float:left; margin-right:2px;"
+
We have 10Gb fabric to mount files over NFS. Machines with 10Gb support have an IP address in the class C range 10.10.10.0/24 and we want to add DNS to these addresses.
| style="height:55px; width:150px; text-align:center; background-color:#0099cc; border-left:solid 5px #0099cc; border-top:solid 5px #0099cc; border-bottom:solid 1px white; border-right:solid 5px #0099cc; font-size:120%;" | BRONTE
 
|-
 
| style="height:200px; width:150px; background-color:#0099cc; border-left:solid 5px #0099cc; border-bottom:solid 5px #0099cc; border-right:solid 5px #0099cc;" |  [[Sysadmin:Software Modules | Software Modules]]
 
|}
 
  
{| style="float:left; margin-right:2px;"
+
=== 1 Gb (cluster, cs) ===
| style="height:55px; width:150px; text-align:center; background-color:#0099cc; border-left:solid 5px #0099cc; border-top:solid 5px #0099cc; border-bottom:solid 1px white; border-right:solid 5px #0099cc; font-size:120%;" | POLLOCK
 
|-
 
| style="height:200px; width:150px; background-color:#0099cc; border-left:solid 5px #0099cc; border-bottom:solid 5px #0099cc; border-right:solid 5px #0099cc;" |  [[Sysadmin:Software Modules | Software Modules]] <br> WebMO <br> NginX
 
|}
 
  
<br><br><br><br><br><br><br><br><br><br><br><br><br>
+
We have two class C subnets on the 1Gb fabric: 159.28.22.0/24 (CS) and 159.28.23.0/24 (cluster). This means we have double the IP addresses on the 1Gb fabric that we have on the 10Gb fabric.
  
= Systems Administration Documentation =
+
Any user accessing *.cluster.earlham.edu and *.cs.earlham.edu is making calls on a 1Gb network.
For old documentation, see: [[Sysadmin:Old | Old Wiki Information]]
 
  
{|
+
=== Intra-cluster fabrics ===
|- valign:"top"
 
|
 
<div style="border:10px solid #E0EAF8; padding:5px; width:230px; height:400px">
 
<div style="background-color:#CEDEF4; padding:5px;">
 
  
=== Admin Tasks ===
+
The layout cluster has an Infiniband infrastructure. Wachowski has only a 1Gb infrastructure.
</div>
+
 
* [[Sysadmin:Nagios | Nagios Monitoring ]]
+
== Power ==
* [[Sysadmin:Shinken | Shinken Monitoring ]]
+
 
* [[Sysadmin:Upgrading SSL Certificate | Upgrading SSL Certificates ]]
+
We have a backup power supply, with batteries last upgraded in 2019 (?). We’ve had a few outages since then and power has held up well.
* [[Sysadmin:User Management | User Management]]
+
 
* [[Sysadmin:Software:Installing Software | Installing Software]]
+
== HVAC ==
* [[Sysadmin:Backup|Backup]]
+
 
* [[Sysadmin:Contacting all users|Contacting all users]]
+
HVAC systems are static and are largely managed by Facilities.
* [[Sysadmin:New Sysadmins | Welcoming a new sysadmin to the fold]]
+
 
* [[Sysadmin:AddComputer|Add a computer]]
+
[[Topology|See full topology diagrams here.]]
* [[Sysadmin:Setting up Lovelace Lab Machines | Setting up Lovelace Lab Machines]]
+
 
 +
[[Sysadmin:Layers of abstraction for filesystems|A word about what's happening between files and the drives they live on.]]
 +
 
 +
= New sysadmins =
 +
 
 +
These pages will be helpful for you if you're just starting in the group:
  
 +
* [[Sysadmin:New Sysadmins | Welcoming a new sysadmin ]]
 +
* [[Sysadmin:Troubleshooting|General troubleshooting tips for admins]]
 +
* [[Sandbox Notes|Sandbox Notes]]
 +
* [[Password managers]]
 +
* [[Server safety]]
 +
* [https://code.cs.earlham.edu/sysadmin/ticket-tracker Ticket tracking for current projects]
  
<!-- This has to stay as part of the formatting -->
+
Note: you'll need to log in with wiki credentials to see most Sysadmin pages.
</div>
 
| style="float:left;" |
 
|
 
<div style="border:10px solid #FFDFFF; padding:5px; width:230px; height:400px;">
 
<div style="background-color:#FFCEFF; padding:5px;">
 
  
=== Services ===
+
= Additional information =
</div>
 
* [[Sysadmin:Services:Apache2|Apache2]]
 
* [[Sysadmin:Services:Databases|Databases]]
 
* [[Sysadmin:DNS & DHCP|DNS and DHCP]]
 
* [[Sysadmin:Services:Virtualization | Virtualization]]
 
* [[Sysadmin:Services:XenServerSetup | Xen Server]]
 
  
<!-- This has to stay as part of the formatting -->
+
These pages contain a lot of the most important information about our systems and how we operate.
</div>
 
| style="float:left;" |
 
|
 
<div style="border:10px solid #F0DDD5; padding:5px; width:230px; height:400px;">
 
<div style="background-color:#E4C0B1; padding:5px;">
 
  
=== Miscellaneous ===
+
===Handy Tools===
</div>
+
* [http://monitor.cluster.earlham.edu:8088/packages Porter's Package Explorer]
* [[SysadminContactInfo| Contact Information]]
 
* [[Sysadmin:ImportantInfo:PhoneNumbers| Phone Numbers]]
 
* [[Sysadmin:ImportantInfo:WebSites| Web Sites]]
 
* [[Sysadmin:ImportantInfo:AuthenticationInfo| Authentication Information]]
 
* [[Sysadmin:ImportantInfo:PowerFailure| Power Failure]]
 
* [[Sysadmin:ImportantInfo:UPS| UPS]]
 
* [[Sysadmin:ImportantInfo:SSLcerts| Generating SSL Certificates]]
 
* [[Sysadmin:Power draws| Power draws]]
 
* [[Sysadmin:ImportantInfo:SunHardware|Working with Sun Hardware]]
 
* [[Sysadmin:Passwords]]
 
* Patching
 
** [[LinuxKernelPatching|Linux Kernel Patching]]
 
* [[Sysadmin:SerialConsoleCableEnds|Cable Ends]]
 
* [[Sysadmin:VirtualizationComparison|NEW Virtualization Comparison]]
 
  
<!-- This has to stay as part of the formatting -->
+
===Technical docs===
</div>
 
| style="float:left;" |
 
|
 
<div style="border:10px solid #D6F8DE; padding:5px; width:230px; height:400px;">
 
<div style="background-color:#BDF4CB; padding:5px;">
 
  
=== Networking ===
+
* [https://code.cs.earlham.edu/sysadmin/ticket-tracker Ticket tracking for current projects]
</div>
+
* [[Server safety]]
* [[Sysadmin:Networking:NetworkLayout|Network Layout (as of 08/2006)]]
+
* [[Sysadmin:Backup|Backup]]
* [[Sysadmin:Networking:D224 cable plant|D224 cable plant]]
+
* [[Sysadmin:Monitoring | Monitoring ]]
* [[Sysadmin:Networking:Fiber plans|Fiber plans]]
+
* [[Sysadmin:SSH|SSH info relevant to admins]]
* [[Sysadmin:Networking:Switches|Switches]]
+
* [[Sysadmin:User Management | User Management]] and [[Sysadmin:LDAP|LDAP]] generally
* [[Sysadmin:Networking:Rack notes|Rack notes]]
+
* [[Sysadmin:Jupyterhub Notebook Server|Jupyterhub]] and [[Nbgrader notes|NBGrader]]
* [[Sysadmin:Networking:Public|Public Network]]
+
* [[Sysadmin:MailStack|Email service]]
* [[Sysadmin:Networking:NetworkTopo|Old Network Topo Figures]]
+
* [[Sysadmin:XenDocs | Xen Server]]
* [[Sysadmin:Networking:NetworkDiagram|Network layout (May 2007)]]
+
* [[Sysadmin:NFS|Network File System (NFS)]]
* [[Sysadmin:Networking:Alternate Network Path|Alt Network path]]
+
* [[Sysadmin:Web Servers|Web Servers and Websites]]
* [[Sysadmin:UPS Setup]]
+
* [[Sysadmin:Services:Databases|Databases]]
 +
* [[Sysadmin:DNS & DHCP|DNS and DHCP]]
 +
* [[Sysadmin:AWS|AWS]]
 +
* [[Bash_start_up_script|Bash startup scripts]]
 +
* [[Sysadmin:VirtualBox | VirtualBox]]
 +
* [[X Applications]]
 +
* [[Sysadmin:Services:ClusterOverview|Cluster Overview]] and [[Sysadmin:Ccg-admin|additional details]]
 +
* [[Sysadmin:Firewall|Firewall]] running on babbage.cs.e.e
 +
* [[Sysadmin:Setting_up_Lovelace_Lab_Machines|Setting up Lab Machines]]
  
<!-- This has to stay as part of the formatting -->
+
===Common tasks===
</div>
+
* [[Sysadmin:Recurring Tasks | Recurring tasks - e.g. software updates, hardware replacements]]
|}
+
* [[Sysadmin:Contacting all users|Contacting all users]]
 +
* [[Reset password]]
 +
* [[Sysadmin:Software installation | Software installation]]
 +
* [[Modules | Installing software under modules ]]
 +
* [[Sysadmin:AddComputer|Add a computer to CS or cluster domains]]
 +
* [[Senior projects|Supporting senior projects]]
 +
* [[ShutdownProcedure|How to do a planned shutdown and reboot of the system]]
 +
** [[Sysadmin:TestingServices | Testing services]] (after a reboot, upgrade, change in the phase of the moon, etc.)
 +
* [[Sysadmin:Upgrading SSL Certificate | Upgrading SSL Certificates ]]
 +
* [[Sysadmin:Launch at startup|Launch a process at startup]]
 +
* [[Sysadmin:Psql-setup | setup psql for cs430 students]]
  
== Current Projects (updated 13 Oct 16) ==  
+
===Group and institution information===
* ~~Groups and LDAP and sudo - James~~
+
* [[Sysadmin:CS-ITS Interoperability|Working with ITS]]
* Amber and VMD - James and Vitalli
+
* [[Sysadmin:Recurring spending | Recurring spending ]]
* Edward's setup - Vitalli
+
* [[Sysadmin:SlackAndGitLab | Slack and GitLab integration]]
* Puppet - James and Vitalii
 
* Bacula - Nirdesh (Kristin)
 
* SSL certificate upgrade and documentation - Kristin
 
* Listserv merging with archives preserved - Nirdesh
 
* Ganglia - Bret
 
* Shenken - Vitalii (Kristin)
 
** latency, UPS
 
* New Layout node - Adam and ?
 
* Provision Sappho (compute) - after Puppet
 
* Provision Kahlo (storage) -
 
** replace broken drive
 
* I2 setup
 
** DTN, storage nodes, head nodes, ports in CST
 
* Provision Whedon (compute) - after Puppet
 
* Shutdown and startup test - tentatively scheduled for a Sunday in mid October
 
* Disk cleaning - Charlie
 
* Done (for now, at least)
 
** Password changing in the CS and cluster domains - Vitalii and James
 

Latest revision as of 08:32, 20 March 2024

This is the hub for the CS sysadmins on the wiki.

Overview

If you're visually inclined, we have a colorful and easy-to-edit map of our servers here!

Server room

Our servers are in Noyes, the science building that predates the CST. For general information about the server room and how to use it, check out this page.

Columns: machine name, IPs, type (virtual, metal), purpose, dies, cores, RAM

Compute Resources

Machines and VMs related information here!

Network

We have two network fabrics linking the machines together. There are three subdomains.

10 Gb

We have 10Gb fabric to mount files over NFS. Machines with 10Gb support have an IP address in the class C range 10.10.10.0/24 and we want to add DNS to these addresses.

1 Gb (cluster, cs)

We have two class C subnets on the 1Gb fabric: 159.28.22.0/24 (CS) and 159.28.23.0/24 (cluster). This means we have double the IP addresses on the 1Gb fabric that we have on the 10Gb fabric.

Any user accessing *.cluster.earlham.edu and *.cs.earlham.edu is making calls on a 1Gb network.

Intra-cluster fabrics

The layout cluster has an Infiniband infrastructure. Wachowski has only a 1Gb infrastructure.

Power

We have a backup power supply, with batteries last upgraded in 2019 (?). We’ve had a few outages since then and power has held up well.

HVAC

HVAC systems are static and are largely managed by Facilities.

See full topology diagrams here.

A word about what's happening between files and the drives they live on.

New sysadmins

These pages will be helpful for you if you're just starting in the group:

Note: you'll need to log in with wiki credentials to see most Sysadmin pages.

Additional information

These pages contain a lot of the most important information about our systems and how we operate.

Handy Tools

Technical docs

Common tasks

Group and institution information