Difference between revisions of "Sysadmin"

From Earlham CS Department
 
(767 intermediate revisions by 28 users not shown)
<big><strong><font color="blue">AHH CRAZY TRANSITION STATE!</font></strong></big>
+
This is the hub for the CS sysadmins on the wiki.
'''Kay is currently going crazy rearranging the wiki and trying to clean up the old information'''.  I apologize if things get moved around on you.  If you can't find something, check the Change Log and see if it got moved, or do an advanced search within the Sysadmin namespace (the regular search doesn't search in here) or come blame me. :)
 
  
== Systems Administration Documentation ==
+
= Overview =
  
 +
[https://docs.google.com/drawings/d/1XaULz5IxXV_BZQjrko3QJ8wV5aXsSTYcSWxxT49OyZk/edit If you're visually inclined, we have a colorful and easy-to-edit map of our servers here!]
  
{| style="border:1px solid black"
+
== Server room ==
|- valign="top"
 
|
 
=== Works in Progress ===
 
* [[Sysadmin:handbook|Handbook (WIP)]]
 
* [[Sysadmin:Looking at DNS and DHCP|Looking at DNS and DHCP]]
 
* [[Sysadmin:Temporary Page | Temporary Page for Wiki Adjustment]]
 
* [[Sysadmin: Upgrading Ports in FreeBSD]]
 
* [[Sysadmin:Fail2Ban on FreeBSD]]
 
* [[Sysadmin:Rebuilding the lovelace kernel]]
 
* [[Sysadmin:Using Xen for Virtualization|Using Xen for Virtualization]]
 
  
<!-- This has to stay as part of the table -->
+
Our servers are in Noyes, the science building that predates the CST. For general information about the server room and how to use it, check out [[Sysadmin:Server Room|this page]].
| style="width:50px;" |
+
 
|
+
Columns: machine name, IPs, type (virtual, metal), purpose, dies, cores, RAM
=== Regular Stuff ===
+
 
* [[Sysadmin:Backup|Backup]] (needs to be updated after new setup)
+
== Compute Resources ==
* [[Sysadmin:Contacting all users|Contacting all users]]
+
 
* [[Sysadmin:AddComputer|Add a computer]]
+
[https://wiki.cs.earlham.edu/index.php/Sysadmin:Computer_Resources Information about our machines and VMs is here!]
* [[Sysadmin:New Sysadmins|Welcoming a new sysadmin to the fold]]
+
 
 +
== Network ==
 +
 
 +
We have two network fabrics linking the machines together. There are three subdomains.
 +
 
 +
=== 10 Gb ===
 +
 
 +
We have a 10Gb fabric for mounting files over NFS. Machines with 10Gb support have an IP address in the private /24 range 10.10.10.0/24, and we want to add DNS entries for these addresses.
 +
 
 +
=== 1 Gb (cluster, cs) ===
 +
 
 +
We have two /24 subnets on the 1Gb fabric: 159.28.22.0/24 (CS) and 159.28.23.0/24 (cluster). This gives the 1Gb fabric twice as many IP addresses as the 10Gb fabric.
 +
 
 +
Any user accessing *.cluster.earlham.edu or *.cs.earlham.edu is making calls on the 1Gb network.
 +
 
 +
=== Intra-cluster fabrics ===
 +
 
 +
The layout cluster has an InfiniBand infrastructure. Wachowski has only a 1Gb infrastructure.
 +
 
 +
== Power ==
 +
 
 +
We have a backup power supply, with batteries last upgraded in 2019 (?). We’ve had a few outages since then and power has held up well.
 +
 
 +
== HVAC ==
 +
 
 +
HVAC systems are static and are largely managed by Facilities.
 +
 
 +
[[Topology|See full topology diagrams here.]]
 +
 
 +
[[Sysadmin:Layers of abstraction for filesystems|A word about what's happening between files and the drives they live on.]]
  
<!-- This has to stay as part of the table -->
+
= New sysadmins =
|}
 
  
 +
These pages will be helpful for you if you're just starting in the group:
  
{| style="border:1px solid black"
+
* [[Sysadmin:New Sysadmins | Welcoming a new sysadmin ]]
|- valign="top"
+
* [[Sysadmin:Troubleshooting|General troubleshooting tips for admins]]
|
+
* [[Sandbox Notes|Sandbox Notes]]
=== Services ===
+
* [[Password managers]]
* [[Sysadmin:User Management|User Management]]
+
* [[Server safety]]
* [[Sysadmin:Services:Databases|Databases]]
+
* [https://code.cs.earlham.edu/sysadmin/ticket-tracker Ticket tracking for current projects]
* [[Sysadmin:Services:Email|Email]]
 
* [[Sysadmin:Services:Apache2|Apache2]]
 
* [[Sysadmin:Services:SystemImager|System Imager]]
 
* [[Sysadmin:Services:TracSVN|Trac + svn]]
 
* [[Sysadmin:Services:DNS and DHCP|DNS and DHCP]]
 
* [[Sysadmin:Services:VBox|VirtualBox]]
 
* [[Sysadmin:Services:LVM|LVM]]
 
* [[Sysadmin:Services:Printers|Printers]]
 
  
<!-- This has to stay as part of the table -->
+
Note: you'll need to log in with wiki credentials to see most Sysadmin pages.
| style="width:50px;" |
 
|
 
=== Servers ===
 
* [[Sysadmin:SvcChart|Service Chart]]
 
* [[Sysadmin:Quark | Quark]]
 
* [[Sysadmin:Forty-Two | Forty-two]]
 
* [[Sysadmin:Lovelace | Lovelace]]
 
* [[Sysadmin:Proto | Proto]]
 
* [[Sysadmin:RetiredServers | Retired Servers]]
 
  
<!-- This has to stay as part of the table -->
+
= Additional information =
| style="width:50px;" |
 
|
 
=== ACL Workstations ===
 
* [[Sysadmin:ACL:Installation|ACL Installation procedure]]
 
* [[Sysadmin:AclImage|ACL Package Information]]
 
* [[Sysadmin:Acl Locations|ACL Locations]]
 
* [[Sysadmin:Software for Chemistry ACLs|Software for Chemistry ACLs]]
 
* [[Sysadmin:ACL:UpProp|Proposed ACL Update policy]]
 
  
<!-- This has to stay as part of the table -->
+
These pages contain a lot of the most important information about our systems and how we operate.
|}
 
  
 +
===Handy Tools===
 +
* [http://monitor.cluster.earlham.edu:8088/packages Porter's Package Explorer]
  
{| style="border:1px solid black"
+
===Technical docs===
|- valign="top"
 
|
 
=== Networking ===
 
* [[Sysadmin:Nagios|Nagios]]
 
* [[Sysadmin:Networking:NetworkLayout|Network Layout (as of 08/2006)]]
 
* [[Sysadmin:Networking:D224 cable plant|D224 cable plant]]
 
* [[Sysadmin:Networking:Fiber plans|Fiber plans]]
 
* [[Sysadmin:Networking:Machine_room|Machine room]]
 
* [[Sysadmin:Networking:PF|PF packet filter (lovelace)]]
 
* [[Sysadmin:Networking:Rack notes|Rack notes]]
 
* [[Sysadmin:Networking:Public|Public Network]]
 
* [[Sysadmin:Networking:NetworkTopo|Old Network Topo Figures]]
 
* [[Sysadmin:Networking:NetworkDiagram|Network layout (May 2007)]]
 
* [[Sysadmin:Networking:Alternate Network Path|Alt Network path]]
 
* [[Sysadmin:Networking:Updating Proto's Timezone Rules| Timezone Rules on Proto]]
 
* [[Sysadmin:UPS Setup]]
 
  
<!-- This has to stay as part of the table -->
+
* [https://code.cs.earlham.edu/sysadmin/ticket-tracker Ticket tracking for current projects]
| style="width:50px;" |
+
* [[Server safety]]
|
+
* [[Sysadmin:Backup|Backup]]
=== Miscellaneous ===
+
* [[Sysadmin:Monitoring | Monitoring ]]
* [[SysadminContactInfo|Contact Information]]
+
* [[Sysadmin:SSH|SSH info relevant to admins]]
* [[Sysadmin:ImportantInfo:PhoneNumbers|Phone Numbers]]
+
* [[Sysadmin:User Management | User Management]] and [[Sysadmin:LDAP|LDAP]] generally
* [[Sysadmin:ImportantInfo:WebSites|Web Sites]]
+
* [[Sysadmin:Jupyterhub Notebook Server|Jupyterhub]] and [[Nbgrader notes|NBGrader]]
* [[Sysadmin:ImportantInfo:AuthenticationInfo|Authentication Information]]
+
* [[Sysadmin:MailStack|Email service]]
* [[Sysadmin:ImportantInfo:PowerFailure|Power Failure]]
+
* [[Sysadmin:XenDocs | Xen Server]]
* [[Sysadmin:ImportantInfo:UPS|UPS]]
+
* [[Sysadmin:NFS|Network File System (NFS)]]
* [[Sysadmin:ImportantInfo:SSLcerts|Generating SSL Certificates]]
+
* [[Sysadmin:Web Servers|Web Servers and Websites]]
* [[Sysadmin:Power draws|Power draws]]
+
* [[Sysadmin:Services:Databases|Databases]]
* [[Sysadmin:ImportantInfo:SunHardware|Working with Sun Hardware]]
+
* [[Sysadmin:DNS & DHCP|DNS and DHCP]]
* [[Sysadmin:Passwords]]
+
* [[Sysadmin:AWS|AWS]]
* Patching
+
* [[Bash_start_up_script|Bash startup scripts]]
** [[LinuxKernelPatching|Linux Kernel Patching]]
+
* [[Sysadmin:VirtualBox | VirtualBox]]
** [[FreeBSDKernelPatching|FreeBSD Kernel Patching]]
+
* [[X Applications]]
* [[Sysadmin:SerialConsoleCableEnds|Cable Ends]]
+
* [[Sysadmin:Services:ClusterOverview|Cluster Overview]] and [[Sysadmin:Ccg-admin|additional details]]
 +
* [[Sysadmin:Firewall|Firewall]] running on babbage.cs.e.e
 +
* [[Sysadmin:Setting_up_Lovelace_Lab_Machines|Setting up Lab Machines]]
 +
 
 +
===Common tasks===
 +
* [[Sysadmin:Recurring Tasks | Recurring tasks - e.g. software updates, hardware replacements]]
 +
* [[Sysadmin:Contacting all users|Contacting all users]]
 +
* [[Reset password]]
 +
* [[Sysadmin:Software installation | Software installation]]
 +
* [[Modules | Installing software under modules ]]  
 +
* [[Sysadmin:AddComputer|Add a computer to CS or cluster domains]]
 +
* [[Senior projects|Supporting senior projects]]
 +
* [[ShutdownProcedure|How to do a planned shutdown and reboot of the system]]
 +
** [[Sysadmin:TestingServices | Testing services]] (after a reboot, upgrade, change in the phase of the moon, etc.)
 +
* [[Sysadmin:Upgrading SSL Certificate | Upgrading SSL Certificates ]]
 +
* [[Sysadmin:Launch at startup|Launch a process at startup]]
 +
* [[Sysadmin:Psql-setup | Set up psql for CS430 students]]
  
<!-- This has to stay as part of the table -->
+
===Group and institution information===
|}
+
* [[Sysadmin:CS-ITS Interoperability|Working with ITS]]
 +
* [[Sysadmin:Recurring spending | Recurring spending ]]
 +
* [[Sysadmin:SlackAndGitLab | Slack and GitLab integration]]

Latest revision as of 09:32, 20 March 2024

This is the hub for the CS sysadmins on the wiki.

Overview

If you're visually inclined, we have a colorful and easy-to-edit map of our servers here!

Server room

Our servers are in Noyes, the science building that predates the CST. For general information about the server room and how to use it, check out this page.

Columns: machine name, IPs, type (virtual, metal), purpose, dies, cores, RAM

Compute Resources

Information about our machines and VMs is here!

Network

We have two network fabrics linking the machines together. There are three subdomains.

10 Gb

We have a 10Gb fabric for mounting files over NFS. Machines with 10Gb support have an IP address in the private /24 range 10.10.10.0/24, and we want to add DNS entries for these addresses.

1 Gb (cluster, cs)

We have two /24 subnets on the 1Gb fabric: 159.28.22.0/24 (CS) and 159.28.23.0/24 (cluster). This gives the 1Gb fabric twice as many IP addresses as the 10Gb fabric.

Any user accessing *.cluster.earlham.edu or *.cs.earlham.edu is making calls on the 1Gb network.
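
The address arithmetic above can be checked with a short Python sketch. It uses only the standard-library ipaddress module and the three subnets named on this page. The labels, the helper names fabric_for and address_count, and the sample host address are assumptions made for illustration, not part of our actual tooling.

```python
import ipaddress

# Subnets as listed on this page; the labels are illustrative shorthand.
SUBNETS = {
    "10Gb (NFS)": ipaddress.ip_network("10.10.10.0/24"),
    "1Gb (CS)": ipaddress.ip_network("159.28.22.0/24"),
    "1Gb (cluster)": ipaddress.ip_network("159.28.23.0/24"),
}


def fabric_for(address):
    """Return the label of the subnet containing address, or None."""
    ip = ipaddress.ip_address(address)
    for label, network in SUBNETS.items():
        if ip in network:
            return label
    return None


def address_count(prefix):
    """Total addresses across all subnets whose label starts with prefix."""
    return sum(
        network.num_addresses
        for label, network in SUBNETS.items()
        if label.startswith(prefix)
    )


if __name__ == "__main__":
    # Each /24 holds 256 addresses, including network and broadcast.
    print("10Gb fabric addresses:", address_count("10Gb"))
    print("1Gb fabric addresses:", address_count("1Gb"))
    # Hypothetical host address, used only to show the lookup.
    print("159.28.23.7 is on:", fabric_for("159.28.23.7"))
```

Note that num_addresses counts the network and broadcast addresses too, so each /24 has 254 assignable host addresses. The two-to-one ratio between the fabrics holds either way.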

Intra-cluster fabrics

The layout cluster has an InfiniBand infrastructure. Wachowski has only a 1Gb infrastructure.

Power

We have a backup power supply, with batteries last upgraded in 2019 (?). We’ve had a few outages since then and power has held up well.

HVAC

HVAC systems are static and are largely managed by Facilities.

See full topology diagrams here.

A word about what's happening between files and the drives they live on.

New sysadmins

These pages will be helpful for you if you're just starting in the group:

Note: you'll need to log in with wiki credentials to see most Sysadmin pages.

Additional information

These pages contain a lot of the most important information about our systems and how we operate.

Handy Tools

Technical docs

Common tasks

Group and institution information