Awesome
server-responsibilities
You want to host your application on servers provided by someone, then this checklist might help. Who is responsible for what?
Details like SLA (service-level agreements) can be build on top of each topic.
Feedback is welcome.
Focus is on: Virtual Private Servers
Daily
No manual interaction should be needed for the daily usage.
Topic | Responsible | Note |
---|
provide dns server | | |
provide ntp (networking protocol for clock synchronization) server | | |
run dns client | | |
run ntp client | | |
provide database | | |
run database client | | |
run web server | | |
provide RPM/DPKG repository | | |
run mail server on this server | | |
provide smarthost for mail server. | | |
provide IMAP/POP3 services | | |
provide storage server | | |
run storage client | | |
run ssh server | | |
Uninterruptible power supply | | |
Provide access to BIOS | | |
Provide access to boot process | | |
Initial, Expand, Repair
Topic | Responsible | Note |
---|
provide more disc space (if needed) | | |
provide more CPU, RAM, IO | | |
replace hardware parts | | |
upgrade operating system | | |
create new linux users | | |
manage Hardware RAID | | |
manage Software RAID | | |
create Cert (for example for https) | | |
create SSH-server cert | | |
unsure ssh-server cert does not change (on upgrade, or server move) | | |
install new packages (rpm, dpkg) | | |
upgrade packages (fixes) | | |
provide IP addresses | | |
Create DNS entries for IP adresses. | | |
create root password | | |
know root password | | |
manage crontab entries | | |
enable/disable services | | |
Licenses for commercial third-party software | | |
Networking
Topic | Responsible | Note |
---|
provide default gateway | | |
maintain local firewall (running on the server) (default should be "none") | | |
maintain network firewall (running outside the server) | | |
maintain local routes (running on the server) (default should be "none" (only Default-GW)) | | |
Monitor
Topic | Responsible | Note |
---|
enough disk space | | |
enough CPU, RAM, IO | | |
system logs | | |
application logs | | |
Hardware RAID | | |
Software RAID | | |
Mail Queue | | |
Hardware-Sensors (like temperature) | | |
Service Availablity | | |
Filesystem health | | |
Network health | | |
Hardware
Topic | Responsible | Note |
---|
provide power | | |
provide network | | |
stand by hardware | | |
Backup
Topic | Responsible | Note |
---|
responsible for backup | | |
responsible for restore from backup | | |
responsible for checking if restore from backup would work. | | |
responsible for checking uninterruptible power supply | | |
Backup for database. | | |
Misc
Topic | Responsible | Note |
---|
Notify users about sheduled downtime | | |