post: Rough overview of the extended downtime from Jan 2023
This commit is contained in:
parent
db3ed9b0bd
commit
54d1fd5821
1 changed files with 50 additions and 0 deletions
50
src/postmortem/2023-01-12_Loss-of-network-access.md
Normal file
50
src/postmortem/2023-01-12_Loss-of-network-access.md
Normal file
|
@ -0,0 +1,50 @@
|
|||
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
|
||||
|
||||
title = '2023-01-12 Loss of Network Access'
|
||||
date = 2023-01-12
|
||||
|
||||
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
|
||||
|
||||
# 2023-01-12 Loss of Network Access Postmortem
|
||||
|
||||
Key people: Brendan (silver).
|
||||
|
||||
## What happened
|
||||
On 2023-01-12 we received an email from ITD stating that 193.1.99.123 was registered as an attack source by Heanet and external access was removed by ITD to contain it.
|
||||
Later the same day it had been upgraded to our entire subnet, the ITD security team saw our servers as a security risk.
|
||||
|
||||
## What was the root cause
|
||||
Unlike other incidents we knew from the outset what the root cause was.
|
||||
One of the Wordpress instances on www.skynet.ie decided its true calling was to become a spambot, this is what caught the attention of Heanet.
|
||||
|
||||
## Restoring network access
|
||||
In order to restore network access ITD had two requirements:
|
||||
|
||||
1. Servers are patched on both an OS & application level.
|
||||
2. Maintained in such a manner to prevent unauthorized access or misuse
|
||||
|
||||
|
||||
## Rebuilding services
|
||||
Due to the age of the software on virtually all machines in place upgrades were neither feasible nor maintainable.
|
||||
|
||||
As a result user data and config files were backed up from the servers that would be reused in the future.
|
||||
Other servers were archived in case of a need to get more data in the future.
|
||||
|
||||
Using the newest server with adequate hard drive bays Proxmox was installed.
|
||||
Various containers were then created to serve the roles needed to run the cluster.
|
||||
The OS chosen for these containers was NixOS, a config based operating system, which allows us to easily update with consistency.
|
||||
|
||||
This process took until the end of April.
|
||||
We were then able to request that ITD open specific ports for servers.
|
||||
Over the summer more services became active.
|
||||
|
||||
|
||||
## Outcomes
|
||||
As a result of all this:
|
||||
|
||||
* We have a far better relationship with ITD
|
||||
* We have reliable systems
|
||||
* We have improved security and access controls.
|
||||
* We have embraced automation
|
||||
* We are far more open and transparent (config is open source)
|
||||
|
Loading…
Reference in a new issue