Incident description
Wordpress site (wordpress1.geant.org) become unreachable at 12:10 CET
Incident severity: CRITICAL
Data loss: NO
Timeline
Time (CET) | |
---|---|
12:10 | Apache server stop accepting incoming requests |
12:12 | Chris Atherton reported on #it channel that site aac-project.eu is not working correnctly |
12:21 | Konstantin Lepikhov confirmed the issue with worldpress1 site on #devops channel |
12:23 | Dick Visser connected to VM via console and confirmed that network is down (router not reachable) |
12:29 | Massimiliano Adamo have restarted network service inside VM, after that everything started working and network link came up. |
12:30 | Konstantin Lepikhov announced that problem fixed. |
Total downtime: 20 minutes.
Current situation
We're currently investigating the nature of the issue. I could either VMWware network adapter issue or something related network configuration in VMWhare cluster.
Monitoring alerted: YES