Incident Description
The dashboard stopped processing traps from Sunday 08:37 UTC until approximately 08:10 UTC on Monday.
The impact of this outage was:
- The production dashboard stopped processing traps, so no alarms were presented to the NOC.
- No alarms or notifications were generated during the outage period.
Incident severity: CRITICAL (complete service outage)
Data loss: YES
Total duration of incident: approximately 23.5 hours (about 1 day)
Timeline
All times are in UTC
| Date | Time | Description |
|---|---|---|
| | 08:36 | LON/AMS ... etc |
| | 08:37 | rmq partition, correlator stopped processing |
| | time ... | notifications |
| | time | oc/josep ... etc (todo) |
| | time | erik restarted rmq (todo) |
Proposed Solution
- ....