Notification Type: PROBLEM
Service: Check unit status of wikitech_run_jobs
Host: cloudweb2002-dev
Address: 208.80.153.41
State: CRITICAL
Date/Time: Tue May 3 20:42:22 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Analytics/Systems/Managing_systemd_time…
Acknowledged by :
Additional Info:
CRITICAL: Status of the systemd unit wikitech_run_jobs
Notification Type: RECOVERY
Service: Check systemd state
Host: cloudbackup2001
Address: 10.192.0.130
State: OK
Date/Time: Mon May 2 18:23:34 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
Acknowledged by :
Additional Info:
OK - running: The system is fully operational
Notification Type: PROBLEM
Service: Check unit status of backup_cinder_volumes
Host: cloudcontrol1005
Address: 208.80.154.85
State: CRITICAL
Date/Time: Mon May 2 18:18:13 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Portal:Cloud_VPS/Admin/Runbooks/Check_u…
Acknowledged by :
Additional Info:
CRITICAL: Status of the systemd unit backup_cinder_volumes
Notification Type: PROBLEM
Service: Check systemd state
Host: cloudbackup2001
Address: 10.192.0.130
State: CRITICAL
Date/Time: Mon May 2 18:17:49 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
Acknowledged by :
Additional Info:
CRITICAL - degraded: The following units failed: cinder-backup.service
Notification Type: RECOVERY
Service: Check for VMs leaked by the nova-fullstack test
Host: cloudcontrol1003
Address: 208.80.154.23
State: OK
Date/Time: Mon May 2 17:35:06 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Portal:Cloud_VPS/Admin/Runbooks/Check_f…
Acknowledged by :
Additional Info:
3 instances in the admin-monitoring project
Notification Type: PROBLEM
Service: Check for VMs leaked by the nova-fullstack test
Host: cloudcontrol1003
Address: 208.80.154.23
State: CRITICAL
Date/Time: Mon May 2 17:33:34 UTC 2022
Notes URLs: https://wikitech.wikimedia.org/wiki/Portal:Cloud_VPS/Admin/Runbooks/Check_f…
Acknowledged by :
Additional Info:
10 instances in the admin-monitoring project
Notification Type: PROBLEM
Host: cloudservices1004
State: DOWN
Address: 208.80.154.11
Info: PING CRITICAL - Packet loss = 100%
Date/Time: Mon May 2 13:48:30 UTC 2022
Acknowledged by :