- Use alarms on StatusCheckFailed_System to detect system failures.
Status Checks
- Instance Status – health of the VM.
- System Status – health of underlying hardware.
- Attached EBS Status – health of attached volumes.
Recovery Process
- The recovered instance automatically keeps the same private IP, public IP, Elastic IP, metadata, and placement group.
- Can send SNS notifications upon recovery.
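As a rough illustration of how the alarm and the recovery action fit together, the sketch below builds the parameters for a CloudWatch alarm on StatusCheckFailed_System that triggers the EC2 recover action and, optionally, notifies an SNS topic. The helper names, alarm name, period, evaluation count, instance ID, and topic ARN are illustrative assumptions, not values from these notes; it also assumes boto3 is installed and credentials are configured when the alarm is actually created.

```python
def build_recovery_alarm(instance_id, region, sns_topic_arn=None):
    """Build kwargs for CloudWatch put_metric_alarm (hypothetical helper)."""
    # The built-in EC2 recover action is addressed by this ARN format.
    actions = [f"arn:aws:automate:{region}:ec2:recover"]
    if sns_topic_arn:
        # Optionally notify an SNS topic when the alarm fires.
        actions.append(sns_topic_arn)
    return {
        "AlarmName": f"recover-{instance_id}",  # assumed naming scheme
        "Namespace": "AWS/EC2",
        "MetricName": "StatusCheckFailed_System",
        "Dimensions": [{"Name": "InstanceId", "Value": instance_id}],
        "Statistic": "Maximum",
        "Period": 60,            # assumed: evaluate every minute
        "EvaluationPeriods": 2,  # assumed: two failing periods in a row
        "Threshold": 1.0,
        "ComparisonOperator": "GreaterThanOrEqualToThreshold",
        "AlarmActions": actions,
    }


def create_recovery_alarm(instance_id, region, sns_topic_arn=None):
    """Create the alarm in AWS; needs boto3 and valid credentials."""
    import boto3  # imported lazily so the builder works without boto3

    cloudwatch = boto3.client("cloudwatch", region_name=region)
    cloudwatch.put_metric_alarm(
        **build_recovery_alarm(instance_id, region, sns_topic_arn)
    )
```

Keeping the parameter builder separate from the API call makes it possible to inspect or unit-test the alarm definition without touching an AWS account.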
Good to Know
- Alarms can be based on CloudWatch Logs Metric Filters.
- A metric filter turns specific log patterns or extracted values into a metric, which an alarm can then alert on.
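The metric-filter approach could be sketched roughly as follows: a filter counts log events matching a pattern and publishes them as a custom metric, and an alarm then watches that metric. The log group name, filter name, namespace, metric name, pattern, and threshold here are all hypothetical placeholders; creating the resources assumes boto3 and AWS credentials are available.

```python
def build_metric_filter(log_group, pattern="ERROR", metric_value="1"):
    """Build kwargs for CloudWatch Logs put_metric_filter (hypothetical helper)."""
    return {
        "logGroupName": log_group,
        "filterName": "error-count",  # assumed name
        "filterPattern": pattern,     # matches events containing this term
        "metricTransformations": [
            {
                "metricName": "ErrorCount",       # assumed custom metric
                "metricNamespace": "Custom/App",  # assumed namespace
                # "1" counts matches; an extracted field such as
                # "$.latency" can be used with a JSON filter pattern.
                "metricValue": metric_value,
            }
        ],
    }


def build_log_alarm(threshold=5.0):
    """Build kwargs for an alarm on the custom metric above."""
    return {
        "AlarmName": "app-error-count",  # assumed name
        "Namespace": "Custom/App",
        "MetricName": "ErrorCount",
        "Statistic": "Sum",
        "Period": 300,
        "EvaluationPeriods": 1,
        "Threshold": threshold,
        "ComparisonOperator": "GreaterThanOrEqualToThreshold",
        # Periods with no matching log events emit no data points.
        "TreatMissingData": "notBreaching",
    }


def create_log_alarm(log_group, region):
    """Create both resources in AWS; needs boto3 and valid credentials."""
    import boto3  # imported lazily so the builders work without boto3

    logs = boto3.client("logs", region_name=region)
    logs.put_metric_filter(**build_metric_filter(log_group))
    cloudwatch = boto3.client("cloudwatch", region_name=region)
    cloudwatch.put_metric_alarm(**build_log_alarm())
```

The filter and the alarm must agree on the metric name and namespace; otherwise the alarm stays in the insufficient-data state because it is watching a metric that never receives data.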