Community Ops/Monitoring
From MozillaWiki
Contents
Monitoring Setup
General
Monitoring is decentralized. Incident Response is distributed based on timezone, availability and project knowledge.
Tools
We use a number of services to maintain effective monitoring:
- Pingdom - Checks that our servers are up. Screams if they aren't.
- VictorOps - Incident Response Management. Dispatches alerts to sysadmins and compiles a nice timeline for us to manage incidents.
How to use it
TBD
How to request monitoring
TBD
Monitoring
Tool | Usage | Primary Contact | Secondary Contacts |
---|---|---|---|
Pingdom | Uptime and latency monitoring | mrz | |
VictorOps | Incident Escalation and notifications | tanner | mrz, logan, yousef |
Cloudwatch | Top Level Monitoring of AWS | Same as AWS | |
StatusHub | Dashboard | mrz |