{
"$type": "site.standard.document",
"bskyPostRef": {
"cid": "bafyreib7igwbljwhrosbyxb74fhx2rabzhhwgac6thutabaeal36hwswpe",
"uri": "at://did:plc:25rdn5elo5izoxrmtis34zuk/app.bsky.feed.post/3mpbm7kux3el2"
},
"coverImage": {
"$type": "blob",
"ref": {
"$link": "bafkreiatrd6go55ixdscqxyuuelkk64y7c7oj7flqbr5y6i3wwzi4feqsa"
},
"mimeType": "image/webp",
"size": 254144
},
"path": "/samson_tanimawo/on-call-wellness-protecting-your-engineers-from-burnout-46d9",
"publishedAt": "2026-06-27T13:21:24.000Z",
"site": "https://dev.to",
"tags": [
"sre",
"oncall",
"burnout",
"culture",
"Nova AI Ops",
"https://novaaiops.com"
],
"textContent": "## The On-Call Burnout Epidemic\n\nI watched three senior SREs leave our team in six months. Exit interviews all said the same thing: on-call was unsustainable.\n\nWe were spending $500K+ recruiting replacements for a problem that could have been fixed with $0 and better practices.\n\n## The Warning Signs\n\nBefore someone quits, they show these signals:\n\n 1. **Cynicism in post-mortems** — \"This will never get fixed\"\n 2. **Alert numbness** — Slow to respond, missed pages\n 3. **Vacation avoidance** — \"I can't take time off, who would cover?\"\n 4. **Scope creep rejection** — \"That's not my problem\"\n 5. **Meeting silence** — Previously engaged, now checked out\n\n\n\nIf you see three or more of these in someone on your team, they're already halfway out the door.\n\n## What We Changed\n\n### 1. Hard Cap on Pages\n\nWe set a maximum of 2 pages per 8-hour on-call shift. If someone gets paged more than that, the secondary automatically takes over and the incident is escalated as a process failure.\n\n\n\n on_call_policy:\n max_pages_per_shift: 2\n shift_duration_hours: 8\n overflow_action: \"escalate_to_secondary\"\n overflow_review: \"weekly_ops_review\"\n\n\n### 2. Follow-the-Sun Rotation\n\nWe stopped asking people to be on-call at 3am. With team members across US timezones, we created overlapping business-hours shifts:\n\n\n\n Shift A (Eastern): 6am - 2pm ET\n Shift B (Central): 11am - 7pm CT\n Shift C (Pacific): 2pm - 10pm PT\n Overnight: Managed by alert automation + escalation\n\n\nNobody gets paged between 10pm and 6am unless it's a true P1.\n\n### 3. On-Call Compensation\n\nWe implemented:\n\n * $500 flat fee per on-call week\n * $200 per off-hours page\n * Comp day after any overnight incident > 30 minutes\n * On-call swaps require zero management approval\n\n\n\n### 4. The \"Toil Budget\"\n\nEach engineer gets a toil budget: maximum 30% of their time on operational work. If toil exceeds 30%, they're pulled from on-call until the team automates the excess.\n\n\n\n Weekly toil tracking:\n Alert response: 4 hours\n Manual deployments: 2 hours\n Config updates: 1 hour\n Ad-hoc debugging: 3 hours\n ─────────────────────────\n Total: 10 hours (25% of 40hr week) ✓\n\n\n### 5. Quarterly On-Call Reviews\n\nEvery quarter, we review:\n\n * Pages per person\n * Off-hours disruptions\n * Toil percentages\n * Team sentiment survey\n * Attrition risk signals\n\n\n\n## The Results\n\nMetric | Before | After (6 months)\n---|---|---\nAttrition rate | 40%/year | 8%/year\nPages per shift | 4.7 | 1.2\nOff-hours pages | 12/week | 2/week\nTeam NPS | -15 | +45\nRecruitment cost saved | - | ~$400K/year\n\n## The Key Insight\n\nOn-call wellness isn't a perk. It's a business decision. Replacing a senior SRE costs $150-200K in recruiting, onboarding, and lost productivity. Preventing burnout costs almost nothing.\n\nIf you're looking to reduce on-call toil and protect your team from burnout, check out what we're building at Nova AI Ops.\n\n**Written by Dr. Samson Tanimawo**\nBSc · MSc · MBA · PhD\nFounder & CEO, Nova AI Ops. https://novaaiops.com",
"title": "On-Call Wellness: Protecting Your Engineers from Burnout"
}