{
  "$type": "site.standard.document",
  "bskyPostRef": {
    "cid": "bafyreiadkv66dejvgzusi5rkrfpu5p75vuymu5tr6t7owhao432nsbn4my",
    "uri": "at://did:plc:qazk4aclfsyynk7bi2zso4os/app.bsky.feed.post/3ml76krof42y2"
  },
  "coverImage": {
    "$type": "blob",
    "ref": {
      "$link": "bafkreideqnvz7jry3vf5eb2i5cpeedr3oawsxxcx36xyzqt6hdezhnvnie"
    },
    "mimeType": "image/png",
    "size": 1540343
  },
  "description": "Apple's Sleep Score barely moved across a 7-month documented deterioration in my Central Sleep Apnea data. Two years of personal data shows why that matters.",
  "path": "/articles/your-apple-watch-knows-you-have-sleep-apnea-it-just-wont-let-that-affect-your-score/",
  "publishedAt": "2026-05-06T17:02:28.000Z",
  "site": "https://www.techbetweenthelines.com",
  "textContent": "On the morning of March 31st, my Apple Watch Ultra notified me that I may have sleep apnea and should speak with a doctor. It wasn't the first time. The notification is generated by watchOS's FDA-cleared sleep apnea detection feature, which uses the Watch's accelerometer to identify breathing irregularities during sleep. Apple earned that clearance. The feature works.\n\nWhat the notification didn't tell me was what the preceding three nights looked like in the data the Watch had already collected. My blood oxygen averaged 91.4%, 92.0%, and 92.1% across March 29, 30, and 31 the worst three-night cluster in 95 days of continuous tracking. My respiratory rate hit 28.2 breaths per minute on March 30, the second-highest reading in that entire dataset. My heart rate variability collapsed to 22 milliseconds, roughly half what it should be for someone my age. The Watch saw all of it. It connected enough dots to file a clinical-grade alarm. And then, almost certainly, it gave me a Sleep Score that didn't reflect any of it.\n\nI have a formal diagnosis of Central Sleep Apnea. I wear both an Apple Watch Ultra and an Oura Ring. I've spent more than two years collecting data from both, and what that data shows isn't just a gap between two consumer devices. It's a contradiction sitting inside a single piece of hardware, between two systems that have reached opposite conclusions about the same person's health on the same night.\n\nApple built a Sleep Score optimized to make you feel good about your sleep. That's a design choice. For most people, it's probably fine. For anyone with a diagnosed sleep disorder, it can quietly work against the clinical system Apple built right alongside it.\n\n❗\n\nAll health data cited in this article is my own, collected over two years using an Apple Watch Ultra and an Oura Ring Gen 4. I carry a formal diagnosis of Central Sleep Apnea. Apple Health data was pulled directly via HealthKit. Oura data was retrieved from the full app data export via Oura Cloud API. No data was estimated or reconstructed.\n\n## Two Systems, One Device\n\nApple's sleep apnea detection feature arrived with watchOS 11, available on Apple Watch Series 9, Series 10, Ultra 2, and Ultra 3. It uses the accelerometer to detect wrist movements associated with breathing disturbances during sleep, a method Apple validated through clinical trials and submitted to the FDA for de novo clearance as a medical device feature. When the algorithm crosses its confidence threshold over a 30-day observation window, it surfaces a notification telling you to see a doctor.\n\nThat notification is serious. Apple didn't build it to engage you with the Health app. It built it because sleep apnea, left undetected, carries real cardiovascular risk. The FDA clearance exists precisely because the stakes justify regulatory oversight.\n\nThe Sleep Score is a different creature entirely. Introduced alongside the sleep apnea feature in watchOS 11, it distills a night of sleep into a single number between 0 and 100. Apple weights total sleep duration, efficiency, time in each stage, heart rate, and respiratory rate. The goal is clarity. Sleep is complicated, and a single score is easier to act on than a wall of metrics.\n\nThe problem is what that simplification costs when the sleeper has a diagnosed breathing disorder.\n\nApple Watch — two systems, one device\n\nSystem 1 — FDA cleared\n\nSleep Apnea Detection\n\nUses accelerometer data over a 30-day window to detect breathing irregularities. Fires a clinical notification when confidence threshold is crossed.\n\nAccelerometer 30-day window FDA cleared\n\nSystem 2 — Consumer wellness\n\nSleep Score\n\nDistills one night of sleep into a number from 0–100. Weights total duration, efficiency, stage breakdown, heart rate, and respiratory rate.\n\nNightly composite 0–100 scale Score not queryable via HealthKit\n\nThe contradiction\n\nBoth systems run on the same hardware, reading the same sensor data, on the same night. System 1 can determine you have a serious breathing disorder with enough confidence to file a **clinical alert**. System 2 can look at the same night and hand you a **score in the 80s**. Apple has never publicly addressed how these two outputs are reconciled.\n\nApple Watch Ultra 2 / Ultra 3 · watchOS 11 · Data: author's own Apple Health export\n\n## What Two Years of Data Actually Shows\n\nOver 643 nights tracked by the Oura Ring between April 2024 and May 2026, and 95 consecutive nights tracked by Apple Watch from February through May 2026, a picture emerges that no press release walkthrough of either product's features would prepare you for.\n\nThe Oura data alone contains a story worth sitting with. My average sleep score across those 643 nights was 72.2. On 33% of all nights, the score fell below 70. On 69 nights it fell below 60. The lowest single score was 28, recorded on October 8, 2025, the kind of number that in any other context would prompt a follow-up conversation with a clinician.\n\nApple Watch Ultra · Feb 1 – May 6, 2026 · 95 nights\n\nBlood oxygen during sleep — daily average SpO2\n\n3-month avg\n\n93.8%\n\nNormal: 95–100%\n\nNights below 95%\n\n93 / 95\n\n98% of all nights\n\nLongest streak <95%\n\n48 days\n\nConsecutive nights\n\nLowest recorded\n\n91.1%\n\nMar 5, 2026\n\nDaily avg SpO2 95% healthy threshold Sleep apnea notification (Mar 31)\n\n9092949597!Feb 1\n\nSource: Apple Health · HealthKit oxygenSaturation · Author's own data\n\nBut the number that tells the real story isn't the sleep score. It's the Breathing Disturbance Index: the per-hour count of respiratory irregularities Oura tracks throughout each night. Oura's own documentation flags a BDI above 20 as a potential indicator of sleep-disordered breathing. My average across the full two-year dataset was 20.2. That average is itself sitting on the threshold. The distribution underneath it is what matters.\n\n## The Escalation Nobody Scored\n\nFrom April 2024 through April 2025, my BDI averaged 16.5 per night. Elevated for a healthy adult, unsurprising for someone with CSA, but relatively stable. Then something shifted.\n\nFrom May through November 2025, my monthly average BDI never dropped below 28.8. For seven straight months, breathing disturbances averaged 30 per hour. In September 2025, a single night hit a BDI of 68. Twenty-one nights across that stretch exceeded 40. On 67% of the nights in that seven-month window, the BDI crossed Oura's own warning threshold of 20.\n\nThis was a documented clinical deterioration. Not a bad week. Seven months of worsening sleep-disordered breathing, visible in the data, accumulating night after night.\n\nThen in December 2025, the BDI collapsed. From 31.2 in November to 13.2 in December to 8.7 in January 2026. No deliberate intervention that I can identify. Central apnea fluctuates with stress load, cardiovascular changes, sleep position, and factors that often don't announce themselves. Whatever drove the escalation apparently resolved on its own.\n\nApple Watch Ultra · Feb 1 – May 6, 2026\n\nHRV and respiratory rate — 95 nights\n\nAvg HRV (SDNN)\n\n34.5 ms\n\nMedian for age: ~55ms\n\nFeb–Mar avg HRV\n\n29.7 ms\n\nWorst sustained period\n\nApr–May avg HRV\n\n42.2 ms\n\n+12.5ms improvement\n\nAvg resp. rate\n\n21.0 br/min\n\nHealthy ceiling: 20\n\nHRV SDNN (ms) — dashed red line = 40ms poor recovery threshold\n\n1526384960Feb 1\n\nRespiratory rate (br/min) — dashed green line = 20 br/min healthy ceiling\n\n1519242832!Feb 1\n\nSource: Apple Health · heartRateVariabilitySDNN + respiratoryRate · Author's own data\n\nWhat Oura's sleep score did during all of this is instructive. During the escalation period, the average score was 72.6. During the recovery period, it was 74.0. A difference of 1.4 points across a clinical arc that saw BDI drop by more than 20. The score did not track the deterioration. It did not track the recovery. It produced essentially the same number throughout a two-year period in which my breathing during sleep went from manageable to severely disrupted and back again.\n\nTo be clear: Oura isn't completely blind to the problem. My worst nights during the escalation did tend to score lower. The correlation exists. It's just weak, a correlation coefficient of -0.033 between sleep score and BDI across the full dataset, meaning BDI barely moves the needle. On 30% of the 91 nights when my BDI exceeded 30, Oura still scored my sleep above 75. On 14 of those nights, above 80. Neither platform is giving a fully honest accounting. The difference is that Oura doesn't also have an FDA-cleared clinical alarm sitting in the same app, and it doesn't advertise that alarm while simultaneously smoothing over the signals that drive it.\n\n## The Night the Watch Finally Said Something\n\nThe March 31st notification didn't arrive in a vacuum. It arrived during one of the worst physiological stretches in 95 days of Apple Watch data. Three consecutive nights with SpO2 in the low 92s. Respiratory rate peaking at 28.2 breaths per minute, a number more consistent with moderate physical exertion than sleep. HRV floored at 22 milliseconds across all three nights. Every metric the Watch tracks pointed in the same direction for 72 consecutive hours.\n\nThe morning after the notification, my Oura ring scored that night a 47. Readiness: 56. Oura was unambiguous, something was wrong, the body hadn't recovered, the day should be adjusted. The Watch had fired its clinical alarm the morning before. And yet neither system has a mechanism to connect those events in a way visible to the user. The notification happened. The low score happened. The Sleep Score whatever it showed, sat beside both of them, doing its own calculation.\n\nApple Watch Ultra + Oura Ring · March 29 – April 2, 2026\n\nThe worst five-night window in 95 days\n\nApple Watch fired a sleep apnea notification on March 31. Every metric in the dataset bottomed out simultaneously.\n\nMar 29\n\nApple SpO2\n\n91.4%\n\nResp. rate\n\n26.5\n\nHRV\n\n23ms\n\nResting HR\n\n78 bpm\n\nOura ring not worn\n\nMar 30\n\nApple SpO2\n\n92.0%\n\nResp. rate\n\n28.2\n\nHRV\n\n22ms\n\nResting HR\n\n84 bpm\n\nOura not worn · 28.2 br/min is the 2nd highest respiratory rate in 95 nights\n\nMar 31\n\nAlert\n\n⚠️\n\nApple Watch: Sleep Apnea Notification\n\n\"Signs of sleep apnea were detected while you were sleeping. You may want to speak with your doctor.\" — Morning of March 31, 2026.\n\nApple SpO2\n\n92.1%\n\nResp. rate\n\n25.9\n\nHRV\n\n30ms\n\nOura not worn\n\nApr 1\n\nRing on\n\nApple SpO2\n\n92.6%\n\nOura score\n\n47\n\nOura readiness\n\n56\n\nApr 2\n\nRecovery\n\nApple SpO2\n\n93.9%\n\nOura score\n\n77\n\nOura readiness\n\n69\n\nSource: Apple Health HealthKit + Oura Cloud API · Author's own data · Diagnosis: Central Sleep Apnea\n\nA score of 47 from Oura isn't a yellow flag. It's a system telling you plainly that last night was bad. Apple's Sleep Score value for the same window is something I can't confirm, because Apple doesn't write Sleep Score values back to HealthKit as queryable data. The score lives inside the Sleep app and doesn't persist in a format that allows historical analysis. That's a design choice worth naming: the metric Apple puts most prominently in front of users is the one it makes hardest to audit over time.\n\n## What the Sensor Gap Adds\n\nComparing Oura and Apple Watch SpO2 on the 34 nights where both devices recorded blood oxygen produces a consistent gap. Oura averaged 95.4% on those nights. Apple averaged 93.9%. A 1.5 percentage point systematic difference, in the same direction, every time.\n\nFinger-based optical sensors like Oura's are generally considered more accurate than wrist-based PPG for blood oxygen measurement. The wrist has lower capillary density, and the Watch's sensor has to contend with movement artifacts and skin contact variability in ways a ring does not. Apple has acknowledged wrist PPG limitations in its own device documentation.\n\nApple Watch Ultra + Oura Ring · 49 matched nights · Feb–May 2026\n\nOura sleep scores against Apple SpO2 — same nights\n\nOura avg score\n\n74.8\n\nRange: 47–90\n\nOura nights below 70\n\n9 nights\n\n18% of matched nights\n\nApple SpO2 avg\n\n93.9%\n\nNormal: 95–100%\n\nSpO2 below 94%\n\n32 / 49\n\n65% of matched nights\n\nOura sleep score (bars, left axis 0–100) Apple SpO2 (line, right axis 90–98%) 95% threshold\n\n0255075100Feb 2Feb 9Feb 15Feb 24Mar 12Apr 1Apr 11Apr 28May 6\n\nSource: Oura Cloud API (daily_sleep) + Apple Health HealthKit (oxygenSaturation) · Author's own data · Diagnosis: Central Sleep Apnea\n\n34 nights with both sensors recording SpO2 · Feb–Apr 2026\n\nThe sensor gap: Oura ring vs Apple Watch SpO2\n\nOura Ring Gen 4 — finger sensor\n\n95.4%\n\nAvg SpO2 · 34 nights\n\nMin: 91.9% · Nights <95%: 11/34\n\n1.5%\n\ngap\n\nApple Watch Ultra — wrist PPG\n\n93.9%\n\nAvg SpO2 · same 34 nights\n\nMin: 91.4% · Nights <95%: 32/34\n\nOura SpO2 Apple SpO2\n\n90929598100Feb 2Feb 7Feb 11Feb 15Feb 22Feb 27Mar 12Mar 27Apr 6\n\nSource: Oura Cloud API (daily_spo2) + Apple Health HealthKit · 34 nights with both sensors recording · Author's own data\n\nIf Apple's sensor reads 1.5 points lower than a more accurate reference device, the SpO2 data feeding into the Sleep Score algorithm is already starting from a depressed baseline. The Watch isn't scoring against what your oxygen saturation actually was. It's scoring against what its wrist sensor estimated, and that estimate runs consistently low. For a healthy sleeper, a 1.5-point gap at the 96-97% range doesn't change much. At 93-94%, where I sit chronically, it matters.\n\n## The Design Philosophy Problem\n\nNone of this means Apple's health engineering is careless. The sleep apnea detection feature is genuinely impressive work, and FDA de novo clearance for a consumer wearable is not a trivial achievement. Apple has invested seriously in turning the Watch into a clinical instrument for specific, high-stakes conditions.\n\nThe problem is the layer sitting on top of that work. The Sleep Score isn't a clinical instrument. It's a consumer engagement feature, designed with consumer psychology in mind. Scores that consistently land in the 60s cause users to disengage. Scores in the 80s keep them opening the app. Apple has every structural incentive to weight the algorithm toward the high end, and the input that would most reliably drag it down for someone with CSA breathing disturbance data the Watch is already collecting is precisely the input that doesn't appear to move the needle.\n\nThat's the contradiction Apple hasn't resolved. They built a feature that says you may be seriously ill. They built it on the same hardware as a feature that says you slept great. Both outputs exist. Only one is designed to keep you engaged with the product. For someone without a prior diagnosis, that dynamic isn't neutral. They receive the notification, feel the appropriate alarm, schedule a sleep study. And then every morning while they wait for that appointment, their Watch hands them a reassuring number that quietly tells them most nights are actually fine.\n\nThe score doesn't intend to undermine the notification. It just does.\n\n## What Better Would Look Like\n\nAcross 643 nights of Oura data, the BDI moved dramatically. My condition visibly worsened for seven months, then recovered. The sleep score barely registered either event. A scoring system that genuinely incorporated breathing disturbance data would have tracked that arc. It would have been lower during the escalation. It would have recovered when the BDI recovered. It would have given me information instead of reassurance.\n\nApple already has the data. The Watch tracks respiratory rate every night. The sleep apnea detection feature processes movement patterns associated with breathing irregularities. The raw ingredients for a more honest score exist inside the device I already wear.\n\nSeveral changes would close the gap meaningfully. Surfacing the breathing disturbance count as a visible nightly metric the way both platforms surface time in each sleep stage would be a start. Allowing Sleep Score to be queried historically through HealthKit would let users actually audit the relationship between their physiology and their scores over time. And when a user has received a sleep apnea notification within the previous 30 days, the Sleep Score algorithm should weight respiratory metrics differently. The Watch already knows the clinical context. It is choosing not to use it in the output users see most.\n\nThe Apple Watch Ultra I wear has sensors precise enough to detect a respiratory event, run it through an FDA-cleared algorithm, and decide whether to file a medical-grade alert. That capability is real. The score that appears alongside it should reflect the same seriousness. Until it does, the Sleep Score isn't just incomplete for someone with a diagnosed sleep disorder. It's working in direct opposition to the feature Apple is most proud of building.",
  "title": "Your Apple Watch Knows You Have Sleep Apnea. It Just Won't Let That Affect Your Score.",
  "updatedAt": "2026-05-06T19:02:28.890Z"
}