Raw Record Source

{
  "$type": "site.standard.document",
  "bskyPostRef": {
    "cid": "bafyreiblpfxxycmqgxdermpo7v6hqjj6dr6iy4j4xc4lvoj5hw2wozzt2i",
    "uri": "at://did:plc:pgryn3ephfd2xgft23qokfzt/app.bsky.feed.post/3mh372ic2q3z2"
  },
  "path": "/t/a-persistent-8-15-failure-rate-across-domains-evidence-for-an-epistemic-boundary/174264#post_1",
  "publishedAt": "2026-03-14T18:19:09.000Z",
  "site": "https://discuss.huggingface.co",
  "tags": [
    "https://doi.org/10.5281/zenodo.19020581",
    "GitHub - elly99-AI/MarCognity-AI: A research framework for structured LLM evaluation, claim verification and reflective reasoning mechanisms. · GitHub"
  ],
  "textContent": "**LLMs can’t fix their own epistemic blind spots.**\n\nNot with better prompting.\nNot with stricter verification.\nNot with more sophisticated agents.\n\nThe **MarCognity benchmark** reveals a persistent **8–15% failure rate across 8 domains** ,\neven when the system is equipped with a **full metacognitive layer**.\n\nThis residual error is **not noise**.\n\nIt may be an **emergent property of autoregressive generation itself** —\na structural boundary where the epistemic space accessible to a probabilistic model\nis narrower than the space of claims requiring justification.\n\nI call this the **Epistemic Boundary**.\n\n\n\n**Resources:**\n\n• Zenodo : https://doi.org/10.5281/zenodo.19020581\n\n• GitHub : GitHub - elly99-AI/MarCognity-AI: A research framework for structured LLM evaluation, claim verification and reflective reasoning mechanisms. · GitHub",
  "title": "A Persistent 8–15% Failure Rate Across Domains: Evidence for an Epistemic Boundary"
}