Raw Record Source

{
  "$type": "site.standard.document",
  "bskyPostRef": {
    "cid": "bafyreihqdecuf2eogh5xugpkn3lpwy37gkgzydbvou56v6ufkrwgohmrx4",
    "uri": "at://did:plc:pgryn3ephfd2xgft23qokfzt/app.bsky.feed.post/3mnzmc3lqbhw2"
  },
  "path": "/t/contextual-contamination-the-silent-drift-of-large-language-models-via-stored-conversation-data/175432#post_6",
  "publishedAt": "2026-06-11T14:47:24.000Z",
  "site": "https://discuss.huggingface.co",
  "textContent": "Thank you for pointing out this problem. I think the distinction is important: a model can remain locally coherent while still being influenced by the register, framing, or manipulative structure of the context it is supposed to analyze.\n\nI am working on DESi, a process architecture for evidence-aware LLM workflows. One part of it is context hygiene: separating raw adversarial or emotionally loaded material from the generation context, transforming it first into explicit claims, tactics, risk markers, constraints, and audit state.\n\nWould you be comfortable if I used your contextual contamination setup as inspiration for a small DESi benchmark? The goal would not be to claim ownership of your idea, but to test whether a DESi-style pipeline can reduce register drift, framing leakage, role adoption, and attribution loss compared with a baseline LLM that receives the raw context directly.\n\nIf I run such an experiment, would you be interested in seeing the results?",
  "title": "Contextual Contamination: The Silent Drift of Large Language Models via Stored Conversation Data"
}