{
  "$type": "site.standard.document",
  "bskyPostRef": {
    "cid": "bafyreicvkaj6urec3k7ecc732ylp4e5xjdrzbipejx5h6ojm5cwi5enk7e",
    "uri": "at://did:plc:pgryn3ephfd2xgft23qokfzt/app.bsky.feed.post/3mjpuoxhnx2x2"
  },
  "path": "/t/how-do-you-handle-sensitive-data-pii-in-datasets-before-training-models/175312#post_3",
  "publishedAt": "2026-04-17T20:25:41.000Z",
  "site": "https://discuss.huggingface.co",
  "textContent": "I think it’s a waste of time. I developed a new architecture that fully externalizes all AI memory function. Models are nothing but interchangeable compute power too me. I simply drag and drop what ever I want into it. Hit Digest to memory and then it’s done. The model can pull up any part of it anytime you want. Retraining models for anything = time and money both are wasted on an end result that isn’t perfect and or deterministic.",
  "title": "How do you handle sensitive data (PII) in datasets before training models?"
}