{
"$type": "site.standard.document",
"bskyPostRef": {
"cid": "bafyreicvkaj6urec3k7ecc732ylp4e5xjdrzbipejx5h6ojm5cwi5enk7e",
"uri": "at://did:plc:pgryn3ephfd2xgft23qokfzt/app.bsky.feed.post/3mjpuoxhnx2x2"
},
"path": "/t/how-do-you-handle-sensitive-data-pii-in-datasets-before-training-models/175312#post_3",
"publishedAt": "2026-04-17T20:25:41.000Z",
"site": "https://discuss.huggingface.co",
"textContent": "I think it’s a waste of time. I developed a new architecture that fully externalizes all AI memory function. Models are nothing but interchangeable compute power too me. I simply drag and drop what ever I want into it. Hit Digest to memory and then it’s done. The model can pull up any part of it anytime you want. Retraining models for anything = time and money both are wasted on an end result that isn’t perfect and or deterministic.",
"title": "How do you handle sensitive data (PII) in datasets before training models?"
}