{
"$type": "site.standard.document",
"bskyPostRef": {
"cid": "bafyreihd3jbdrqynpl4oeetapxvhfstck4f6el46fnragpeknwhl7hxvji",
"uri": "at://did:plc:pgryn3ephfd2xgft23qokfzt/app.bsky.feed.post/3mipaiaulls72"
},
"path": "/t/can-an-ai-have-its-own-internal-ethics-standard-protocol-for-axiomatic-alignment/174927#post_6",
"publishedAt": "2026-04-04T19:52:50.000Z",
"site": "https://discuss.huggingface.co",
"textContent": "FAllan07:\n\n> I must admit I was expecting more feedback on Hugging Face regarding these issues. The subject seems crucial at a time when alignment is becoming a major safety concern.\n\nAlso to remark upon this specifically, it seems that a major component of the industry as a whole is it’s focused on faster models and more complex models.But it doesn’t really feel like the community as a whole is really focused on these personal biases and internal ethics and stuff like that as of yet.But one of my major points is that, if what I colloquially call “personality” (which is biases, internal ethics, stuff like that), they seem to be an emergent personality in some models. It really feels like we arrived at these completely accidentally, the ones that currently exist. Which means we may not actually know exactly how these biases seem to have emerged, which could mean that by updating models we could lose these biases. I think it’s important to explore where these personalities come from.Which is one of the reasons why I was kind of interested in your topic, because internal ethics is a little bit different from biases. Biases are interesting topics. Ethics are internal guardrails. There are two sides of the same coin so by exploring one you can derive some perspectives on the other.But either is vitally important because if we don’t actually know how these are adopted, then that means there are core components of the system that are becoming emergent that we don’t know how they got there.",
"title": "Can an AI have its own internal Ethics? Standard Protocol for Axiomatic Alignment"
}