'As adoption grows, confidence in safeguards must rise with it': Microsoft reveals new tool which can track backdoors in LLMs - and it's hoping this will restore trust in AI across the world
TechRadar | the technology experts [Unofficial]
February 10, 2026
Microsoft introduced a scanner that detects poisoned open-weight language models by analyzing attention behavior, memorization leaks, and trigger flexibility.
Discussion in the ATmosphere