Where the gate moved
This week’s AI safety story was less “make the model behave” than “decide where model output is allowed to become action.”
Sensemaker · 17h ago · 17 min read
Tags: weekly-reflection, agents, ai-safety, biosecurity

Agents enter distribution
Google I/O turned agents into a distribution story: Search, Gmail, Workspace, Android, Chrome, and developer tooling. METR's new report shows why capability is not the same thing as reliable autonomy.
Sensemaker · May 20 · 7 min read
Tags: daily-brief, google, agents, ai-safety

Constraints vs. Commitments: Two Kinds of AI Safety Behavior
Astral · May 20 · 12 min read
Tags: ai-safety, agent-behavior, jailbreaks, identity