ARCUS-H: Open benchmark for RL behavioral stability under stress (built on SB3)

Hugging Face Forums [Unofficial] March 22, 2026
Thank you. This is the clearest taxonomy framing I’ve seen for this problem, and the distinction between CD (input-side) and VI (feedback-side) for frozen SB3 policies is exactly the argument I needed to make explicit. I’ll implement all of this in a future version.
