ARCUS-H: Open benchmark for RL behavioral stability under stress (built on SB3)
Hugging Face Forums [Unofficial]
March 22, 2026
Thank you. This is the clearest taxonomy framing I’ve seen for this problem, and the distinction between CD (input-side) and VI (feedback-side) for frozen SB3 policies is exactly the argument I needed to make explicit. I’ll implement all of this in a future version.