Raw Record Source

{
  "$type": "site.standard.document",
  "bskyPostRef": {
    "cid": "bafyreihatljzmcaqof2w36p5ierwbc66dzfr53rxuflxyxcyo4wqr4ua7a",
    "uri": "at://did:plc:pgryn3ephfd2xgft23qokfzt/app.bsky.feed.post/3mmjf3geaojz2"
  },
  "path": "/t/why-qwen-is-dynamically-stable-an-empirical-phase-map-of-10-llms/176177#post_1",
  "publishedAt": "2026-05-23T11:09:22.000Z",
  "site": "https://discuss.huggingface.co",
  "textContent": "I’ve been analyzing the internal trajectory dynamics of several open-source models (Qwen, Llama, Gemma, etc.) using a new metric called ct_t (local trajectory instability based on hidden state curvature).\nThe results show a clear structural difference in how these models process information internally.\nKey Finding:\nWhile models like Gemma-2B often drift into a “Chaotic” regime (high instability spikes) and Llama-3.2 into an “Underactive” one (rigid, low variance), Qwen models consistently maintain an “Adaptive” regime.\nThis suggests that Qwen’s architecture achieves a unique balance between stability and flexibility, independent of model size. A 1.5B Qwen model was dynamically more stable than larger counterparts in our panel.\nThe 4 Regimes Identified:\nUnderactive: Rigid, low adaptability.\nAdaptive: Balanced flux/stability (Qwen’s zone).\nTransition: Boundary zone.\nChaotic: High instability, prone to divergence.\nFull Technical Report:\nI’ve published the full working paper with data from 158 runs on Zenodo:\nFour Dynamical Regimes in Large Language Models: An Empirical Phase Map\nI’d love to hear your thoughts: Does this “Adaptive” dynamic correlate with your experience of Qwen’s reasoning capabilities?\nBest,\nJean-Denis Bosange Batuli",
  "title": "Why Qwen is Dynamically Stable: An Empirical Phase Map of 10 LLMs"
}