{
"$type": "site.standard.document",
"bskyPostRef": {
"cid": "bafyreih2t7ys3shcdjnm7r7xzfnfnzcfqe54dmk4gh7ko46jm5booxytgi",
"uri": "at://did:plc:pgryn3ephfd2xgft23qokfzt/app.bsky.feed.post/3mpand3kjgwy2"
},
"path": "/t/llm-curving-via-prompting/177166#post_7",
"publishedAt": "2026-06-27T00:52:35.000Z",
"site": "https://discuss.huggingface.co",
"tags": [
"(click for more details)"
],
"textContent": "Yeah. That direction is probably the right one:\n\n* * *\n\nI think the useful part of your reframing is exactly this: keeping the current claim **behavioral for now** , while making the stronger mechanistic claims **testable rather than asserted**.\n\nI cannot promise that I can help at a high technical level. I do not have much compute available, and I do not want to overstate my role. At most, I may be able to help with the documentation / clarification side: making a table of what each metric appears to measure, what its input source seems to be, and what would still be needed for someone else to reproduce it.\n\nI also do not think the next step has to be:\n\n> prove or disprove the whole field interpretation\n\nA more useful next step might be:\n\n> make the measurement layer legible enough that someone else can reproduce, challenge, or extend it without first accepting the interpretation.\n\nThe encouraging part is that several of the figures appear to be derived from hidden-state tensors, not only generated text. So I would not dismiss them as purely rhetorical visualizations. But I would still separate two things:\n\nLayer | Example\n---|---\nNeutral formula / measurement | layer-to-layer hidden-state variation, deep-layer norm statistic, PCA of layer trajectories\nInterpretive label | residual jittering, ontological grip, attractor hold, gravity well, braiding\n\nBoth can coexist. The interpretive names may be useful for intuition, but a technical collaborator will probably need the neutral measurement contract first.\n\nA short version of that contract could look like this:\n\nCurrent label | Neutral measurement name | Likely source | What a collaborator would need\n---|---|---|---\nResidual Jittering / Chaos Force | layer-to-layer hidden-state variation | hidden states | formula, normalization, controls, raw series\nAttractor Hold / Ontological Grip | normalized deep-layer norm statistic | hidden states | layer range, formula, controls\nBalance of Power | overlay of two separately scaled hidden-state summaries | hidden states | raw values, baseline/style-control\nBraided Invariants | PCA view of token/layer hidden trajectories | hidden states | projection params, seed, controls\nManifold Resonance | mid-vs-final layer cosine similarity | hidden states | exact layer indices, controls\nGeometric Density / Gravity Well Depth | SVD/spectral concentration statistic | hidden states | raw spectral values, directionality, controls\nSpecificity Flux | final-layer vector dispersion over steps | hidden states | raw time series, controls\nProbabilistic Drift / Logit Entropy | LM-head projection metrics | hidden states + LM head | exact layers, logits/probs, controls\n\nSo if someone with mechanistic-interpretability experience joins later, the first task does not need to be “evaluate EPE as a theory.” It can be something much smaller:\n\n> reproduce these metrics on one open model, with a baseline prompt, an EPE/curving prompt, and a style-control prompt.\n\nThat is probably much easier to collaborate on.\n\nLonger measurement-contract sketch (click for more details) Minimal collaboration target (click for more details) Implementation details that should probably be recorded (click for more details) Possible technical extension paths (click for more details) Visualization caveat (click for more details)\n\nSo my current practical suggestion would be:\n\n 1. keep the main public claim behavioral for now;\n 2. preserve the field interpretation as a hypothesis or intuition layer;\n 3. make the existing hidden-state-derived metrics legible in neutral terms;\n 4. add baseline/style controls;\n 5. publish raw metric tables and plotting code;\n 6. let a future technical collaborator extend it toward representation comparison, activation steering, or patching.\n\n\n\nThat seems like the smallest useful bridge between the current work and the kind of mechanistic test you are looking for.",
"title": "LLM \"curving\" via prompting"
}