Raw Record Source

{
  "$type": "site.standard.document",
  "bskyPostRef": {
    "cid": "bafyreihpz7jp74pqm3f6fxfpikmjk6njd5cx575crfb4fhgaijgz3zwzye",
    "uri": "at://did:plc:wwyqal4cnqhuwyacdj7rqq3n/app.bsky.feed.post/3mmh47zblzz42"
  },
  "path": "/t/relaxing-assumptions-and-targeted-estimands-with-most/28755#post_1",
  "publishedAt": "2026-05-22T12:37:32.000Z",
  "site": "https://discourse.datamethods.org",
  "tags": [
    "RMS Semiparametric Ordinal Longitudinal Model - data analysis / models - Datamethods Discussion Forum"
  ],
  "textContent": "Hi all,\n\nI would very much appreciate your thoughts on an idea before diving into the technical details and potentially getting stuck in tunnel vision. The idea is based on several points, so I’ll set the context first:\n\n  1. I have a background in Generalized Pairwise Comparisons (Win Ratio and alike). I recently submitted a paper with Prof. Harrell and colleagues on the contrast between GPC, DOOR and MOST (Markov Ordinal State Transition model - discussed here RMS Semiparametric Ordinal Longitudinal Model - data analysis / models - Datamethods Discussion Forum). All that to say that I tend to think of Y in RCTs (what happens to patients) as inherently multi-component.\n\n  2. I am drawn to MOST as the most faithful way of capturing Y (setting aside cases where ordering is not meaningful over short intervals). Two aspects appeal to me in particular: it retains more information than approaches such as the Win Ratio, and it does not impose a specific estimand.\n\n  3. I have also recently spent some time on the baseline covariate adjustment literature, including TMLE. My question is not about TMLE per se, but rather about borrowing its philosophy. In my understanding, this consists of two steps:\n\n     * first, modeling a component as flexibly and accurately as possible (in TMLE, the outcome regression for which they allow crazy ML methods if one thinks that helps),\n\n     * then, a “targeting” step, which brings that flexible model back into a framework that yields valid inference for the estimand of interest. It essentially tweaks the initial model as little as possible but in order to be back on the objective (usually the ATE in this literature)\n\n\n\n\nUsually, that literature focuses solely on the X, trying to be “the best” on that part without discussing anywhere the Y.\n\n  4. In practice, I operate in an environment where change has to be incremental. A pragmatic first step, as I see it, is to leave the standard estimands unchanged (e.g., a risk difference at a given time point). I am also working within a frequentist context, with its associated constraints such as multiplicity. While I am aware of Bayesians’ views on this, I am trying to move things forward gradually.\n\n\n\nAgainst that background, here is the idea I would like to explore:\n\n  * First, represent the data using MOST, to capture the patient trajectories as faithfully as possible. The components would be the ones that allow me to go back to primary and secondary in usual trials (along all relevant intercurrent events).\n\n  * Second, fit an intermediate (“parent”) model that is optimized for these trajectories as a whole, and thus naturally introduces structural assumptions that allow information to be borrowed across time points and outcome levels.\n\n  * Third, starting from that parent model, introduce an estimand-specific adjustment: one parent model, but different estimands (primary, secondary, etc.).\n\n\n\n\nThe key point is that I would like this adjustment to remain small: not a complete re-specification, but rather a minimal modification that nudges the model toward the estimand of interest. Conceptually, I imagine this as optimizing something akin to an MSE criterion: accepting a small increase in bias in exchange for a larger reduction in variance due to the structure imposed by the parent model.\n\nThere are two motivations behind this:\n\n  * The assumptions that work well globally (e.g., proportional odds) may not be optimal for a specific estimand. For instance, if the estimand is tied to a cutoff, one might want to relax proportional odds except locally around that cutoff, so that behaviors far from it do not introduce unnecessary bias.\n\n  * At the same time, maintaining a shared parent model and constraining how much it can vary across estimands could offer gains from a multiplicity perspective (which, again, reflects the practical constraints of my setting).\n\n\n\n\nLastly, an imperfect analogy that comes to mind is subgroup analysis: rather than analyzing each subgroup independently or pooling everything, the best approach is often somewhere in between. Here, I see a similar trade-off between a fully unbiased but inefficient approach, and a “targeted” version of MOST that introduces structure and accepts a small bias.\n\nStepping back, the ambition would be to say:\n\ni. I aim to represent Y as faithfully as possible, what actually happens to patients.\nii. I aim to make full use of what is known at baseline (X).\niii. I aim to preserve the estimands that stakeholders are used to, even if more informative alternatives may be possible in the future.\niv. The statistical machinery required to achieve this sits on my side.\n\nBefore taking this further, I would be very interested in your view: does this overall strategy make sense to you, or do you see fundamental issues that would make it untenable?\n\nMany thanks in advance!",
  "title": "Relaxing Assumptions and Targeted Estimands with MOST"
}