Collider in RCT Subgroup Analysis
I would tend to agree with you that the enrollment gate selects a coherent biological data generating process (BDGP) because COVID-19 is one disease, and that the subgroup gate preserves causal integrity randomized controlled trial (CIR) status because oxygen requirement proxies the underlying inflammatory state.
The third question, which is what I raised, is whether conditioning on this proxy introduces collider bias. This is not addressed in your framework. Exchangeability within oxygen strata supports internal causal identification, meaning that within each oxygen group, the dexamethasone and no-dexamethasone patients are comparable. But internal causal identification within a stratum defined by a proxy for the treatment’s own mechanism is not the same as clean causal identification of the treatment effect itself.
The most important gap I feel is that you don’t consider what I would call the treatment pathway problem. If dexamethasone works by suppressing inflammation, and oxygen requirement reflects how far that inflammation has already progressed, then the greatest causal contribution of dexamethasone should be in patients not yet requiring oxygen, where the drug intercepts the inflammatory process earliest, before irreversible lung injury has occurred. Yet RECOVERY’s subgroup analysis shows no benefit in that group. This means the subgroup gate may have systematically misidentified where the biological process actually generates the treatment response. This is not a cause agnostic randomized trial (CAR) problem at the main enrollment gate, rather it is a causal inversion at the subgroup gate that your CIR framework as currently presented can probably not detect.
Regarding your statement that RECOVERY is Grade A evidence, the overall intention-to-treat estimate, which is that dexamethasone reduces mortality in hospitalized COVID-19 patients, is probably Grade A and reflects a solid CIR finding. But grading the subgroup finding as Grade A requires accepting three things: first, that oxygen status is causally independent of dexamethasone’s mechanism; second, that conditioning on it doesn’t introduce collider bias; and third, that the subgroup treatment effect is transportable to other settings and populations. None of these are established by the trial design itself.
Your framework is a genuine advance on standard thinking about trial design and how treatment effect estimates should be interpreted. The CIR versus CAR distinction, and the concept of a synthetic data generating process (SDGP), where the enrollment gate artificially aggregates patients from distinct biological disease processes rather than selecting a single coherent one, are valuable and clarifying contributions. Your cause-mixture paradox, where treatment effects reverse or disappear across trials simply because the mixture of underlying diseases at the enrollment gate has shifted, is particularly important for understanding why trials of the same drug sometimes contradict each other.
However, the framework may benefit from the discussion in this thread and I do not want us to change focus back to your framework as that is how this thread started: Asking the question that even within a coherent single-disease biological process, a subgroup gate defined by a variable that proxies the mediator of treatment effect , the biological pathway through which the treatment works, cannot guarantee CIR status at the subgroup level, regardless of how physiologically plausible that proxy appears. This is because conditioning on a mediator proxy may artificially partition what is actually a single continuous biological process into apparent subgroups that do not correspond to distinct natural disease states. In other words, a coherent biological process at the main gate can generate what amounts to a synthetic data generating process (SDGP) at the subgroup level, not through disease mixture as you describe, but through causal pathway mixture induced by the subgroup gate itself. The cause-mixture paradox you identify across trials may have a precise analogue within a single trial when subgroup gates induce causal pathway mixture
This is not a contradiction of your framework. It is a natural extension of your own estimand logic applied one level deeper, to the subgroup gate rather than the main enrollment gate. However I would propose defining such a third variable (e.g. O2 status in RECOVERY) as an effect modifier if it can be thought to be a pre-randomization proxy for the mediator’s baseline state (hyperinflammatory state), a variable that appears to be a standard pre-randomization effect modifier but is in fact a proxy for the baseline state of the biological mediator through which treatment operates. This distinction matters because standard effect modifiers that are causally exogenous to the treatment mechanism can be safely used to define subgroups within a CIR (I do not know if these truly exist and that is another question for you!). Mediator proxy effect modifiers cannot because conditioning on them may induce collider bias and artificially partition what is actually a single continuous biological process into apparent subgroups that do not correspond to distinct causal states.
The treatment pathway approach I proposed, comparing each treatment-defined pathway against a single undivided control arm, with balance achieved through randomization, is intended as a practical solution to this problem, avoiding conditioning on the mediator proxy entirely. However there are clearly more thoughts needed on this, and I would be very keen to hear your views on whether treatment pathway analysis could serve as a viable alternative to subgroup analysis in RCTs where mediator proxy effect modification is suspected. The broader question of where to draw the line between a safely exogenous prognostic variable and a mediator proxy effect modifier remains open and is perhaps the most important unresolved issue this discussion has surfaced.
Addendum
The assumption that a subgroup-defining variable is causally exogenous to the treatment mechanism is required for safe subgroup analysis within a CIR, but this assumption cannot be guaranteed for any strongly prognostic biological variable and should be treated as a testable hypothesis rather than a default. Indeed, If a variable is truly exogenous to the treatment mechanism it is unlikely to be a strong effect modifier. If it is a strong effect modifier it is unlikely to be truly exogenous.
Discussion in the ATmosphere