The Three Models for Mediation and Confounding
This page is part of the University of Colorado-Anschutz Medical Campus’ BIOS 6618 Recitation collection. To view other questions, you can view the BIOS 6618 Recitation collection page or use the search bar to look for keywords.
The Three Models for Mediation and Confounding
In confounding and mediation, we set up similar directed acyclic graphs (DAGs) for our three variables (e.g., the outcome
Regardless of the question, we have equivalent statistical approaches (i.e., models) to evaluate the potential associations:
- Crude Model:
- Adjusted Model:
(note, for confounding notation in our lectures) - Covariate Model:
It turns out, that whether we are trying to examine if the variable in question is a mediator or a confounder, they use the exact same three regression models (see “Equivalence of the Mediation, Confounding and Suppression Effect” by MacKinnon, Krull, and Lockwood).
This may seem a bit counterintuitive, since we’re changing the directions of our (proposed) causal relationship. However, when we fit a regression model, the model itself is evaluating the association of the outcome with given predictor(s) and not necessarily its causality (i.e., cause-effect versus correlation).
Further, we are really interested in trying to disentangle the effect of our primary explanatory predictor (i.e.,
[ {crude} - {adj} = {X} {M} ]
Proportion Mediated versus Operational Definition of Confounding
A further connection can be made to our formula for operational confounding:
(favored by biostatisticians) (favored by epidemiologists)
and the formula for the proportion mediated by the indirect effect:
We see here that the biostatistician-favored operational confounding criterion and the proportion mediated are the same! Again, mathematically the idea is equivalent, but the context and conclusions are very different based the question we are seeking to answer (i.e., is the variable a confounder with respect to our primary explanatory variable