Causal Mediation Analysis: An Overview
What is Causal Mediation Analysis?
Causal mediation analysis is a statistical method used to determine how an independent variable (X) influences an outcome variable (Y) through an intermediate variable (M), known as the mediator. This method helps researchers understand the mechanism behind causal effects.
For example, if we want to study how exercise (X) affects heart health (Y), mediation analysis can determine whether the effect is partially or fully explained by an intermediate factor like weight loss (M).
Key Components of Causal Mediation Analysis
- Exposure (X): The independent variable (e.g., Exercise)
- Mediator (M): The variable that transmits part of the effect (e.g., Weight Loss)
- Outcome (Y): The dependent variable (e.g., Heart Health)
The goal is to decompose the total effect of X on Y into:
- Direct Effect (DE): The effect of X on Y that does not pass through M.
- Indirect Effect (IE) / Mediated Effect: The portion of the effect of X on Y that passes through M.
Mathematically, the total effect can be written as:
$$TE = DE + IE$$
Types of Causal Effects in Mediation Analysis
- Natural Direct Effect (NDE): The effect of X on Y when M is held at the value it would naturally take in the absence of the exposure.
- Natural Indirect Effect (NIE): The portion of the effect that occurs through the mediator.
- Total Effect (TE): The combined impact of both direct and indirect effects.
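These three quantities can be written in potential-outcome notation. The block below is a sketch that assumes a binary exposure (x = 1 versus x = 0), which the text above does not require; Y(x, M(x')) is notation introduced here for the outcome that would be observed if X were set to x and M were set to the value it would take under X = x'.

```latex
\begin{align*}
\mathrm{TE}  &= E\big[Y(1, M(1))\big] - E\big[Y(0, M(0))\big] \\
\mathrm{NDE} &= E\big[Y(1, M(0))\big] - E\big[Y(0, M(0))\big] \\
\mathrm{NIE} &= E\big[Y(1, M(1))\big] - E\big[Y(1, M(0))\big] \\
\mathrm{TE}  &= \mathrm{NDE} + \mathrm{NIE}
\end{align*}
```

The last line follows by adding the NDE and NIE expressions: the E[Y(1, M(0))] terms cancel, leaving the total effect.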
Assumptions for Causal Mediation Analysis
- No Unmeasured Confounders: X, M, and Y should not have unmeasured common causes.
- Temporal Order: X must precede M, and M must precede Y.
- No Hidden Feedback Loops: Y should not influence M (causal direction should be clear).
Statistical Approaches for Mediation Analysis
1. Baron & Kenny's Approach (Traditional Regression Method)
- Step 1: Show that X affects Y
- Step 2: Show that X affects M
- Step 3: Show that M affects Y when controlling for X
- Step 4: If the effect of X on Y shrinks significantly (or disappears) after adjusting for M, mediation is present.
2. Sobel Test
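- Tests whether the indirect effect (a × b, where a is the coefficient of X in the model for M and b is the coefficient of M in the model for Y that also includes X) is statistically significant.
- Assumes the sampling distribution of a × b is normal, which may not hold in small samples.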
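A minimal sketch of the first-order Sobel statistic follows. Here a is the X -> M coefficient, b is the M -> Y coefficient adjusted for X, and se_a, se_b are their standard errors; the function name `sobel_test` and the input values are hypothetical and chosen only for illustration.

```python
from math import sqrt
from statistics import NormalDist


def sobel_test(a, se_a, b, se_b):
    """Return the Sobel z statistic and two-sided p-value for the indirect effect a*b."""
    # First-order standard error of the product a*b
    se_ab = sqrt(b ** 2 * se_a ** 2 + a ** 2 * se_b ** 2)
    z = (a * b) / se_ab
    # Two-sided p-value under the normal approximation the test relies on
    p = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p


# Hypothetical coefficient estimates and standard errors
z, p = sobel_test(a=0.5, se_a=0.1, b=0.4, se_b=0.1)
print(f"z = {z:.3f}, p = {p:.4f}")
```

The p-value comes straight from the standard normal distribution, which is where the normality assumption enters.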
3. Causal Inference-Based Approaches (Counterfactual Mediation Analysis)
- Robins & Greenland (1992) and Pearl (2001) developed methods that use counterfactuals to formally define direct and indirect effects.
- These effects are typically estimated with Structural Equation Modeling (SEM) or Generalized Linear Models (GLMs).
- Implemented in software like R (mediation package) and Python (causalml, dowhy).
Example: Causal Mediation Analysis (Using mediation in R or DoWhy in Python)
Python Example with DoWhy
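The sketch below does not call DoWhy. It is a standard-library illustration of the quantities a counterfactual tool such as DoWhy targets: the natural direct and indirect effects. It assumes a binary exposure and fully known structural equations, so the potential outcomes can be computed directly instead of estimated from data. The functions `mediator` and `outcome` and all coefficients are hypothetical.

```python
import random
from statistics import fmean

random.seed(0)


def mediator(x, u_m):
    # Hypothetical structural equation for M: exposure shifts M by 0.8
    return 0.8 * x + u_m


def outcome(x, m, u_y):
    # Hypothetical structural equation for Y, including an X*M interaction
    return 0.5 * x + 0.6 * m + 0.3 * x * m + u_y


# One pair of noise terms per simulated unit, reused across all scenarios
n = 10_000
noise = [(random.gauss(0, 1), random.gauss(0, 1)) for _ in range(n)]


def mean_potential_outcome(x, x_for_m):
    """Average of Y(x, M(x_for_m)): exposure set to x, mediator as under x_for_m."""
    return fmean(outcome(x, mediator(x_for_m, u_m), u_y) for u_m, u_y in noise)


y00 = mean_potential_outcome(0, 0)  # unexposed
y10 = mean_potential_outcome(1, 0)  # exposed, mediator as if unexposed
y11 = mean_potential_outcome(1, 1)  # exposed

nde = y10 - y00  # natural direct effect
nie = y11 - y10  # natural indirect effect
te = y11 - y00   # total effect

print(f"NDE = {nde:.3f}, NIE = {nie:.3f}, TE = {te:.3f}")
```

Because the same noise draws are reused in every scenario, NDE and NIE add up to TE exactly. With real data the structural equations are unknown, and a library has to estimate these contrasts under the assumptions listed earlier.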
Applications of Causal Mediation Analysis
- Epidemiology & Public Health: How smoking (X) affects lung disease (Y) through inflammation (M).
- Economics & Social Sciences: How education (X) affects income (Y) via job skills (M).
- Psychology: How stress (X) influences health (Y) through sleep quality (M).