Assumptions

CategoriesStatistics , Study Design , Causal Inference

Understanding the assumptions used in modelling ensures you can understand when results are trustworthy, and when they may be misleading.

Causal Wizard uses techniques which make numerous assumptions. The most important assumptions are described here. Some assumptions depend on the method of identification or estimation used to generate a result; others are more or less universal.

General Assumptions

  • Data: The data is a representative and unbiased sample of the real-world population being studied. Learn more about bias and confounding.
  • Modelling: The chosen Estimation methods and models are capable of capturing variables' interactions. If unsure, compare results using various methods in Causal Wizard.
  • Domain knowledge: It is assumed that your Study setup accurately captures all material and relevant causal interactions. If you have drawn a Causal Diagram, it should include all relevant variables, without omitting any variables or edges. You might also know this as the Ignorability, unconfoundedness, conditional independence, or selection on observables assumptionLearn about why this is important and how to capture all key interactions. If you are using a Fixed-Effects model, this assumption means you should include all relevant covariates and fixed effects. Unlike the Causal Diagram method, there is no principled way to define which variables are "relevant"!
  • Large Sample Size: Statistical estimators rely on asymptotic properties for valid inference. Therefore, having a large sample size is often necessary to ensure that the estimated coefficients are consistent and asymptotically normally distributed.

Potential Outcomes method Assumptions

The following assumptions apply specifically to results generated using the Potential Outcomes framework:

  • Positivity: There must be sufficient Treated samples which are collectively representative of the Control samples, with the exception of Treated status. This is not merely that there are Treated samples, but also that the values of the variables of those Treated samples must be representative of the range and combinations of values among the Control samples.
  • Ignorability (also known as Exchangeability): All variables which might affect both the Treatment and the Outcome are observable and controlled for. This is achieved during Identification, if your Causal Diagram is complete and accurate.
  • SUTVA (Stable Unit Treatment Value Assumption): The treatment of one sample unit must not affect the outcome of another sample unit. The samples must not interfere with each other.
  • Consistency: This assumption states that an individual's observed outcome under a particular treatment condition corresponds to the potential outcome for that individual under that treatment condition. In other words, if an individual receives treatment, their observed outcome is equal to their potential outcome under treatment; similarly, if they do not receive treatment, their observed outcome is equal to their potential outcome under no treatment.
  • No Unmeasured Confounding: This assumption asserts that there are no unobserved variables that confound the relationship between treatment assignment and potential outcomes. While it's often impossible to completely ensure the absence of unmeasured confounding, researchers typically strive to include as many relevant covariates as possible to mitigate this concern. However, it's also important to only include the right variables in your model. The identification process ensures this, if your Causal Diagram is sufficiently correct and complete.

Panel Data / Fixed-Effects method Assumptions

Causal Wizard now offers a range of Fixed-Effects methods which apply to Panel Data. Panel Data tracks a set of entities over time. Due to these concepts these models, including Difference-in-Differences (DiD), make their own assumptions:

  • Time-invariant Unobserved Heterogeneity: Fixed effects models assume that any unobserved factors that are constant over time and may affect both the treatment assignment and the outcome variable are adequately captured by the fixed effects. This assumption helps control for potential omitted variable bias due to unobserved individual-level characteristics.

  • Strict Exogeneity: This assumption requires that the covariates included in the model are uncorrelated with the error term in the regression equation. In the context of fixed effects models, this means that time-varying factors that affect both treatment assignment and the outcome variable are adequately controlled for through the fixed effects or included as covariates in the model.

  • No Perfect Collinearity: Fixed effects models require that there is no perfect collinearity (correlation or linear association) between the fixed effects and other covariates included in the model. Perfect collinearity would make it impossible to estimate the coefficients of the fixed effects and other covariates separately.

  • Homoscedasticity and Independence of Errors: Like traditional regression models, fixed effects models assume that the errors are homoscedastic (constant variance) and independent across observations. Violations of these assumptions can lead to biased standard errors and incorrect statistical inference.

  • No Serial Correlation: Measurement errors in one period are not correlated with the errors in another period.
  • Common Time Trends: Fixed effects models often assume that there are common time trends across individuals or entities in the absence of treatment. This assumption helps ensure that any observed changes in the outcome variable over time are not solely due to time trends common to all individuals or entities.

  • Parallel Trends Assumption (specific to DiD): In the absence of treatment, the trends in the outcome variables for the treatment group and the control group would have followed parallel paths over time. In other words, any differences in trends between the two groups before the treatment are solely due to random variation and other factors unrelated to the treatment.

  • Common Trend Assumption: Related to the parallel trends assumption, this assumes that there are no differential trends between the treatment and control groups in the pre-treatment period.

  • No Spillover or Contamination Effects: While not as strict as SUTVA, DiD assumes that the treatment effect is isolated to the treated group and does not spill over to the control group or vice versa.

Related articles
In categories