Random Common Cause Refuter

CategoriesCausal Inference , Validation , Statistics , Study Design

One method of refutation in causal inference involves adding a random common cause of both Treatment and Outcome, to see how this affects the estimated effect.

NOTE: Please read this article for a detailed guide to refutation and statistical significance testing in DoWhy, a Python library used in Causal Wizard.

The importance of Refutation

Refutation is a key concept in causal inference, which refers to the process of testing a hypothesis by attempting to prove it false. One way to do this is by adding a random common cause to the model.

Falsifying or refuting an outcome should not been seen as a disappointment:

"The number of scientific papers published every year continues to increase, but scientific knowledge is not progressing at the same rate. Here we argue that a greater emphasis on falsification – the direct testing of strong hypotheses – would lead to faster progress by allowing well-specified hypotheses to be eliminated."

Strict refutation helps to ensure - but does not guarantee - that results are sound and trustworthy.

How Random Common Cause Refuter works

In causal inference, refutation refers to the process of testing a causal hypothesis by attempting to reject it using available data. 

A common cause is a variable that affects both the cause and the effect of a relationship, making it appear as if there is a causal link between them. By adding a random common cause, we introduce a new variable that is unrelated to both the cause and the effect, but that affects them both.

If the original causal hypothesis is correct, then adding a random common cause should not change the relationship between the cause and the effect. However, if the hypothesis is incorrect, then adding a random common cause may reveal a new relationship between the cause and the effect that was previously hidden.

For example, suppose we want to test the hypothesis that smoking causes lung cancer. We might start by looking at the relationship between smoking and lung cancer in a population. If we find a strong association between the two, we might conclude that smoking causes lung cancer.

However, we could also add a random common cause, such as a person's height, to the model. If height is unrelated to smoking and lung cancer, but affects both, then we can use it to test the causal hypothesis. If the relationship between smoking and lung cancer remains the same after controlling for height, then we have stronger evidence that smoking causes lung cancer. However, if the relationship changes, then we may need to reconsider the causal hypothesis.

See this paper for a worked example of the common cause refuter.

Related articles
In categories