Does quitting smoking cause weight gain?
This example is reproduced from Chapter 12 of Causal Inference: What If (the book), by Miguel Hernan. The book is a great and thorough course on Causal Inference, and highly recommended for those who want to get deeply into it.
This case study uses data from the National Health and Nutrition Examination Survey Data Epidemiologic Follow-up Study (NHEFS). It attempts to accurately quantify the effect of smoking cessation on weight gain. In short, do people gain weight after giving up smoking?
We can see that despite some simplifications to make this tutorial easy, we obtain a result close to the reported value given in the book.
Select variable Quit Smoking? as the Treatment.
Select Change in Weight as the Outcome.
Causal Wizard automatically detects the data type in the Treatment column and modifies the user interface to match.
Since quit smoking is provided in the data as a binary value, we don't see the threshold tool this time. Instead, we are asked to specify the value for Control (didn't quit smoking) and Treated (did quit). The defaults are 0 and 1 respectively, which is fine.
This case study uses all the numerical variables in the dataset. Create the graph as shown in the picture below (note: nodes are automatically coloured during the check process; verify that the colours are the same after the Check step to be sure you've added all the same edges).
Basically, every variable is treated as a "confounder" - a cause of both Treatment and Outcome.
In addition, there's one edge directly from Treatment Quit Smoking? to Outcome Change in Weight.
You can always come back to earlier results. If you decided to come back later while following this tutorial, you can find your result by:
Once opened, look at the first row of the findings, which should look something like this:
Note that on Chapter 12, page 158 of "What If" an ATE of 3.4 kg is obtained. This is well within the 95% confidence interval of our simplified model given above.
Another thing worth noting is that the placebo treatment validation test failed - perhaps due to dataset size and noise, but also perhaps due to simplified modelling. These validation tests are quite strict - note that the placebo treatment effect is much smaller, and quite close to zero even though not quite zero.