Causal Diagram

CategoriesCausal Wizard Concept , Graph , Data , Variables , Independence

A causal diagram is a visual representation of the relationships between different variables and their direction of causality in a system or process. Absence of an edge implies no causal relationship.

A causal diagram is a visual representation of the relationships between different variables in a system or process, with arrows indicating the direction of causality (from cause, to effect). It is a tool for understanding cause-and-effect relationships and identifying potential sources of bias in statistical analyses.

Causal diagrams are commonly used in fields such as epidemiology, economics, and social sciences to help researchers understand the complex relationships between different factors that may contribute to a particular outcome. They can also be used to identify potential confounding variables or other sources of bias that may affect the accuracy of statistical analyses.

 What do the arrows in a Causal Diagram mean?

An arrow represents a direct, causal relationship between one variable and another. The arrows in a causal diagram capture the direction of causality. For example, if variable A directly causes variable B, there would be an arrow pointing from A to B. If A causes B indirectly, there will be edges to and from one or more mediating variables.

The absence of an arrow between two variables does not necessarily indicate that there is no relationship between them, only that the relationship is not direct, or not causal (there is often association between variables which do not have a direct causal link).

Causal diagrams can take many different forms, including directed acyclic graphs (DAGs), path diagrams, and structural equation models, among others. They provide a powerful tool for understanding complex systems and identifying potential sources of bias, and are increasingly used in both research and practical applications.

In Causal Wizard, Causal Diagrams are represented by Directed Acyclic Graphs (DAGs). They are used to encode users' prior domain knowledge, so it can be incorporated in analysis.

How to draw a Causal Diagram

When drawing your causal diagram, you should aim to include:

  • All variables materially relevant to the effect you wish to estimate
  • You should consider including variables which are not available in your data, if materially relevant. Create them as unobserved variables.
  • All direct, causal relationships between these variables (do not draw arrows representing only correlation or association)
  • You should not draw arrows representing indirect relationships, unless intermediate (mediating) variables have no effect or are unobserved.

As a result of the above, you should have included the Treatment and Outcome variables. There may or may not be a directed path between them; if not, the causal effect is zero.

Remember, omitting a variable or an arrow is just as important as including one.

What if my Causal Diagram is wrong?

What if I don't know what the graph looks like? What if I get the graph wrong? There are many ways to answer these questions.

We liked this answer:

"you have to accept that your analysis is conditional on the graph you choose, and your conclusions are valid under the assumptions encoded there. In a way, causal inference from observational data is subjective. When you publish a result, you should caveat it with "under these assumptions, this is true". Readers can then dispute and question your assumptions if they disagree."

This means that your Causal Diagram becomes a prior - an assumption - which is clearly documented as part of your research or analysis. Other versions of the Causal Diagram can also be analysed, to help answer questions about what the results would look like under a different set of assumptions  (see below).

There is also some subjectivity in terms of what to include in a Causal Diagram. Some simplification of the real world is always necessary unless you're going to simulate at a Quantum level. There's also expediency - your diagram will be limited by the data which is actually available, although you can include unobserved variables if they're important.

Drawing multiple Causal Diagrams to cover possible scenarios

If you are uncertain as to the true or ideal Causal Diagram, we recommend you use Causal Wizard to create multiple diagrams representing each of the possible graphs. Generate estimates from one or more model types on each diagram, and compare the results.

If your results are unaffected by the different diagrams, you can feel confident in the answer. If your results are materially affected by different diagrams, explore whether there are other studies, existing research or other information you can use to resolve the competing hypotheses.