It's always important to look closely at your data and it also helps to be able to quickly check the
contents of particular columns (variables) when drawing your Causal Diagram. Causal Wizard provides
tabular and graphical views of your data
file.
The table view allows sorting and filtering to find particular values or ranges.
Graphical views display selected variables as histograms and a scatter plot allows you to
visualise the
relationship between pairs of variables.
Tabular data views
Univariate data plot - histogram
Bivariate data view - scatter plot
CausalWizard offers you several popular Causal Inference techniques to choose from.
If your data is in Panel Data
format, which tracks the characteristics of
many entities over time, you can use our Fixed-Effects and Difference-in-Differences
models.
You can also use our more powerful and general-purpose Causal Diagram &
Potential
Outcomes models to estimate causal effects on many types of data.
Capturing your domain knowledge in the form of a Causal Diagram is a key part of Causal
Inference and one of the things that enables estimation of causal effects rather than just association or
correlation between variables,
which any predictive model could provide.
Our Causal Diagram editor allows you to draw a Directed Acyclic Graph (DAG) using all the variables
(columns) in your data file.
However, if you prefer not to upload any data, you can just draw the DAG and label the variables yourself.
Causal Wizard also provides tools to define your Outcome variable and treatment groups either by
thresholding a numerical variable or with specific values
(categorical variables).
Together, these configuration options define the experiment and effect you wish to explore.
After drawing the diagram, Causal Wizard will analyse it for you to
determine whether a causal effect can
be measured,
and the methods which are applicable to the system as described. The role of each variable in the analysis
will be illustrated; Causal Wizard will show you which variables you should include and which
variables you must exclude to obtain an unbiased estimate of the causal effect. If you have uploaded
data, Causal Wizard
will also offer to apply the selected method for you.
Causal Diagram (DAG) editor
Treatment groups identification widget
If you have provided data, Causal Wizard can perform a causal effect estimation and analysis for you.
The results will include key findings, assumptions and a description of the modelling
performed.
The app will also provide graphical plots depicting the causal effect on the outcome,
broken down by treatment group (treated, or control). The exact plots shown depend whether your outcome is
numerical or categorical.
A key benefit of causal analysis is the ability to accurately predict counterfactual outcomes.
When suitable analysis methods are provided, Causal Wizard will give you a table listing various
counterfactual
outcomes for different sub-populations and treatment variable status (treated or control).
Outcome distribution plot
Counterfactual outcomes table
Stakeholders often want to confirm that the model uses covariates (input features) as expected, which gives confidence that the system is being modelled correctly. The feature-importance plot shows the sign and magnitude of regression coefficients, which indicate the direction and contribution of each covariate (and the treatment).
Feature importance plot
The app uses a set of statistical tests from the
DoWhy module to estimate the statistical significance of
results
and whether they are robust to challenges such as randomized outcomes, a placebo treatment
or addition of a random, previously unobserved confounder.
Passing these tests provides additional confidence that the estimated effect is significant and robust.
Refutation test results
Many causal inference techniques are based on the concept of Propensity scores.
These methods also enable us to understand the ways in which the observed population differs between treated
and control groups, which is a primary cause of bias in observational studies.
The propensity distribution plot shows these differences between groups, allowing you to verify that
this bias is likely to be manageable after appropriate weighting, matching or sampling.
The covariate
balance plot (also known as a Love plot) shows the differences between treatment groups
before and after weighting by propensity, confirming to what extent these differences have been controlled.
Propensity score distribution plot
Covariate balance plot
In Machine Learning, it is common to validate model performance on a set of samples not available during
model training or fitting. This allow us to estimate how well the model generalizes to unseen data.
Causal Wizard keeps a user-defined percentage or specific rows of your data for this type of validation.
The results are displayed in a range of plots or tables, depending whether your outcome variable is
categorical or numerical.
For numerical outcomes, we show a scatter plot of predicted vs actual outcomes for individual samples
coloured by treatment group. We report key predictive metrics such as r-squared and
root-mean-square error.
For categorical outcomes, we provide a confusion matrix listing all predicted classification outcomes
and summary metrics including Accuracy, F1-score, Precision and Recall.
Actual vs predicted generalization performance plot
Generalization performance metrics
Categorical generalization performance - confusion matrix
Need a new feature? Tell us about it!