Retrospective Causal Inference with Machine Learning Ensembles: An Application to Anti-recidivism Policies in Colombia

2016 ◽  
Vol 24 (4) ◽  
pp. 434-456 ◽  
Author(s):  
Cyrus Samii ◽  
Laura Paler ◽  
Sarah Zukerman Daly

We present new methods to estimate causal effects retrospectively from micro data with the assistance of a machine learning ensemble. This approach overcomes two important limitations in conventional methods like regression modeling or matching: (i) ambiguity about the pertinent retrospective counterfactuals and (ii) potential misspecification, overfitting, and otherwise bias-prone or inefficient use of a large identifying covariate set in the estimation of causal effects. Our method targets the analysis toward a well-defined “retrospective intervention effect” based on hypothetical population interventions and applies a machine learning ensemble that allows data to guide us, in a controlled fashion, on how to use a large identifying covariate set. We illustrate with an analysis of policy options for reducing ex-combatant recidivism in Colombia.
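The ensemble-based counterfactual averaging the abstract describes can be sketched in a few lines: fit several learners on treatment plus covariates, then average their predictions under a hypothetical population intervention that sets treatment for everyone. The data, learners, and variable names below are illustrative assumptions, not the authors' implementation.

```python
# Sketch of a "retrospective intervention effect": average an ensemble's
# counterfactual predictions under a population intervention t=1 vs t=0.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)
n = 2000
x = rng.normal(size=(n, 3))            # identifying covariate set
t = rng.binomial(1, 0.5, size=n)       # observed treatment
y = 2.0 * t + x[:, 0] + rng.normal(scale=0.5, size=n)  # true effect = 2.0

X = np.column_stack([t, x])
models = [LinearRegression(),
          RandomForestRegressor(n_estimators=100, random_state=0)]
for m in models:
    m.fit(X, y)

def mean_prediction(t_value):
    # counterfactual design matrix: everyone assigned t_value
    Xc = np.column_stack([np.full(n, t_value), x])
    return np.mean([m.predict(Xc).mean() for m in models])

intervention_effect = mean_prediction(1) - mean_prediction(0)
# effect is 2.0 by construction; the ensemble estimate should land nearby
```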

2021 ◽  
Vol 5 (1) ◽  
Author(s):  
Lijing Lin ◽  
Matthew Sperrin ◽  
David A. Jenkins ◽  
Glen P. Martin ◽  
Niels Peek

Background: The methods with which prediction models are usually developed mean that neither the parameters nor the predictions should be interpreted causally. For many applications, this is perfectly acceptable. However, when prediction models are used to support decision making, there is often a need for predicting outcomes under hypothetical interventions.
Aims: We aimed to identify published methods for developing and validating prediction models that enable risk estimation of outcomes under hypothetical interventions, utilizing causal inference. We aimed to identify the main methodological approaches, their underlying assumptions, targeted estimands, and potential pitfalls and challenges of using each method. Finally, we aimed to highlight unresolved methodological challenges.
Methods: We systematically reviewed literature published by December 2019, considering papers in the health domain that used causal considerations to enable prediction models to be used for predictions under hypothetical interventions. We included both methodologies proposed in the statistical/machine learning literature and methodologies used in applied studies.
Results: We identified 4919 papers through database searches and a further 115 papers through manual searches. Of these, 87 papers were retained for full-text screening, of which 13 were selected for inclusion. We found papers from both the statistical and the machine learning literature. Most of the identified methods for causal inference from observational data were based on marginal structural models and g-estimation.
Conclusions: There exist two broad methodological approaches for allowing prediction under hypothetical interventions in clinical prediction models: (1) enriching prediction models derived from observational studies with estimated causal effects from clinical trials and meta-analyses, and (2) estimating prediction models and causal effects directly from observational data. These methods require extension to dynamic treatment regimes and consideration of multiple interventions to operationalise a clinical decision support system. Techniques for validating ‘causal prediction models’ are still in their infancy.
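Approach (1) from the conclusions — enriching an observational risk model with an externally estimated treatment effect — can be illustrated with a toy calculation. The risk model coefficients and the trial relative risk below are invented for illustration, not taken from the review.

```python
# Minimal sketch of enriching an observational risk model with a treatment
# effect estimated externally (e.g., a trial relative risk).
import math

def baseline_risk(age, sbp):
    """Toy observational logistic risk model (coefficients assumed)."""
    lp = -5.0 + 0.04 * age + 0.02 * sbp
    return 1 / (1 + math.exp(-lp))

TRIAL_RR = 0.75  # relative risk under treatment, from a hypothetical trial

def risk_under_intervention(age, sbp, treat):
    # prediction under the hypothetical intervention "give treatment"
    r = baseline_risk(age, sbp)
    return min(1.0, r * TRIAL_RR) if treat else r

untreated = risk_under_intervention(65, 160, treat=False)
treated = risk_under_intervention(65, 160, treat=True)
```

The design choice here is pragmatic: the observational model supplies individualized baseline risk, while the causal contrast comes from randomized evidence, sidestepping confounding in the observational data.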


Author(s):  
Peng Cui ◽  
Zheyan Shen ◽  
Sheng Li ◽  
Liuyi Yao ◽  
Yaliang Li ◽  
...  

2021 ◽  
Vol 15 (5) ◽  
pp. 1-46
Author(s):  
Liuyi Yao ◽  
Zhixuan Chu ◽  
Sheng Li ◽  
Yaliang Li ◽  
Jing Gao ◽  
...  

Causal inference has been a critical research topic for decades across many domains, such as statistics, computer science, education, public policy, and economics. Nowadays, estimating causal effects from observational data has become an appealing research direction owing to the large amount of available data and the low budget required, compared with randomized controlled trials. Spurred by the rapid development of machine learning, various causal effect estimation methods for observational data have sprung up. In this survey, we provide a comprehensive review of causal inference methods under the potential outcome framework, one of the well-known causal inference frameworks. The methods are divided into two categories depending on whether they require all three assumptions of the potential outcome framework. For each category, both traditional statistical methods and recent machine-learning-enhanced methods are discussed and compared. Plausible applications of these methods are also presented, including applications in advertising, recommendation, medicine, and so on. Moreover, commonly used benchmark datasets as well as open-source codes are summarized to help researchers and practitioners explore, evaluate, and apply causal inference methods.
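As a concrete instance of a traditional statistical method under the potential outcome framework, here is a minimal inverse-propensity-weighting (IPW) sketch on synthetic data. It assumes unconfoundedness, positivity, and SUTVA, and every number in it is illustrative.

```python
# Inverse-propensity-weighted ATE estimate from (synthetic) observational data.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(1)
n = 5000
x = rng.normal(size=n)
p = 1 / (1 + np.exp(-x))          # true propensity depends on the confounder x
t = rng.binomial(1, p)            # treatment assignment is confounded by x
y = 1.5 * t + x + rng.normal(scale=0.5, size=n)  # true ATE = 1.5

# estimate propensity scores, then reweight to mimic a randomized experiment
ps = LogisticRegression().fit(x.reshape(-1, 1), t).predict_proba(x.reshape(-1, 1))[:, 1]
ate = np.mean(t * y / ps) - np.mean((1 - t) * y / (1 - ps))
```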


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Florent Le Borgne ◽  
Arthur Chatton ◽  
Maxime Léger ◽  
Rémi Lenain ◽  
Yohann Foucher

In clinical research, there is a growing interest in the use of propensity score-based methods to estimate causal effects. G-computation is an alternative because of its high statistical power. Machine learning is also increasingly used because of its possible robustness to model misspecification. In this paper, we aimed to propose an approach that combines machine learning and G-computation when both the outcome and the exposure status are binary and that is able to deal with small samples. We evaluated the performance of several methods, including penalized logistic regressions, a neural network, a support vector machine, boosted classification and regression trees, and a super learner, through simulations. We proposed six different scenarios characterised by various sample sizes, numbers of covariates, and relationships between covariates, exposure statuses, and outcomes. We also illustrated the application of these methods by using them to estimate the efficacy of barbiturates prescribed during the first 24 h of an episode of intracranial hypertension. In the context of G-computation, for estimating the individual outcome probabilities in two counterfactual worlds, we found that the super learner tended to outperform the other approaches in terms of both bias and variance, especially for small sample sizes. The support vector machine performed well, but its mean bias was slightly higher than that of the super learner. In the investigated scenarios, G-computation combined with the super learner was a performant method for drawing causal inferences, even from small sample sizes.
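A hedged sketch of the paper's central combination — G-computation with a stacked ensemble standing in for the super learner — on synthetic binary data. The base learners, meta-learner, and data-generating process are illustrative choices, not the authors' exact configuration.

```python
# G-computation: model P(Y=1 | A, X) with a stacked ensemble, then average
# predictions in the two counterfactual worlds A=1 and A=0.
import numpy as np
from sklearn.ensemble import StackingClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC

rng = np.random.default_rng(2)
n = 1000
x = rng.normal(size=(n, 2))
a = rng.binomial(1, 0.5, n)                       # binary exposure
logit = -1.0 + 1.2 * a + x[:, 0]
y = rng.binomial(1, 1 / (1 + np.exp(-logit)))     # binary outcome

X = np.column_stack([a, x])
sl = StackingClassifier(
    estimators=[("lr", LogisticRegression()),
                ("rf", RandomForestClassifier(n_estimators=50, random_state=0)),
                ("svm", SVC(probability=True, random_state=0))],
    final_estimator=LogisticRegression())
sl.fit(X, y)

def mean_risk(a_val):
    # counterfactual world: everyone's exposure set to a_val
    Xc = np.column_stack([np.full(n, a_val), x])
    return sl.predict_proba(Xc)[:, 1].mean()

marginal_effect = mean_risk(1) - mean_risk(0)     # marginal risk difference
```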


Cancers ◽  
2021 ◽  
Vol 13 (11) ◽  
pp. 2764
Author(s):  
Xin Yu Liew ◽  
Nazia Hameed ◽  
Jeremie Clos

A computer-aided diagnosis (CAD) expert system is a powerful tool for efficiently assisting a pathologist in achieving an early diagnosis of breast cancer. This process identifies the presence of cancer in breast tissue samples and the distinct stage of the cancer. In a standard CAD system, the main process involves image pre-processing, segmentation, feature extraction, feature selection, classification, and performance evaluation. In this review paper, we survey the existing state-of-the-art machine learning approaches applied at each stage, covering both conventional and deep learning methods, compare the methods, and provide technical details with their advantages and disadvantages. The aims are to investigate the impact of CAD systems using histopathology images, to examine deep learning methods that outperform conventional ones, and to provide a summary for future researchers to analyse and improve the existing techniques. Lastly, we discuss the research gaps in existing machine learning approaches and propose direction guidelines for upcoming researchers.
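The standard CAD stages listed above (pre-processing, feature selection, classification, performance evaluation) map naturally onto a pipeline; the sketch below uses synthetic feature vectors as a stand-in for features extracted from histopathology images.

```python
# The conventional CAD stages expressed as an sklearn pipeline.
import numpy as np
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.svm import SVC
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(3)
n = 300
X = rng.normal(size=(n, 20))          # stand-in for extracted image features
y = (X[:, 0] + X[:, 1] + rng.normal(scale=0.5, size=n) > 0).astype(int)  # benign/malignant

cad = Pipeline([
    ("scale", StandardScaler()),               # pre-processing
    ("select", SelectKBest(f_classif, k=5)),   # feature selection
    ("clf", SVC(kernel="rbf")),                # classification
])
scores = cross_val_score(cad, X, y, cv=5)      # performance evaluation
```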


Entropy ◽  
2020 ◽  
Vol 23 (1) ◽  
pp. 18
Author(s):  
Pantelis Linardatos ◽  
Vasilis Papastefanopoulos ◽  
Sotiris Kotsiantis

Recent advances in artificial intelligence (AI) have led to its widespread industrial adoption, with machine learning systems demonstrating superhuman performance in a significant number of tasks. However, this surge in performance has often been achieved through increased model complexity, turning such systems into “black box” approaches and causing uncertainty regarding the way they operate and, ultimately, the way that they come to decisions. This ambiguity has made it problematic for machine learning systems to be adopted in sensitive yet critical domains, where their value could be immense, such as healthcare. As a result, scientific interest in the field of Explainable Artificial Intelligence (XAI), a field that is concerned with the development of new methods that explain and interpret machine learning models, has been tremendously reignited over recent years. This study focuses on machine learning interpretability methods; more specifically, a literature review and taxonomy of these methods are presented, as well as links to their programming implementations, in the hope that this survey will serve as a reference point for both theorists and practitioners.
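As one concrete example of the model-agnostic, post-hoc interpretability methods such surveys cover, the sketch below computes permutation importance — the drop in a model's score when one feature's values are shuffled — on synthetic data.

```python
# Permutation importance: a model-agnostic, post-hoc interpretability method.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance

rng = np.random.default_rng(4)
n = 500
X = rng.normal(size=(n, 5))
y = (X[:, 2] > 0).astype(int)        # by construction, only feature 2 matters

model = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)
result = permutation_importance(model, X, y, n_repeats=10, random_state=0)
most_important = int(np.argmax(result.importances_mean))
```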


2021 ◽  
Author(s):  
Hussain AlBahrani ◽  
Nobuo Morita

Abstract In many drilling scenarios that include deep wells and highly stressed environments, the mud weight required to completely prevent wellbore instability can be impractically high. In such cases, what is known as a risk-controlled wellbore stability criterion is introduced. This criterion allows a certain level of wellbore instability to take place. This means that the mud weight calculated using this criterion will only constrain wellbore instability to a certain manageable level, hence the name risk-controlled. Conventionally, the allowable level of wellbore instability in this type of model has always been based on the magnitude of the breakout angle. However, wellbore enlargements, as seen in calipers and image logs, can be highly irregular in their distribution around the wellbore. This irregularity means that risk-controlling wellbore instability through the breakout angle might not always be sufficient. Instead, the total volume of cavings is introduced as the risk-control parameter for wellbore instability. Unlike the breakout angle, the total volume of cavings can be coupled with a suitable hydraulics model to determine the threshold of manageable instability. The expected total volume of cavings is determined using a machine learning (ML) assisted 3D elasto-plastic finite element model (FEM). The FEM models the interval of interest and provides a description of the stress distribution around the wellbore. The ML algorithm learns the patterns and limits of rock failure in a supervised manner from the wellbore enlargement seen in calipers and image logs of nearby offset wells. Combining the FEM output with the ML algorithm leads to an accurate prediction of shear failure zones. The model is able to predict both the radial and circumferential distribution of enlargements at any mud weight and stress regime, which leads to a determination of the expected total volume of cavings.
The model implementation is first validated against experimental data based on true-triaxial tests of bored core samples. Next, a full dataset from offset wells is used to populate and train the model. The trained model is then used to produce estimates of risk-controlled stability mud weights for different drilling scenarios. The model results are compared against those produced by conventional methods. Finally, both the FEM-ML model results and the conventional methods' results are compared against the drilling experience of the offset wells. This methodology provides a more comprehensive solution to risk-controlling wellbore instability, relying on a novel process that learns rock failure from calipers and image logs.
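The supervised step described above — learning failure patterns from log-derived labels and converting predicted failed cells into a cavings volume — might be sketched as follows. The stress features, failure rule, and cell volume are all illustrative stand-ins, not the authors' FEM-ML model.

```python
# A classifier learns "failed vs intact" labels (as would be derived from
# caliper/image-log enlargements) from per-cell FEM stress features; predicted
# failed cells are then summed into a total cavings volume.
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier

rng = np.random.default_rng(5)
n = 2000
# toy FEM output per near-wellbore cell: max shear stress and confining stress
shear = rng.uniform(0, 100, n)
confining = rng.uniform(0, 50, n)
failed = (shear > 0.8 * confining + 40).astype(int)   # toy failure pattern

clf = GradientBoostingClassifier(random_state=0).fit(
    np.column_stack([shear, confining]), failed)

CELL_VOLUME = 1e-4  # m^3 per finite-element cell (assumed)
pred = clf.predict(np.column_stack([shear, confining]))
cavings_volume = pred.sum() * CELL_VOLUME
```

Coupling this predicted volume with a hydraulics model (not sketched here) would then set the threshold of manageable instability, per the abstract.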


Author(s):  
Bart Jacobs ◽  
Aleks Kissinger ◽  
Fabio Zanasi

Abstract Extracting causal relationships from observed correlations is a growing area in probabilistic reasoning, originating with the seminal work of Pearl and others from the early 1990s. This paper develops a new, categorically oriented view based on a clear distinction between syntax (string diagrams) and semantics (stochastic matrices), connected via interpretations as structure-preserving functors. A key notion in the identification of causal effects is that of an intervention, whereby a variable is forcefully set to a particular value independent of any prior propensities. We represent the effect of such an intervention as an endo-functor which performs ‘string diagram surgery’ within the syntactic category of string diagrams. This diagram surgery in turn yields a new, interventional distribution via the interpretation functor. While in general there is no way to compute interventional distributions purely from observed data, we show that this is possible in certain special cases using a calculational tool called comb disintegration. We demonstrate the use of this technique on two well-known toy examples. In the first, we predict the causal effect of smoking on cancer in the presence of a confounding common cause, and we show that this technique provides simple sufficient conditions for computing interventions that apply to a wide variety of situations considered in the causal inference literature. The second is an illustration of counterfactual reasoning in which the same interventional techniques are used, but now in a ‘twinned’ set-up, with two versions of the world – one factual and one counterfactual – joined together via exogenous variables that capture the uncertainties at hand.
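The smoking example's interventional distribution can be computed numerically from a fully specified model via the truncated factorization (the adjustment formula); the probabilities below are made up for illustration.

```python
# P(cancer | do(smoke=1)) vs observational P(cancer | smoke=1) under a
# confounding common cause. All probabilities are illustrative.
P_conf = {0: 0.7, 1: 0.3}                       # hidden common cause
P_smoke = {(0,): 0.2, (1,): 0.6}                # P(smoke=1 | conf)
P_cancer = {(0, 0): 0.01, (0, 1): 0.05,         # P(cancer=1 | conf, smoke)
            (1, 0): 0.10, (1, 1): 0.30}

# observational: conditioning on smoke=1 shifts the confounder's distribution
num = sum(P_conf[c] * P_smoke[(c,)] * P_cancer[(c, 1)] for c in (0, 1))
den = sum(P_conf[c] * P_smoke[(c,)] for c in (0, 1))
p_obs = num / den

# interventional: cut the conf -> smoke edge, keep conf's prior distribution
p_do = sum(P_conf[c] * P_cancer[(c, 1)] for c in (0, 1))
```

In the paper's terms, cutting the edge is the ‘string diagram surgery’; the arithmetic above is its interpretation in stochastic matrices.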


2020 ◽  
Author(s):  
Emanuele Colonnelli ◽  
Jorge Gallego ◽  
Mounu Prem

The ability to predict corruption is crucial to policy. Using rich micro-data from Brazil, we show that multiple machine learning models display high levels of performance in predicting municipality-level corruption in public spending. We then quantify which individual municipality features and groups of similar characteristics have the highest predictive power. We find that measures of private sector activity, financial development, and human capital are the strongest predictors of corruption, while public sector and political features play a secondary role. Our findings have implications for the design and cost-effectiveness of various anti-corruption policies.
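The group-level comparison described in the abstract might be sketched as a grouped permutation test: permute each feature group jointly and compare the drop in model score. The groups, data, and model below are synthetic assumptions, not the authors' Brazilian micro-data.

```python
# Compare the predictive power of feature *groups* by permuting each group
# jointly and measuring the drop in classifier accuracy.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(6)
n = 1000
private = rng.normal(size=(n, 2))    # stand-in: private-sector activity measures
public = rng.normal(size=(n, 2))     # stand-in: public-sector features
# corruption label driven mostly by the "private" group, by construction
y = (private[:, 0] + 0.2 * public[:, 0] + rng.normal(scale=0.5, size=n) > 0).astype(int)

X = np.column_stack([private, public])
clf = RandomForestClassifier(n_estimators=200, random_state=0).fit(X, y)

def group_drop(cols):
    Xp = X.copy()
    Xp[:, cols] = Xp[rng.permutation(n)][:, cols]  # permute the group jointly
    return clf.score(X, y) - clf.score(Xp, y)

drop_private = group_drop([0, 1])
drop_public = group_drop([2, 3])     # should be the smaller drop here
```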

