Machine Learning Estimation of Heterogeneous Treatment Effects: the Microsoft EconML Library

Abstract: One of the biggest promises of machine learning is the automation of decision making in a multitude of application domains. A core problem that arises in most data-driven personalized decision scenarios is the estimation of heterogeneous treatment effects: what is the effect of an intervention on an outcome of interest as a function of a set of observable characteristics of the treated sample? For instance, this problem arises in personalized pricing, where the goal is to estimate the effect of a price discount on the demand as a function of characteristics of the consumer. Similarly it arises in medical trials where the goal is to estimate the effect of a drug treatment on the clinical response of a patient as a function of patient characteristics. In many such settings we have an abundance of observational data, where the intervention was chosen via some unknown policy and the ability to run control A/B tests is limited.

We will present recent research advances in the area of machine learning based estimation of heterogeneous treatment effects. These novel methods offer large flexibility in modeling the effect heterogeneity (via techniques such as random forests, boosting, lasso and neural nets), while at the same time leverage techniques from causal inference and econometrics to preserve the causal interpretation of the learned model and many times also offer statistical validity via the construction of valid confidence intervals. We will also present and demo the Microsoft EconML library, an open source package developed by the ALICE project of Microsoft Research, New England, which implements several recent estimation algorithms in a common python API.

Bio: Miruna Oprescu is a Data and Applied Scientist at Microsoft Research New England. In her current role, Miruna works alongside researchers and software engineers to build the next generation machine learning tools for interdisciplinary applications.
Miruna spends her time between two projects: project ALICE, a Microsoft Research initiative aimed at applying artificial intelligence concepts to economic decision making, and the Machine Learning for Cancer Immunotherapies initiative, a collaboration with doctors and cancer researchers with the goal of applying machine learning techniques to improving cancer therapies.
Prior to her current position, Miruna was a software engineer at Microsoft building MMLSpark, an open source distributed machine learning library powered by Apache Spark.