Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Policy Evaluation and Optimization with Continuous Treatments
43
Zitationen
2
Autoren
2018
Jahr
Abstract
We study the problem of policy evaluation and learning from batched contextual bandit data when treatments are continuous, going beyond previous work on discrete treatments. Previous work for discrete treatment/action spaces focuses on inverse probability weighting (IPW) and doubly robust (DR) methods that use a rejection sampling approach for evaluation and the equivalent weighted classification problem for learning. In the continuous setting, this reduction fails as we would almost surely reject all observations. To tackle the case of continuous treatments, we extend the IPW and DR approaches to the continuous setting using a kernel function that leverages treatment proximity to attenuate discrete rejection. Our policy estimator is consistent and we characterize the optimal bandwidth. The resulting continuous policy optimizer (CPO) approach using our estimator achieves convergent regret and approaches the best-in-class policy for learnable policy classes. We demonstrate that the estimator performs well and, in particular, outperforms a discretization-based benchmark. We further study the performance of our policy optimizer in a case study on personalized dosing based on a dataset of Warfarin patients, their covariates, and final therapeutic doses. Our learned policy outperforms benchmarks and nears the oracle-best linear policy.
Ähnliche Arbeiten
Applied logistic regression
1990 · 35.656 Zit.
The central role of the propensity score in observational studies for causal effects
1983 · 30.735 Zit.
SPSS and SAS procedures for estimating indirect effects in simple mediation models
2004 · 17.124 Zit.
A Proportional Hazards Model for the Subdistribution of a Competing Risk
1999 · 13.502 Zit.
Asymptotic Confidence Intervals for Indirect Effects in Structural Equation Models
1982 · 12.622 Zit.