A Practical Guide of Off-Policy Evaluation for Bandit Problems
Author
Abstract
Suggested Citation
Download full text from publisher
References listed on IDEAS
- Jinyong Hahn & Keisuke Hirano & Dean Karlan, 2011.
"Adaptive Experimental Design Using the Propensity Score,"
Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 29(1), pages 96-108, January.
- Hahn, Jinyong & Hirano, Keisuke & Karlan, Dean, 2011. "Adaptive Experimental Design Using the Propensity Score," Journal of Business & Economic Statistics, American Statistical Association, vol. 29(1), pages 96-108.
- Hahn, Jinyong & Hirano, Keisuke & Karlan, Dean, 2008. "Adaptive Experimental Design Using the Propensity Score," MPRA Paper 8315, University Library of Munich, Germany.
- Hahn, Jinyong & Hirano, Keisuke & Karlan, Dean, 2009. "Adaptive Experimental Design Using the Propensity Score," Working Papers 59, Yale University, Department of Economics.
- Jinyong Hahn & Keisuke Hirano & Dean Karlan, 2009. "Adaptive Experimental Design Using the Propensity Score," Working Papers 969, Economic Growth Center, Yale University.
- Hahn, Jinyong & Hirano, Keisuke & Karlan, Dean S., 2009. "Adaptive Experimental Design Using the Propensity Score," Center Discussion Papers 47107, Yale University, Economic Growth Center.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018.
"Double/debiased machine learning for treatment and structural parameters,"
Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey & James Robins, 2017. "Double/debiased machine learning for treatment and structural parameters," CeMMAP working papers CWP28/17, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey & James Robins, 2017. "Double/debiased machine learning for treatment and structural parameters," CeMMAP working papers 28/17, Institute for Fiscal Studies.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2017. "Double/Debiased Machine Learning for Treatment and Structural Parameters," NBER Working Papers 23564, National Bureau of Economic Research, Inc.
- Masahiro Kato, 2020. "Confidence Interval for Off-Policy Evaluation from Dependent Samples via Bandit Algorithm: Approach from Standardized Martingales," Papers 2006.06982, arXiv.org.
- Keisuke Hirano & Guido W. Imbens & Geert Ridder, 2003.
"Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score,"
Econometrica, Econometric Society, vol. 71(4), pages 1161-1189, July.
- Guido Imbens, 2000. "Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score," Econometric Society World Congress 2000 Contributed Papers 1166, Econometric Society.
- Keisuke Hirano & Guido W. Imbens & Geert Ridder, 2000. "Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score," NBER Technical Working Papers 0251, National Bureau of Economic Research, Inc.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Masahiro Kato & Shota Yasui & Kenichiro McAlinn, 2020. "The Adaptive Doubly Robust Estimator for Policy Evaluation in Adaptive Experiments and a Paradox Concerning Logging Policy," Papers 2010.03792, arXiv.org, revised Jun 2021.
- Masahiro Kato, 2021. "Adaptive Doubly Robust Estimator from Non-stationary Logging Policy under a Convergence of Average Probability," Papers 2102.08975, arXiv.org, revised Mar 2021.
- Masahiro Kato & Yusuke Kaneko, 2020. "Off-Policy Evaluation of Bandit Algorithm from Dependent Samples under Batch Update Policy," Papers 2010.13554, arXiv.org.
- Hirano, Keisuke & Porter, Jack R., 2020. "Asymptotic analysis of statistical decision rules in econometrics," Handbook of Econometrics, in: Steven N. Durlauf & Lars Peter Hansen & James J. Heckman & Rosa L. Matzkin (ed.), Handbook of Econometrics, edition 1, volume 7, chapter 0, pages 283-354, Elsevier.
- Masahiro Kato, 2020. "Confidence Interval for Off-Policy Evaluation from Dependent Samples via Bandit Algorithm: Approach from Standardized Martingales," Papers 2006.06982, arXiv.org.
- Harrison H. Li & Art B. Owen, 2023. "Double machine learning and design in batch adaptive experiments," Papers 2309.15297, arXiv.org.
- Masahiro Kato & Akihiro Oga & Wataru Komatsubara & Ryo Inokuchi, 2024. "Active Adaptive Experimental Design for Treatment Effect Estimation with Covariate Choices," Papers 2403.03589, arXiv.org, revised Jun 2024.
- Ruoxuan Xiong & Allison Koenecke & Michael Powell & Zhu Shen & Joshua T. Vogelstein & Susan Athey, 2021.
"Federated Causal Inference in Heterogeneous Observational Data,"
Papers
2107.11732, arXiv.org, revised Apr 2023.
- Xiong, Ruoxuan & Koenecke, Allison & Powell, Michael & Shen, Zhu & Vogelstein, Joshua T. & Athey, Susan, 2021. "Federated Causal Inference in Heterogeneous Observational Data," Research Papers 3990, Stanford University, Graduate School of Business.
- Sung Jae Jun & Sokbae Lee, 2024.
"Causal Inference Under Outcome-Based Sampling with Monotonicity Assumptions,"
Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 42(3), pages 998-1009, July.
- Sung Jae Jun & Sokbae Lee, 2020. "Causal Inference under Outcome-Based Sampling with Monotonicity Assumptions," Papers 2004.08318, arXiv.org, revised Oct 2023.
- Masahiro Kato & Masaaki Imaizumi & Takuya Ishihara & Toru Kitagawa, 2023. "Asymptotically Optimal Fixed-Budget Best Arm Identification with Variance-Dependent Bounds," Papers 2302.02988, arXiv.org, revised Jul 2023.
- Luo, Yu & Graham, Daniel J. & McCoy, Emma J., 2023. "Semiparametric Bayesian doubly robust causal estimation," LSE Research Online Documents on Economics 117944, London School of Economics and Political Science, LSE Library.
- Jinglong Zhao, 2024. "Experimental Design For Causal Inference Through An Optimization Lens," Papers 2408.09607, arXiv.org, revised Aug 2024.
- Yiyan Huang & Cheuk Hang Leung & Xing Yan & Qi Wu & Nanbo Peng & Dongdong Wang & Zhixiang Huang, 2020. "The Causal Learning of Retail Delinquency," Papers 2012.09448, arXiv.org.
- Yihui He & Fang Han, 2023. "On propensity score matching with a diverging number of matches," Papers 2310.14142, arXiv.org, revised Nov 2023.
- Jiang, Liang & Phillips, Peter C.B. & Tao, Yubo & Zhang, Yichong, 2023.
"Regression-adjusted estimation of quantile treatment effects under covariate-adaptive randomizations,"
Journal of Econometrics, Elsevier, vol. 234(2), pages 758-776.
- Liang Jiang & Xiaobin Liu & Peter C.B. Phillips & Yichong Zhang, 2021. "Regression-Adjusted Estimation of Quantile Treatment Effects under Covariate-Adaptive Randomizations," Cowles Foundation Discussion Papers 2288, Cowles Foundation for Research in Economics, Yale University.
- Liang Jiang & Peter C. B. Phillips & Yubo Tao & Yichong Zhang, 2021. "Regression-Adjusted Estimation of Quantile Treatment Effects under Covariate-Adaptive Randomizations," Papers 2105.14752, arXiv.org, revised Sep 2022.
- Huber Martin & Wüthrich Kaspar, 2019.
"Local Average and Quantile Treatment Effects Under Endogeneity: A Review,"
Journal of Econometric Methods, De Gruyter, vol. 8(1), pages 1-27, January.
- Huber, Martin & Wüthrich, Kaspar, 2019. "Local Average and Quantile Treatment Effects Under Endogeneity: A Review," University of California at San Diego, Economics Working Paper Series qt4j29d8sc, Department of Economics, UC San Diego.
- Fei Wang & Yuhao Deng, 2023. "Non-Asymptotic Bounds of AIPW Estimators for Means with Missingness at Random," Mathematics, MDPI, vol. 11(4), pages 1-14, February.
- Neng-Chieh Chang, 2020. "The Mode Treatment Effect," Papers 2007.11606, arXiv.org.
- Michael C Knaus & Michael Lechner & Anthony Strittmatter, 2021.
"Machine learning estimation of heterogeneous causal effects: Empirical Monte Carlo evidence,"
The Econometrics Journal, Royal Economic Society, vol. 24(1), pages 134-161.
- Knaus, Michael C. & Lechner, Michael & Strittmatter, Anthony, 2018. "Machine Learning Estimation of Heterogeneous Causal Effects: Empirical Monte Carlo Evidence," IZA Discussion Papers 12039, Institute of Labor Economics (IZA).
- Lechner, Michael & Knaus, Michael C. & Strittmatter, Anthony, 2018. "Machine Learning Estimation of Heterogeneous Causal Effects: Empirical Monte Carlo Evidence," CEPR Discussion Papers 13402, C.E.P.R. Discussion Papers.
- Knaus, Michael C. & Lechner, Michael & anthony.strittmatter@unisg.ch, 2018. "Machine Learning Estimation of Heterogeneous Causal Effects: Empirical Monte Carlo Evidence," Economics Working Paper Series 1817, University of St. Gallen, School of Economics and Political Science.
- Michael C. Knaus & Michael Lechner & Anthony Strittmatter, 2018. "Machine Learning Estimation of Heterogeneous Causal Effects: Empirical Monte Carlo Evidence," Papers 1810.13237, arXiv.org, revised Dec 2018.
- Victor Chernozhukov & Mert Demirer & Esther Duflo & Iván Fernández‐Val, 2025.
"Fisher–Schultz Lecture: Generic Machine Learning Inference on Heterogeneous Treatment Effects in Randomized Experiments, With an Application to Immunization in India,"
Econometrica, Econometric Society, vol. 93(4), pages 1121-1164, July.
- Victor Chernozhukov & Mert Demirer & Esther Duflo & Iv'an Fern'andez-Val, 2017. "Fisher-Schultz Lecture: Generic Machine Learning Inference on Heterogenous Treatment Effects in Randomized Experiments, with an Application to Immunization in India," Papers 1712.04802, arXiv.org, revised Oct 2023.
- Victor Chernozhukov & Mert Demirer & Esther Duflo & Iván Fernández-Val, 2023. "Fischer-Schultz Lecture: Generic Machine Learning Inference on Heterogenous Treatment Effects in Randomized Experiments, with an Application to Immunization in India," Working Papers hal-04238425, HAL.
More about this item
NEP fields
This paper has been announced in the following NEP Reports:- NEP-ECM-2020-11-02 (Econometrics)
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2010.12470. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.
Printed from https://ideas.repec.org/p/arx/papers/2010.12470.html