A Generally Efficient Targeted Minimum Loss Based Estimator based on the Highly Adaptive Lasso

My bibliography Save this article

A Generally Efficient Targeted Minimum Loss Based Estimator based on the Highly Adaptive Lasso

Author

Listed:

van der Laan Mark
(University of California, Berkeley, USA)

Registered:

Abstract

Suppose we observe n$n$ independent and identically distributed observations of a finite dimensional bounded random variable. This article is concerned with the construction of an efficient targeted minimum loss-based estimator (TMLE) of a pathwise differentiable target parameter of the data distribution based on a realistic statistical model. The only smoothness condition we will enforce on the statistical model is that the nuisance parameters of the data distribution that are needed to evaluate the canonical gradient of the pathwise derivative of the target parameter are multivariate real valued cadlag functions (right-continuous and left-hand limits, (G. Neuhaus. On weak convergence of stochastic processes with multidimensional time parameter. Ann Stat 1971;42:1285–1295.) and have a finite supremum and (sectional) variation norm. Each nuisance parameter is defined as a minimizer of the expectation of a loss function over over all functions it its parameter space. For each nuisance parameter, we propose a new minimum loss based estimator that minimizes the loss-specific empirical risk over the functions in its parameter space under the additional constraint that the variation norm of the function is bounded by a set constant. The constant is selected with cross-validation. We show such an MLE can be represented as the minimizer of the empirical risk over linear combinations of indicator basis functions under the constraint that the sum of the absolute value of the coefficients is bounded by the constant: i.e., the variation norm corresponds with this L1$L_1$-norm of the vector of coefficients. We will refer to this estimator as the highly adaptive Lasso (HAL)-estimator. We prove that for all models the HAL-estimator converges to the true nuisance parameter value at a rate that is faster than n−1/4$n^{-1/4}$ w.r.t. square-root of the loss-based dissimilarity. We also show that if this HAL-estimator is included in the library of an ensemble super-learner, then the super-learner will at minimal achieve the rate of convergence of the HAL, but, by previous results, it will actually be asymptotically equivalent with the oracle (i.e., in some sense best) estimator in the library. Subsequently, we establish that a one-step TMLE using such a super-learner as initial estimator for each of the nuisance parameters is asymptotically efficient at any data generating distribution in the model, under weak structural conditions on the target parameter mapping and model and a strong positivity assumption (e.g., the canonical gradient is uniformly bounded). We demonstrate our general theorem by constructing such a one-step TMLE of the average causal effect in a nonparametric model, and establishing that it is asymptotically efficient.

Suggested Citation

van der Laan Mark, 2017. "A Generally Efficient Targeted Minimum Loss Based Estimator based on the Highly Adaptive Lasso," The International Journal of Biostatistics, De Gruyter, vol. 13(2), pages 1-35, November.

Handle: RePEc:bpj:ijbist:v:13:y:2017:i:2:p:35:n:1
DOI: 10.1515/ijb-2015-0097

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

Porter Kristin E. & Gruber Susan & van der Laan Mark J. & Sekhon Jasjeet S., 2011. "The Relative Performance of Targeted Maximum Likelihood Estimators," The International Journal of Biostatistics, De Gruyter, vol. 7(1), pages 1-34, August.
Gruber, Susan & Laan, Mark van der, 2012. "tmle: An R Package for Targeted Maximum Likelihood Estimation," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 51(i13).

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Helene C. W. Rytgaard & Frank Eriksson & Mark J. van der Laan, 2023. "Estimation of time‐specific intervention effects on continuously distributed time‐to‐event outcomes by targeted maximum likelihood estimation," Biometrics, The International Biometric Society, vol. 79(4), pages 3038-3049, December.
Iván Díaz & Nima S. Hejazi, 2020. "Causal mediation analysis for stochastic interventions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(3), pages 661-683, July.
Helene C. W. Rytgaard & Mark J. Laan, 2024. "Targeted maximum likelihood estimation for causal inference in survival and competing risks analysis," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 30(1), pages 4-33, January.
Kara E. Rudolph & Nicholas Williams & Iván Díaz, 2023. "Efficient and flexible estimation of natural direct and indirect effects under intermediate confounding and monotonicity constraints," Biometrics, The International Biometric Society, vol. 79(4), pages 3126-3139, December.
Qingyuan Zhao & Dylan S. Small & Ashkan Ertefaie, 2022. "Selective inference for effect modification via the lasso," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(2), pages 382-413, April.
Ashkan Ertefaie & Nima S. Hejazi & Mark J. van der Laan, 2023. "Nonparametric inverse‐probability‐weighted estimators based on the highly adaptive lasso," Biometrics, The International Biometric Society, vol. 79(2), pages 1029-1041, June.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Ronald Herrera & Ursula Berger & Ondine S. Von Ehrenstein & Iván Díaz & Stella Huber & Daniel Moraga Muñoz & Katja Radon, 2017. "Estimating the Causal Impact of Proximity to Gold and Copper Mines on Respiratory Diseases in Chilean Children: An Application of Targeted Maximum Likelihood Estimation," IJERPH, MDPI, vol. 15(1), pages 1-15, December.
Youmi Suk, 2024. "A Within-Group Approach to Ensemble Machine Learning Methods for Causal Inference in Multilevel Studies," Journal of Educational and Behavioral Statistics, , vol. 49(1), pages 61-91, February.
Susan Gruber & Mark J. van der Laan, 2013. "An Application of Targeted Maximum Likelihood Estimation to the Meta-Analysis of Safety Data," Biometrics, The International Biometric Society, vol. 69(1), pages 254-262, March.
Sherri Rose & Sharon‐Lise Normand, 2019. "Double robust estimation for multiple unordered treatments and clustered observations: Evaluating drug‐eluting coronary artery stents," Biometrics, The International Biometric Society, vol. 75(1), pages 289-296, March.
Noémi Kreif & Richard Grieve & Iván Díaz & David Harrison, 2015. "Evaluation of the Effect of a Continuous Treatment: A Machine Learning Approach with an Application to Treatment for Traumatic Brain Injury," Health Economics, John Wiley & Sons, Ltd., vol. 24(9), pages 1213-1228, September.
Jun Wang & Yahe Yu, 2024. "Improved estimation of average treatment effects under covariate‐adaptive randomization methods," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 78(2), pages 310-333, May.
Zhang, Yingheng & Li, Haojie & Ren, Gang, 2025. "Analysing the role of traffic volume as mediator in transport policy evaluation with causal mediation analysis and targeted learning," Transportation Research Part A: Policy and Practice, Elsevier, vol. 192(C).
Gilbert Peter B. & Blette Bryan S. & Hudgens Michael G. & Shepherd Bryan E., 2020. "Post-randomization Biomarker Effect Modification Analysis in an HIV Vaccine Clinical Trial," Journal of Causal Inference, De Gruyter, vol. 8(1), pages 54-69, January.
Lishi Deng & Steff Taelman & Matthew R. Olm & Laeticia Celine Toe & Eva Balini & Lionel Olivier Ouédraogo & Yuri Bastos-Moreira & Alemayehu Argaw & Kokeb Tesfamariam & Erica D. Sonnenburg & Giles T. H, 2025. "Maternal balanced energy-protein supplementation reshapes the maternal gut microbiome and enhances carbohydrate metabolism in infants: a randomized controlled trial," Nature Communications, Nature, vol. 16(1), pages 1-16, December.
Michael Schomaker & Christian Heumann, 2020. "When and when not to use optimal model averaging," Statistical Papers, Springer, vol. 61(5), pages 2221-2240, October.
Youmi Suk & Kyung T. Han, 2024. "A Psychometric Framework for Evaluating Fairness in Algorithmic Decision Making: Differential Algorithmic Functioning," Journal of Educational and Behavioral Statistics, , vol. 49(2), pages 151-172, April.
Mariela Sued & Marina Valdora & Víctor Yohai, 2020. "Robust doubly protected estimators for quantiles with missing data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(3), pages 819-843, September.
Veronica Sciannameo & Gian Paolo Fadini & Daniele Bottigliengo & Angelo Avogaro & Ileana Baldi & Dario Gregori & Paola Berchialla, 2022. "Assessment of Glucose Lowering Medications’ Effectiveness for Cardiovascular Clinical Risk Management of Real-World Patients with Type 2 Diabetes: Targeted Maximum Likelihood Estimation under Model Mi," IJERPH, MDPI, vol. 19(22), pages 1-13, November.
Lars van der Laan & Wenbo Zhang & Peter B. Gilbert, 2023. "Nonparametric estimation of the causal effect of a stochastic threshold‐based intervention," Biometrics, The International Biometric Society, vol. 79(2), pages 1014-1028, June.
Ziyun Xu & Éric Archambault, 2015. "Chinese interpreting studies: structural determinants of MA students’ career choices," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(2), pages 1041-1058, November.
Yunda Huang & Lily Zhang & Shelly Karuna & Philip Andrew & Michal Juraska & Joshua A. Weiner & Heather Angier & Evgenii Morgan & Yasmin Azzam & Edith Swann & Srilatha Edupuganti & Nyaradzo M. Mgodi & , 2023. "Adults on pre-exposure prophylaxis (tenofovir-emtricitabine) have faster clearance of anti-HIV monoclonal antibody VRC01," Nature Communications, Nature, vol. 14(1), pages 1-19, December.
Brooks Jordan & van der Laan Mark J. & Go Alan S., 2012. "Targeted Maximum Likelihood Estimation for Prediction Calibration," The International Journal of Biostatistics, De Gruyter, vol. 8(1), pages 1-35, October.
Díaz Iván & Carone Marco & van der Laan Mark J., 2016. "Second-Order Inference for the Mean of a Variable Missing at Random," The International Journal of Biostatistics, De Gruyter, vol. 12(1), pages 333-349, May.
Iván Díaz & Nima S. Hejazi, 2020. "Causal mediation analysis for stochastic interventions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(3), pages 661-683, July.
Schnitzer Mireille E. & Lok Judith J. & Gruber Susan, 2016. "Variable Selection for Confounder Control, Flexible Modeling and Collaborative Targeted Minimum Loss-Based Estimation in Causal Inference," The International Journal of Biostatistics, De Gruyter, vol. 12(1), pages 97-115, May.

More about this item

Keywords

; ; ; ; ; ; ; ; ; ; ; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:ijbist:v:13:y:2017:i:2:p:35:n:1. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyterbrill.com .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

A Generally Efficient Targeted Minimum Loss Based Estimator based on the Highly Adaptive Lasso

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data