xtdml: Double Machine Learning Estimation to Static Panel Data Models with Fixed Effects in R

xtdml: Double Machine Learning Estimation to Static Panel Data Models with Fixed Effects in R

Author

Listed:

Annalivia Polselli

Abstract

The double machine learning (DML) method combines the predictive power of machine learning with statistical estimation to conduct inference about the structural parameter of interest. This paper presents the R package `xtdml`, which implements DML methods for partially linear panel regression models with low-dimensional fixed effects, high-dimensional confounding variables, proposed by Clarke and Polselli (2025). The package provides functionalities to: (a) learn nuisance functions with machine learning algorithms from the `mlr3` ecosystem, (b) handle unobserved individual heterogeneity choosing among first-difference transformation, within-group transformation, and correlated random effects, (c) transform the covariates with min-max normalization and polynomial expansion to improve learning performance. We showcase the use of `xtdml` with both simulated and real longitudinal data.

Suggested Citation

Annalivia Polselli, 2025. "xtdml: Double Machine Learning Estimation to Static Panel Data Models with Fixed Effects in R," Papers 2512.15965, arXiv.org.

Handle: RePEc:arx:papers:2512.15965

Download full text from publisher

References listed on IDEAS

Adamek, Robert & Smeekes, Stephan & Wilms, Ines, 2023. "Lasso inference for high-dimensional time series," Journal of Econometrics, Elsevier, vol. 235(2), pages 1114-1143.
- Robert Adamek & Stephan Smeekes & Ines Wilms, 2020. "Lasso Inference for High-Dimensional Time Series," Papers 2007.10952, arXiv.org, revised Sep 2022.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Efstathios Polyzos & Costas Siriopoulos, 2024. "Autoregressive Random Forests: Machine Learning and Lag Selection for Financial Research," Computational Economics, Springer;Society for Computational Economics, vol. 64(1), pages 225-262, July.
Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2023. "Machine learning advances for time series forecasting," Journal of Economic Surveys, Wiley Blackwell, vol. 37(1), pages 76-111, February.
- Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2020. "Machine Learning Advances for Time Series Forecasting," Papers 2012.12802, arXiv.org, revised Apr 2021.
Bin Chen & Yuefeng Han & Qiyang Yu, 2025. "Diffusion Index Forecasting with Tensor Data," Papers 2511.02235, arXiv.org, revised Feb 2026.
ALAMI CHENTOUFI, Reda, 2024. "Penalized Convex Estimation in Dynamic Location-Scale models," MPRA Paper 123283, University Library of Munich, Germany.
Christis Katsouris, 2023. "High Dimensional Time Series Regression Models: Applications to Statistical Learning Methods," Papers 2308.16192, arXiv.org.
Eugene Dettaa & Endong Wang, 2024. "Sparse VARs Do Not Imply Sparse Local Projections: Robust Inference for High-Dimensional Granger Causality," Papers 2410.04330, arXiv.org, revised Feb 2026.
Paul Haimerl & Stephan Smeekes & Ines Wilms, 2025. "Estimation of Latent Group Structures in Time-Varying Panel Data Models," Papers 2503.23165, arXiv.org, revised Nov 2025.
Endong Wang, 2024. "Local projections identify the same policy counterfactuals as empirical and structural models," Papers 2409.09577, arXiv.org, revised Feb 2026.
Karsten Reichold & Ulrike Schneider, 2025. "Beyond the Oracle Property: Adaptive LASSO in Cointegrating Regressions with Local-to-Unity Regressors," Papers 2510.07204, arXiv.org, revised Mar 2026.
Huang, Feiqing & Lu, Kexin & Zheng, Yao & Li, Guodong, 2025. "Supervised factor modeling for high-dimensional linear time series," Journal of Econometrics, Elsevier, vol. 249(PB).
Robert Adamek & Stephan Smeekes & Ines Wilms, 2023. "Sparse High-Dimensional Vector Autoregressive Bootstrap," Papers 2302.01233, arXiv.org, revised May 2025.
Alain Hecq & Luca Margaritella & Stephan Smeekes, 2023. "Inference in Non-stationary High-Dimensional VARs," Papers 2302.01434, arXiv.org, revised Sep 2023.
Sander Barendse, 2023. "Expected Shortfall LASSO," Papers 2307.01033, arXiv.org, revised Jan 2024.
Zhan Gao & Ji Hyung Lee & Ziwei Mei & Zhentao Shi, 2024. "LASSO Inference for High Dimensional Predictive Regressions," Papers 2409.10030, arXiv.org, revised Jan 2026.
Hill, Jonathan B., 2025. "Mixingale and physical dependence equality with applications," Statistics & Probability Letters, Elsevier, vol. 221(C).

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2026-01-12 (Big Data)
NEP-CMP-2026-01-12 (Computational Economics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2512.15965. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

xtdml: Double Machine Learning Estimation to Static Panel Data Models with Fixed Effects in R

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data