IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2512.15965.html

xtdml: Double Machine Learning Estimation to Static Panel Data Models with Fixed Effects in R

Author

Listed:
  • Annalivia Polselli

Abstract

The double machine learning (DML) method combines the predictive power of machine learning with statistical estimation to conduct inference about the structural parameter of interest. This paper presents the R package `xtdml`, which implements DML methods for partially linear panel regression models with low-dimensional fixed effects, high-dimensional confounding variables, proposed by Clarke and Polselli (2025). The package provides functionalities to: (a) learn nuisance functions with machine learning algorithms from the `mlr3` ecosystem, (b) handle unobserved individual heterogeneity choosing among first-difference transformation, within-group transformation, and correlated random effects, (c) transform the covariates with min-max normalization and polynomial expansion to improve learning performance. We showcase the use of `xtdml` with both simulated and real longitudinal data.

Suggested Citation

  • Annalivia Polselli, 2025. "xtdml: Double Machine Learning Estimation to Static Panel Data Models with Fixed Effects in R," Papers 2512.15965, arXiv.org.
  • Handle: RePEc:arx:papers:2512.15965
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2512.15965
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Adamek, Robert & Smeekes, Stephan & Wilms, Ines, 2023. "Lasso inference for high-dimensional time series," Journal of Econometrics, Elsevier, vol. 235(2), pages 1114-1143.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Efstathios Polyzos & Costas Siriopoulos, 2024. "Autoregressive Random Forests: Machine Learning and Lag Selection for Financial Research," Computational Economics, Springer;Society for Computational Economics, vol. 64(1), pages 225-262, July.
    2. Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2023. "Machine learning advances for time series forecasting," Journal of Economic Surveys, Wiley Blackwell, vol. 37(1), pages 76-111, February.
    3. Bin Chen & Yuefeng Han & Qiyang Yu, 2025. "Diffusion Index Forecasting with Tensor Data," Papers 2511.02235, arXiv.org, revised Feb 2026.
    4. ALAMI CHENTOUFI, Reda, 2024. "Penalized Convex Estimation in Dynamic Location-Scale models," MPRA Paper 123283, University Library of Munich, Germany.
    5. Christis Katsouris, 2023. "High Dimensional Time Series Regression Models: Applications to Statistical Learning Methods," Papers 2308.16192, arXiv.org.
    6. Eugene Dettaa & Endong Wang, 2024. "Sparse VARs Do Not Imply Sparse Local Projections: Robust Inference for High-Dimensional Granger Causality," Papers 2410.04330, arXiv.org, revised Feb 2026.
    7. Paul Haimerl & Stephan Smeekes & Ines Wilms, 2025. "Estimation of Latent Group Structures in Time-Varying Panel Data Models," Papers 2503.23165, arXiv.org, revised Nov 2025.
    8. Endong Wang, 2024. "Local projections identify the same policy counterfactuals as empirical and structural models," Papers 2409.09577, arXiv.org, revised Feb 2026.
    9. Karsten Reichold & Ulrike Schneider, 2025. "Beyond the Oracle Property: Adaptive LASSO in Cointegrating Regressions with Local-to-Unity Regressors," Papers 2510.07204, arXiv.org, revised Mar 2026.
    10. Huang, Feiqing & Lu, Kexin & Zheng, Yao & Li, Guodong, 2025. "Supervised factor modeling for high-dimensional linear time series," Journal of Econometrics, Elsevier, vol. 249(PB).
    11. Robert Adamek & Stephan Smeekes & Ines Wilms, 2023. "Sparse High-Dimensional Vector Autoregressive Bootstrap," Papers 2302.01233, arXiv.org, revised May 2025.
    12. Alain Hecq & Luca Margaritella & Stephan Smeekes, 2023. "Inference in Non-stationary High-Dimensional VARs," Papers 2302.01434, arXiv.org, revised Sep 2023.
    13. Sander Barendse, 2023. "Expected Shortfall LASSO," Papers 2307.01033, arXiv.org, revised Jan 2024.
    14. Zhan Gao & Ji Hyung Lee & Ziwei Mei & Zhentao Shi, 2024. "LASSO Inference for High Dimensional Predictive Regressions," Papers 2409.10030, arXiv.org, revised Jan 2026.
    15. Hill, Jonathan B., 2025. "Mixingale and physical dependence equality with applications," Statistics & Probability Letters, Elsevier, vol. 221(C).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2512.15965. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.