IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2604.23023.html

Realized Regularized Regressions

Author

Listed:
  • Aleksey Kolokolov
  • Shifan Yu

Abstract

We develop a continuous-time penalized regression framework for the estimation of time-varying coefficients and variable selection when both the response and covariates are It\^o semimartingales with jumps. The coefficient paths are approximated by spline basis expansions and estimated via least squares from truncated high-frequency increments. In a finite-dimensional setting, we establish consistency and derive a feasible asymptotic distribution for the integrated coefficient estimator under infill asymptotics. We then extend the framework to high-dimensional settings in which the number of candidate covariates diverges, and show that a group-wise penalized estimator with a truncated $\ell_1$-penalty attains the oracle property, which delivers both consistent model selection and coefficient estimation. An empirical application to a large panel of more than two hundred high-frequency factors documents sparse factor structure across a large cross-section of stocks and industry portfolios.

Suggested Citation

  • Aleksey Kolokolov & Shifan Yu, 2026. "Realized Regularized Regressions," Papers 2604.23023, arXiv.org.
  • Handle: RePEc:arx:papers:2604.23023
    as

    Download full text from publisher

    File URL: https://arxiv.org/pdf/2604.23023
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. O. E. Barndorff-Nielsen & P. Reinhard Hansen & A. Lunde & N. Shephard, 2009. "Realized kernels in practice: trades and quotes," Econometrics Journal, Royal Economic Society, vol. 12(3), pages 1-32, November.
    2. Andersen, Torben G. & Riva, Raul & Thyrsgaard, Martin & Todorov, Viktor, 2023. "Intraday cross-sectional distributions of systematic risk," Journal of Econometrics, Elsevier, vol. 235(2), pages 1394-1418.
    3. Alex Chinco & Adam D. Clark‐Joseph & Mao Ye, 2019. "Sparse Signals in the Cross‐Section of Returns," Journal of Finance, American Finance Association, vol. 74(1), pages 449-492, February.
    4. Andersen, Torben G. & Dobrev, Dobrislav & Schaumburg, Ernst, 2012. "Jump-robust volatility estimation using nearest neighbor truncation," Journal of Econometrics, Elsevier, vol. 169(1), pages 75-93.
    5. Chen, Dachuan & Feng, Long & Mykland, Per A. & Zhang, Lan, 2026. "High dimensional regression coefficient test with high frequency data," Journal of Econometrics, Elsevier, vol. 254(PA).
    6. Jia Li & Viktor Todorov & George Tauchen, 2019. "Jump factor models in large cross‐sections," Quantitative Economics, Econometric Society, vol. 10(2), pages 419-456, May.
    7. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    8. Torben G. Andersen & Tim Bollerslev & Francis X. Diebold & Jin Wu, 2005. "A Framework for Exploring the Macroeconomic Determinants of Systematic Risk," American Economic Review, American Economic Association, vol. 95(2), pages 398-404, May.
    9. Guanhao Feng & Stefano Giglio & Dacheng Xiu, 2020. "Taming the Factor Zoo: A Test of New Factors," Journal of Finance, American Finance Association, vol. 75(3), pages 1327-1370, June.
    10. Joachim Freyberger & Andreas Neuhierl & Michael Weber, 2020. "Dissecting Characteristics Nonparametrically," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2326-2377.
    11. Saketh Aleti & Tim Bollerslev, 2025. "News and Asset Pricing: A High-Frequency Anatomy of the SDF," The Review of Financial Studies, Society for Financial Studies, vol. 38(3), pages 712-759.
    12. Per A. Mykland & Lan Zhang, 2009. "Inference for Continuous Semimartingales Observed at High Frequency," Econometrica, Econometric Society, vol. 77(5), pages 1403-1445, September.
    13. Ole E. Barndorff-Nielsen & Neil Shephard, 2004. "Econometric Analysis of Realized Covariation: High Frequency Based Covariance, Regression, and Correlation in Financial Economics," Econometrica, Econometric Society, vol. 72(3), pages 885-925, May.
    14. Theis Ingerslev Jensen & Bryan Kelly & Lasse Heje Pedersen, 2023. "Is There a Replication Crisis in Finance?," Journal of Finance, American Finance Association, vol. 78(5), pages 2465-2518, October.
    15. Bollerslev, Tim & Li, Sophia Zhengzi & Todorov, Viktor, 2016. "Roughing up beta: Continuous versus discontinuous betas and the cross section of expected stock returns," Journal of Financial Economics, Elsevier, vol. 120(3), pages 464-490.
    16. Yacine Aït-Sahalia & Jean Jacod & Dacheng Xiu, 2025. "Continuous-Time Fama-MacBeth Regressions," The Review of Financial Studies, Society for Financial Studies, vol. 38(12), pages 3542-3579.
    17. Ilze Kalnina, 2023. "Inference for Nonparametric High-Frequency Estimators with an Application to Time Variation in Betas," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(2), pages 538-549, April.
    18. Kozak, Serhiy & Nagel, Stefan & Santosh, Shrihari, 2020. "Shrinking the cross-section," Journal of Financial Economics, Elsevier, vol. 135(2), pages 271-292.
    19. Cecilia Mancini, 2009. "Non‐parametric Threshold Estimation for Models with Stochastic Diffusion Coefficient and Jumps," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 36(2), pages 270-296, June.
    20. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    21. John H. Cochrane, 2011. "Presidential Address: Discount Rates," Journal of Finance, American Finance Association, vol. 66(4), pages 1047-1108, August.
    22. Kong, Xin-Bing & Liu, Cheng, 2018. "Testing against constant factor loading matrix with large panel high-frequency data," Journal of Econometrics, Elsevier, vol. 204(2), pages 301-319.
    23. Todorov, Viktor & Bollerslev, Tim, 2010. "Jumps and betas: A new framework for disentangling and estimating systematic risks," Journal of Econometrics, Elsevier, vol. 157(2), pages 220-235, August.
    24. Aït-Sahalia, Yacine & Kalnina, Ilze & Xiu, Dacheng, 2020. "High-frequency factor models and regressions," Journal of Econometrics, Elsevier, vol. 216(1), pages 86-105.
    25. Xiaotong Shen & Wei Pan & Yunzhang Zhu, 2012. "Likelihood-Based Selection and Sharp Parameter Estimation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(497), pages 223-232, March.
    26. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    27. Craig W. Holden & Stacey Jacobsen, 2014. "Liquidity Measurement Problems in Fast, Competitive Markets: Expensive and Cheap Solutions," Journal of Finance, American Finance Association, vol. 69(4), pages 1747-1785, August.
    28. Torben G. Andersen & Martin Thyrsgaard & Viktor Todorov, 2021. "Recalcitrant betas: Intraday variation in the cross‐sectional dispersion of systematic risk," Quantitative Economics, Econometric Society, vol. 12(2), pages 647-682, May.
    29. Fama, Eugene F. & French, Kenneth R., 1997. "Industry costs of equity," Journal of Financial Economics, Elsevier, vol. 43(2), pages 153-193, February.
    30. MacKinnon, James G. & White, Halbert, 1985. "Some heteroskedasticity-consistent covariance matrix estimators with improved finite sample properties," Journal of Econometrics, Elsevier, vol. 29(3), pages 305-325, September.
    31. Jianhua Z. Huang, 2002. "Varying-coefficient models and basis function approximations for the analysis of repeated measurements," Biometrika, Biometrika Trust, vol. 89(1), pages 111-128, March.
    32. Chen, Dachuan & Mykland, Per A. & Zhang, Lan, 2024. "Realized regression with asynchronous and noisy high frequency and high dimensional data," Journal of Econometrics, Elsevier, vol. 239(2).
    33. Ming Yuan & Yi Lin, 2006. "Model selection and estimation in regression with grouped variables," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(1), pages 49-67, February.
    34. Patrick Kline & Raffaele Saggio & Mikkel Sølvsten, 2020. "Leave‐Out Estimation of Variance Components," Econometrica, Econometric Society, vol. 88(5), pages 1859-1898, September.
    35. Andrew Y. Chen & Tom Zimmermann, 2022. "Open Source Cross-Sectional Asset Pricing," Critical Finance Review, now publishers, vol. 11(2), pages 207-264, May.
    36. Wang, Lifeng & Li, Hongzhe & Huang, Jianhua Z., 2008. "Variable Selection in Nonparametric Varying-Coefficient Models for Analysis of Repeated Measurements," Journal of the American Statistical Association, American Statistical Association, vol. 103(484), pages 1556-1569.
    37. Yi Ding & Yingying Li & Rui Song, 2024. "Statistical Learning for Individualized Asset Allocation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 119(545), pages 639-649, January.
    38. Joachim Freyberger & Andreas Neuhierl & Michael Weber & Andrew KarolyiEditor, 2020. "Dissecting Characteristics Nonparametrically," Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2326-2377.
    39. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    40. Saketh Aleti & Tim Bollerslev & Mathias Siggaard, 2025. "Intraday Market Return Predictability Culled from the Factor Zoo," Management Science, INFORMS, vol. 71(9), pages 7731-7751, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Changeun Kim & Younwoo Jeong & Bong-Gyu Jang, 2025. "Interpretable Deep Learning for Stock Returns: A Consensus-Bottleneck Asset Pricing Model," Papers 2512.16251, arXiv.org, revised Apr 2026.
    2. Bakalli, Gaetan & Guerrier, Stéphane & Scaillet, Olivier, 2023. "A penalized two-pass regression to predict stock returns with time-varying risk premia," Journal of Econometrics, Elsevier, vol. 237(2).
    3. Cheng, Mingmian & Liao, Yuan & Yang, Xiye, 2023. "Uniform predictive inference for factor models with instrumental and idiosyncratic betas," Journal of Econometrics, Elsevier, vol. 237(2).
    4. Sun, Chuanping, 2024. "Factor correlation and the cross section of asset returns: A correlation-robust machine learning approach," Journal of Empirical Finance, Elsevier, vol. 77(C).
    5. Donggyu Kim & Minseok Shin, 2024. "Nonconvex High-Dimensional Time-Varying Coefficient Estimation for Noisy High-Frequency Observations with a Factor Structure," Working Papers 202418, University of California at Riverside, Department of Economics.
    6. Söhnke M. Bartram & Harald Lohre & Peter F. Pope & Ananthalakshmi Ranganathan, 2021. "Navigating the factor zoo around the world: an institutional investor perspective," Journal of Business Economics, Springer, vol. 91(5), pages 655-703, July.
    7. Hoang, Daniel & Wiegratz, Kevin, 2022. "Machine learning methods in finance: Recent applications and prospects," Working Paper Series in Economics 158, Karlsruhe Institute of Technology (KIT), Department of Economics and Management.
    8. Guilherme V. Moura & Andr'e P. Santos & Hudson S. Torrent, 2025. "Variable selection for minimum-variance portfolios," Papers 2508.14986, arXiv.org.
    9. Jiang, Hao & Li, Sophia Zhengzi & Yuan, Peixuan, 2025. "Granular information and sectoral movements," Journal of Economic Dynamics and Control, Elsevier, vol. 171(C).
    10. Malakhov, Alexey & Riley, Timothy B. & Yan, Qing, 2024. "Do hedge funds bet against beta?," International Review of Economics & Finance, Elsevier, vol. 93(PA), pages 1507-1525.
    11. Yan, Jingda & Yu, Jialin, 2023. "Cross-stock momentum and factor momentum," Journal of Financial Economics, Elsevier, vol. 150(2).
    12. Esfandiar Maasoumi & Jianqiu Wang & Zhuo Wang & Ke Wu, 2024. "Identifying factors via automatic debiased machine learning," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(3), pages 438-461, April.
    13. Dong, Mengmeng, 2025. "Economic aggregation of return signals in global markets," Journal of Empirical Finance, Elsevier, vol. 84(C).
    14. Huang, Dashan & Li, Jiangyuan & Wang, Liyao, 2021. "Are disagreements agreeable? Evidence from information aggregation," Journal of Financial Economics, Elsevier, vol. 141(1), pages 83-101.
    15. De Nard, Gianluca & Zhao, Zhao, 2023. "Using, taming or avoiding the factor zoo? A double-shrinkage estimator for covariance matrices," Journal of Empirical Finance, Elsevier, vol. 72(C), pages 23-35.
    16. Ai He & Dashan Huang & Jiaen Li & Guofu Zhou, 2023. "Shrinking Factor Dimension: A Reduced-Rank Approach," Management Science, INFORMS, vol. 69(9), pages 5501-5522, September.
    17. Siddhartha Chib & Simon C. Smith, 2024. "Factor Selection and Structural Breaks," Finance and Economics Discussion Series 2024-037, Board of Governors of the Federal Reserve System (U.S.).
    18. Smith, Simon C., 2022. "Time-variation, multiple testing, and the factor zoo," International Review of Financial Analysis, Elsevier, vol. 84(C).
    19. Shanyan Lai, 2025. "Multilayer Perceptron Neural Network Models in Asset Pricing: An Empirical Study on Large-Cap US Stocks," Papers 2505.01921, arXiv.org, revised Jun 2026.
    20. Nima Afsharhajari & Jonathan Yu-Meng Li, 2026. "The Virtue of Sparsity in Complexity," Papers 2604.17166, arXiv.org.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2604.23023. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: https://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.