IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2510.24714.html
   My bibliography  Save this paper

Machine-Learning-Assisted Comparison of Regression Functions

Author

Listed:
  • Jian Yan
  • Zhuoxi Li
  • Yang Ning
  • Yong Chen

Abstract

We revisit the classical problem of comparing regression functions, a fundamental question in statistical inference with broad relevance to modern applications such as data integration, transfer learning, and causal inference. Existing approaches typically rely on smoothing techniques and are thus hindered by the curse of dimensionality. We propose a generalized notion of kernel-based conditional mean dependence that provides a new characterization of the null hypothesis of equal regression functions. Building on this reformulation, we develop two novel tests that leverage modern machine learning methods for flexible estimation. We establish the asymptotic properties of the test statistics, which hold under both fixed- and high-dimensional regimes. Unlike existing methods that often require restrictive distributional assumptions, our framework only imposes mild moment conditions. The efficacy of the proposed tests is demonstrated through extensive numerical studies.

Suggested Citation

  • Jian Yan & Zhuoxi Li & Yang Ning & Yong Chen, 2025. "Machine-Learning-Assisted Comparison of Regression Functions," Papers 2510.24714, arXiv.org.
  • Handle: RePEc:arx:papers:2510.24714
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2510.24714
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Srihera, Ramidha & Stute, Winfried, 2010. "Nonparametric comparison of regression functions," Journal of Multivariate Analysis, Elsevier, vol. 101(9), pages 2039-2059, October.
    2. Richard K. Crump & V. Joseph Hotz & Guido W. Imbens & Oscar A. Mitnik, 2008. "Nonparametric Tests for Treatment Effect Heterogeneity," The Review of Economics and Statistics, MIT Press, vol. 90(3), pages 389-405, August.
    3. Wenceslao González-Manteiga & Rosa Crujeiras, 2013. "Rejoinder on: An updated review of Goodness-of-Fit tests for regression models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 22(3), pages 442-447, September.
    4. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    5. Wenceslao González-Manteiga & Rosa Crujeiras, 2013. "An updated review of Goodness-of-Fit tests for regression models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 22(3), pages 361-411, September.
    6. Xiaoyu Hu & Jing Lei, 2024. "A Two-Sample Conditional Distribution Test Using Conformal Prediction and Weighted Rank Sum," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 119(546), pages 1136-1154, April.
    7. Jeffrey S. Racine & Ingrid Van Keilegom, 2020. "A Smooth Nonparametric, Multivariate, Mixed-Data Location-Scale Test," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 38(4), pages 784-795, October.
    8. Xiaofeng Shao & Jingsi Zhang, 2014. "Martingale Difference Correlation and Its Use in High-Dimensional Variable Screening," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 1302-1318, September.
    9. Lavergne, Pascal, 2001. "An equality test across nonparametric regressions," Journal of Econometrics, Elsevier, vol. 103(1-2), pages 307-344, July.
    10. Jian Yan & Xianyang Zhang, 2023. "Kernel two-sample tests in high dimensions: interplay between moment discrepancy and dimension-and-sample orders," Biometrika, Biometrika Trust, vol. 110(2), pages 411-430.
    11. Lai, Tingyu & Zhang, Zhongzhan & Wang, Yafei, 2021. "A kernel-based measure for conditional mean dependence," Computational Statistics & Data Analysis, Elsevier, vol. 160(C).
    12. Runze Li & Kai Xu & Yeqing Zhou & Liping Zhu, 2023. "Testing the Effects of High-Dimensional Covariates via Aggregating Cumulative Covariances," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(543), pages 2184-2194, July.
    13. King, Eileen & Hart, Jeffrey D. & Wehrly, Thomas E., 1991. "Testing the equality of two regression curves using linear smoothers," Statistics & Probability Letters, Elsevier, vol. 12(3), pages 239-247, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jian Yan & Zhuoxi Li & Xianyang Zhang, 2022. "Distance and Kernel-Based Measures for Global and Local Two-Sample Conditional Distribution Testing," Papers 2210.08149, arXiv.org, revised Aug 2025.
    2. Zhou, Niwen & Guo, Xu & Zhu, Lixing, 2024. "Significance test for semiparametric conditional average treatment effects and other structural functions," Computational Statistics & Data Analysis, Elsevier, vol. 189(C).
    3. Wenceslao González-Manteiga & Rosa Crujeiras, 2013. "An updated review of Goodness-of-Fit tests for regression models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 22(3), pages 361-411, September.
    4. Sun, Shuang & Song, Zening & Song, Xiaojun, 2025. "Unified specification tests in partially linear time series models," Computational Statistics & Data Analysis, Elsevier, vol. 203(C).
    5. Hušková, Marie & Meintanis, Simos G. & Pretorius, Charl, 2020. "Tests for validity of the semiparametric heteroskedastic transformation model," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    6. Álvarez-Liébana, J. & López-Pérez, A. & González-Manteiga, W. & Febrero-Bande, M., 2025. "A goodness-of-fit test for functional time series with applications to Ornstein-Uhlenbeck processes," Computational Statistics & Data Analysis, Elsevier, vol. 203(C).
    7. Pedro H. C. Sant'Anna & Xiaojun Song & Qi Xu, 2022. "Covariate distribution balance via propensity scores," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(6), pages 1093-1120, September.
    8. Zhang, Yaowu & Zhou, Yeqing & Zhu, Liping, 2025. "Interval quantile correlations with applications to testing high-dimensional quantile effects," Journal of Econometrics, Elsevier, vol. 249(PA).
    9. Miguel A. Delgado & Juan Carlos Escanciano, 2013. "Conditional Stochastic Dominance Testing," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 31(1), pages 16-28, January.
    10. Sianesi, Barbara, 2017. "Evidence of randomisation bias in a large-scale social experiment: The case of ERA," Journal of Econometrics, Elsevier, vol. 198(1), pages 41-64.
    11. Yongzhen Feng & Jie Li & Xiaojun Song, 2025. "Testing linearity in semi-functional partially linear regression models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 34(3), pages 786-814, September.
    12. Nathan Kallus & Miruna Oprescu, 2022. "Robust and Agnostic Learning of Conditional Distributional Treatment Effects," Papers 2205.11486, arXiv.org, revised Jun 2025.
    13. Zhijian Li & Tiejun Tong & Yuedong Wang, 2025. "A difference-based method for testing no effect in nonparametric regression," Computational Statistics, Springer, vol. 40(1), pages 153-176, January.
    14. Adam D. Bull, 2015. "Semimartingale detection and goodness-of-fit tests," Papers 1506.00088, arXiv.org, revised Jun 2016.
    15. Dong, Hao & Taylor, Luke, 2022. "Nonparametric Significance Testing In Measurement Error Models," Econometric Theory, Cambridge University Press, vol. 38(3), pages 454-496, June.
    16. Xu Guo & Gao-Rong Li & Michael McAleer & Wing-Keung Wong, 2018. "Specification Testing of Production in a Stochastic Frontier Model," Sustainability, MDPI, vol. 10(9), pages 1-10, August.
    17. Xu Guo & Wangli Xu & Lixing Zhu, 2015. "Model checking for parametric regressions with response missing at random," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 67(2), pages 229-259, April.
    18. Enno Mammen & Jens Perch Nielsen & Michael Scholz & Stefan Sperlich, 2019. "Conditional Variance Forecasts for Long-Term Stock Returns," Risks, MDPI, vol. 7(4), pages 1-22, November.
    19. Nathan Kallus, 2023. "Treatment Effect Risk: Bounds and Inference," Management Science, INFORMS, vol. 69(8), pages 4579-4590, August.
    20. Tian, Zhentao & Zhang, Zhongzhan, 2025. "Quantile feature screening for infinite dimensional data under FDR control," Computational Statistics & Data Analysis, Elsevier, vol. 206(C).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2510.24714. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.