IDEAS home Printed from https://ideas.repec.org/a/gam/jstats/v8y2025i3p71-d1721169.html
   My bibliography  Save this article

Individual Homogeneity Learning in Density Data Response Additive Models

Author

Listed:
  • Zixuan Han

    (Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, WA 98109, USA)

  • Tao Li

    (School of Statistics and Data Science, Shanghai University of Finance and Economics, Shanghai 200433, China)

  • Jinhong You

    (School of Statistics and Data Science, Shanghai University of Finance and Economics, Shanghai 200433, China)

  • Narayanaswamy Balakrishnan

    (Department of Mathematics and Statistics, McMaster University, Hamilton, ON L8S 4L8, Canada)

Abstract

In many complex applications, both data heterogeneity and homogeneity are present simultaneously. Overlooking either aspect can lead to misleading statistical inferences. Moreover, the increasing prevalence of complex, non-Euclidean data calls for more sophisticated modeling techniques. To address these challenges, we propose a density data response additive model, where the response variable is represented by a distributional density function. In this framework, individual effect curves are assumed to be homogeneous within groups but heterogeneous across groups, while covariates that explain variation share common additive bivariate functions. We begin by applying a transformation to map density functions into a linear space. To estimate the unknown subject-specific functions and the additive bivariate components, we adopt a B-spline series approximation method. Latent group structures are uncovered using a hierarchical agglomerative clustering algorithm, which allows our method to recover the true underlying groupings with high probability. To further improve estimation efficiency, we develop refined spline-backfitted local linear estimators for both the grouped structures and the additive bivariate functions in the post-grouping model. We also establish the asymptotic properties of the proposed estimators, including their convergence rates, asymptotic distributions, and post-grouping oracle efficiency. The effectiveness of our method is demonstrated through extensive simulation studies and real-world data analysis, both of which show promising and robust performance.

Suggested Citation

  • Zixuan Han & Tao Li & Jinhong You & Narayanaswamy Balakrishnan, 2025. "Individual Homogeneity Learning in Density Data Response Additive Models," Stats, MDPI, vol. 8(3), pages 1-27, August.
  • Handle: RePEc:gam:jstats:v:8:y:2025:i:3:p:71-:d:1721169
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2571-905X/8/3/71/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2571-905X/8/3/71/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Jialiang Li & Chao Huang & Zhub Hongtu, 2017. "A Functional Varying-Coefficient Single-Index Model for Functional Response Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 1169-1181, July.
    2. Liangjun Su & Zhentao Shi & Peter C. B. Phillips, 2016. "Identifying Latent Structures in Panel Data," Econometrica, Econometric Society, vol. 84, pages 2215-2264, November.
    3. Kyunghee Han & Hans-Georg Müller & Byeong U. Park, 2020. "Additive Functional Regression for Densities as Responses," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(530), pages 997-1010, April.
    4. Pei, Youquan & Huang, Tao & You, Jinhong, 2018. "Nonparametric fixed effects model for panel data with locally stationary regressors," Journal of Econometrics, Elsevier, vol. 202(2), pages 286-305.
    5. Talská, R. & Menafoglio, A. & Machalová, J. & Hron, K. & Fišerová, E., 2018. "Compositional regression with functional response," Computational Statistics & Data Analysis, Elsevier, vol. 123(C), pages 66-85.
    6. Xinchao Luo & Lixing Zhu & Hongtu Zhu, 2016. "Single‐index varying coefficient model for functional responses," Biometrics, The International Biometric Society, vol. 72(4), pages 1275-1284, December.
    7. Michael Vogt & Oliver Linton, 2017. "Classification of non-parametric regression functions in longitudinal data models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(1), pages 5-27, January.
    8. J. Ramsay, 1982. "When the data are functions," Psychometrika, Springer;The Psychometric Society, vol. 47(4), pages 379-396, December.
    9. Elias Masry, 1996. "Multivariate Local Polynomial Regression For Time Series:Uniform Strong Consistency And Rates," Journal of Time Series Analysis, Wiley Blackwell, vol. 17(6), pages 571-599, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xiaorong Yang & Jia Chen & Degui Li & Runze Li, 2024. "Functional-Coefficient Quantile Regression for Panel Data with Latent Group Structure," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 42(3), pages 1026-1040, July.
    2. Miao, Ke & Su, Liangjun & Wang, Wendun, 2020. "Panel threshold regressions with latent group structures," Journal of Econometrics, Elsevier, vol. 214(2), pages 451-481.
    3. Jingru Zhang & Mathias Basner & Christopher W. Jones & David F. Dinges & Haochang Shou & Hongzhe Li, 2024. "Mediation Analysis with Random Distribution as Mediator with an Application to iCOMPARE Trial," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 16(1), pages 107-128, April.
    4. Bian, Yulin & Su, Liangjun, 2025. "A note on factor models with latent group structures," Economics Letters, Elsevier, vol. 252(C).
    5. Ruiyan Luo & Xin Qi, 2023. "Nonlinear function‐on‐scalar regression via functional universal approximation," Biometrics, The International Biometric Society, vol. 79(4), pages 3319-3331, December.
    6. Qian Huang & Jinhong You & Liwen Zhang, 2022. "Efficient inference of longitudinal/functional data models with time‐varying additive structure," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 49(2), pages 744-771, June.
    7. Wang, Wei & Xiao, Zhijie & Ren, Yanyan & Yan, Xiaodong, 2023. "A bi-integrative analysis of two-dimensional heterogeneous panel data models," Economics Letters, Elsevier, vol. 230(C).
    8. Degui Li & Bin Peng & Songqiao Tang & Weibiao Wu, 2023. "Estimation of Grouped Time-Varying Network Vector Autoregression Models," Papers 2303.10117, arXiv.org, revised Mar 2024.
    9. Chen, Feifei & Jiang, Qing & Feng, Zhenghui & Zhu, Lixing, 2020. "Model checks for functional linear regression models based on projected empirical processes," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    10. Vogt, Michael & Linton, Oliver, 2020. "Multiscale clustering of nonparametric regression curves," Journal of Econometrics, Elsevier, vol. 216(1), pages 305-325.
    11. Petersen, Alexander & Zhang, Chao & Kokoszka, Piotr, 2022. "Modeling Probability Density Functions as Data Objects," Econometrics and Statistics, Elsevier, vol. 21(C), pages 159-178.
    12. Jia Chen, 2019. "Estimating latent group structure in time-varying coefficient panel data models," The Econometrics Journal, Royal Economic Society, vol. 22(3), pages 223-240.
    13. Xiong Cai & Liugen Xue & Xiaolong Pu & Xingyu Yan, 2021. "Efficient Estimation for Varying-Coefficient Mixed Effects Models with Functional Response Data," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(4), pages 467-495, May.
    14. Michael Vogt & Oliver Linton, 2015. "Classification of nonparametric regression functions in heterogeneous panels," CeMMAP working papers 06/15, Institute for Fiscal Studies.
    15. Gao, Jiti & Xia, Kai & Zhu, Huanjun, 2020. "Heterogeneous panel data models with cross-sectional dependence," Journal of Econometrics, Elsevier, vol. 219(2), pages 329-353.
    16. Pionati, Alessandro, 2025. "Latent grouped structures in panel data: a review," MPRA Paper 123954, University Library of Munich, Germany.
    17. Degui Li & Bin Peng & Songqiao Tang & Weibiao Wu, 2023. "Inference of Grouped Time-Varying Network Vector Autoregression Models," Monash Econometrics and Business Statistics Working Papers 5/23, Monash University, Department of Econometrics and Business Statistics.
    18. Liebl, Dominik & Walders, Fabian, 2019. "Parameter regimes in partial functional panel regression," Econometrics and Statistics, Elsevier, vol. 11(C), pages 105-115.
    19. Zhentao Shi & Liangjun Su & Tian Xie, 2020. "L2-Relaxation: With Applications to Forecast Combination and Portfolio Analysis," Papers 2010.09477, arXiv.org, revised Aug 2022.
    20. Paul Haimerl & Stephan Smeekes & Ines Wilms, 2025. "Estimation of Latent Group Structures in Time-Varying Panel Data Models," Papers 2503.23165, arXiv.org.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jstats:v:8:y:2025:i:3:p:71-:d:1721169. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.