IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v138y2019icp239-259.html
   My bibliography  Save this article

Subgroup analysis for heterogeneous additive partially linear models and its application to car sales data

Author

Listed:
  • Liu, Lili
  • Lin, Lu

Abstract

As an extension of additive partially linear model, heterogeneous additive partially linear model contains the homogeneous linear components and subject-dependent additive components, but has no group information of subject-dependent additive components. Such a model is more flexible and efficient for addressing some special issues such as precision medicine and precision marketing. A polynomial spline smoothing is used to approximate the heterogeneous additive components, and then a new clustering method is developed to automatically identify subgroups. The procedure avoids solving coefficient vector in each iterative step as in regression clustering procedures. Thus, this approach is rapid and computationally stable even if the sample size is large. Based on the clustered heterogeneous additive components, consistent estimators of the homogeneous parameters and subgroup-specific additive components are further obtained. Moreover, n-consistency and asymptotic normality for the estimators of the parametric components are established. The simulation studies and real data analysis illustrate that the model and proposed clustering and estimation are effective in practice.

Suggested Citation

  • Liu, Lili & Lin, Lu, 2019. "Subgroup analysis for heterogeneous additive partially linear models and its application to car sales data," Computational Statistics & Data Analysis, Elsevier, vol. 138(C), pages 239-259.
  • Handle: RePEc:eee:csdana:v:138:y:2019:i:c:p:239-259
    DOI: 10.1016/j.csda.2019.04.011
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947319300957
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2019.04.011?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Shen, Xiaotong & Huang, Hsin-Cheng, 2010. "Grouping Pursuit Through a Regularization Solution Surface," Journal of the American Statistical Association, American Statistical Association, vol. 105(490), pages 727-739.
    2. Li, Qi, 2000. "Efficient Estimation of Additive Partially Linear Models," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 41(4), pages 1073-1092, November.
    3. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    4. Robert Tibshirani & Michael Saunders & Saharon Rosset & Ji Zhu & Keith Knight, 2005. "Sparsity and smoothness via the fused lasso," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(1), pages 91-108, February.
    5. Juan Shen & Xuming He, 2015. "Inference for Subgroup Analysis With a Structured Logistic-Normal Mixture Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 303-312, March.
    6. Jianhua Z. Huang, 2002. "Varying-coefficient models and basis function approximations for the analysis of repeated measurements," Biometrika, Biometrika Trust, vol. 89(1), pages 111-128, March.
    7. Jian Guo & Elizaveta Levina & George Michailidis & Ji Zhu, 2010. "Pairwise Variable Selection for High-Dimensional Model-Based Clustering," Biometrics, The International Biometric Society, vol. 66(3), pages 793-804, September.
    8. Xuming He, 2002. "Estimation in a semiparametric model for longitudinal data with unspecified dependence structure," Biometrika, Biometrika Trust, vol. 89(3), pages 579-590, August.
    9. Hansheng Wang & Runze Li & Chih-Ling Tsai, 2007. "Tuning parameter selectors for the smoothly clipped absolute deviation method," Biometrika, Biometrika Trust, vol. 94(3), pages 553-568.
    10. Shujie Ma & Jian Huang, 2017. "A Concave Pairwise Fusion Approach to Subgroup Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(517), pages 410-423, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Wang, Xin & Zhu, Zhengyuan & Zhang, Hao Helen, 2023. "Spatial heterogeneity automatic detection and estimation," Computational Statistics & Data Analysis, Elsevier, vol. 180(C).
    2. Zhang, Xiaochen & Zhang, Qingzhao & Ma, Shuangge & Fang, Kuangnan, 2022. "Subgroup analysis for high-dimensional functional regression," Journal of Multivariate Analysis, Elsevier, vol. 192(C).
    3. Fang, Kuangnan & Chen, Yuanxing & Ma, Shuangge & Zhang, Qingzhao, 2022. "Biclustering analysis of functionals via penalized fusion," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    4. Veronica Distefano & Maria Mannone & Irene Poli, 2023. "Exploring Heterogeneity with Category and Cluster Analyses for Mixed Data," Stats, MDPI, vol. 6(3), pages 1-16, July.
    5. Mingyang Ren & Qingzhao Zhang & Sanguo Zhang & Tingyan Zhong & Jian Huang & Shuangge Ma, 2022. "Hierarchical cancer heterogeneity analysis based on histopathological imaging features," Biometrics, The International Biometric Society, vol. 78(4), pages 1579-1591, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mehrabani, Ali, 2023. "Estimation and identification of latent group structures in panel data," Journal of Econometrics, Elsevier, vol. 235(2), pages 1464-1482.
    2. Lu Tang & Peter X.‐K. Song, 2021. "Poststratification fusion learning in longitudinal data analysis," Biometrics, The International Biometric Society, vol. 77(3), pages 914-928, September.
    3. Lian, Heng, 2015. "Quantile regression for dynamic partially linear varying coefficient time series models," Journal of Multivariate Analysis, Elsevier, vol. 141(C), pages 49-66.
    4. Tian, Ruiqin & Xue, Liugen & Liu, Chunling, 2014. "Penalized quadratic inference functions for semiparametric varying coefficient partially linear models with longitudinal data," Journal of Multivariate Analysis, Elsevier, vol. 132(C), pages 94-110.
    5. Baosheng Liang & Peng Wu & Xingwei Tong & Yanping Qiu, 2020. "Regression and subgroup detection for heterogeneous samples," Computational Statistics, Springer, vol. 35(4), pages 1853-1878, December.
    6. Shuang Zhang & Xingdong Feng, 2022. "Distributed identification of heterogeneous treatment effects," Computational Statistics, Springer, vol. 37(1), pages 57-89, March.
    7. Ye, Mao & Lu, Zhao-Hua & Li, Yimei & Song, Xinyuan, 2019. "Finite mixture of varying coefficient model: Estimation and component selection," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 452-474.
    8. Xiao Ni & Daowen Zhang & Hao Helen Zhang, 2010. "Variable Selection for Semiparametric Mixed Models in Longitudinal Studies," Biometrics, The International Biometric Society, vol. 66(1), pages 79-88, March.
    9. Sakyajit Bhattacharya & Paul McNicholas, 2014. "A LASSO-penalized BIC for mixture model selection," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(1), pages 45-61, March.
    10. Jeon, Jong-June & Kwon, Sunghoon & Choi, Hosik, 2017. "Homogeneity detection for the high-dimensional generalized linear model," Computational Statistics & Data Analysis, Elsevier, vol. 114(C), pages 61-74.
    11. Wang, Wuyi & Su, Liangjun, 2021. "Identifying latent group structures in nonlinear panels," Journal of Econometrics, Elsevier, vol. 220(2), pages 272-295.
    12. Fei Wang & Lu Wang & Peter X.‐K. Song, 2016. "Fused lasso with the adaptation of parameter ordering in combining multiple studies with repeated measurements," Biometrics, The International Biometric Society, vol. 72(4), pages 1184-1193, December.
    13. Zheng Tracy Ke & Jianqing Fan & Yichao Wu, 2015. "Homogeneity Pursuit," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 175-194, March.
    14. Xuejun Ma & Yue Du & Jingli Wang, 2022. "Model detection and variable selection for mode varying coefficient model," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 31(2), pages 321-341, June.
    15. Bang, Sungwan & Jhun, Myoungshic, 2012. "Simultaneous estimation and factor selection in quantile regression via adaptive sup-norm regularization," Computational Statistics & Data Analysis, Elsevier, vol. 56(4), pages 813-826.
    16. Zheng, Xueying & Xue, Lan & Qu, Annie, 2018. "Time-varying correlation structure estimation and local-feature detection for spatio-temporal data," Journal of Multivariate Analysis, Elsevier, vol. 168(C), pages 221-239.
    17. Wang, Xin & Zhu, Zhengyuan & Zhang, Hao Helen, 2023. "Spatial heterogeneity automatic detection and estimation," Computational Statistics & Data Analysis, Elsevier, vol. 180(C).
    18. Qian, Junhui & Su, Liangjun, 2016. "Shrinkage estimation of common breaks in panel data models via adaptive group fused Lasso," Journal of Econometrics, Elsevier, vol. 191(1), pages 86-109.
    19. Ye He & Ling Zhou & Yingcun Xia & Huazhen Lin, 2023. "Center‐augmented ℓ2‐type regularization for subgroup learning," Biometrics, The International Biometric Society, vol. 79(3), pages 2157-2170, September.
    20. Sunkyung Kim & Wei Pan & Xiaotong Shen, 2013. "Network-Based Penalized Regression With Application to Genomic Data," Biometrics, The International Biometric Society, vol. 69(3), pages 582-593, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:138:y:2019:i:c:p:239-259. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.