IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v54y2010i10p2253-2266.html
   My bibliography  Save this article

Mixtures of regressions with predictor-dependent mixing proportions

Author

Listed:
  • Young, D.S.
  • Hunter, D.R.

Abstract

We extend the standard mixture of linear regressions model by allowing the mixing proportions to be modeled nonparametrically as a function of the predictors. This framework allows for more flexibility in the modeling of the mixing proportions than the fully parametric mixture of experts model, which we also discuss. We present an EM-like algorithm for estimation of the new model. We also provide simulations demonstrating that our nonparametric approach can provide a better fit than the parametric approach in some instances and can serve to validate and thus reinforce the parametric approach in others. We also analyze and interpret two real data sets using the new method.

Suggested Citation

  • Young, D.S. & Hunter, D.R., 2010. "Mixtures of regressions with predictor-dependent mixing proportions," Computational Statistics & Data Analysis, Elsevier, vol. 54(10), pages 2253-2266, October.
  • Handle: RePEc:eee:csdana:v:54:y:2010:i:10:p:2253-2266
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(10)00146-5
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. T. Rolf Turner, 2000. "Estimating the propagation rate of a viral infection of potato plants via mixtures of regressions," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 49(3), pages 371-384.
    2. Yau, Kelvin K. W. & Lee, Andy H. & Ng, Angus S. K., 2003. "Finite mixture regression model with random effects: application to neonatal hospital length of stay," Computational Statistics & Data Analysis, Elsevier, vol. 41(3-4), pages 359-366, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Keefe Murphy & Thomas Brendan Murphy, 2020. "Gaussian parsimonious clustering models with covariates and a noise component," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 14(2), pages 293-325, June.
    2. David Hunter & Derek Young, 2012. "Semiparametric mixtures of regressions," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 24(1), pages 19-38.
    3. Chau, Thi Tuyet Trang & Ailliot, Pierre & Monbet, Valérie, 2021. "An algorithm for non-parametric estimation in state–space models," Computational Statistics & Data Analysis, Elsevier, vol. 153(C).
    4. Yao, Weixin & Wei, Yan & Yu, Chun, 2014. "Robust mixture regression using the t-distribution," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 116-127.
    5. Marco Berrettini & Giuliano Galimberti & Saverio Ranciati, 2023. "Semiparametric finite mixture of regression models with Bayesian P-splines," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 17(3), pages 745-775, September.
    6. Sijia Xiang & Weixin Yao, 2020. "Semiparametric mixtures of regressions with single-index for model based clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 14(2), pages 261-292, June.
    7. Wang, Shaoli & Yao, Weixin & Huang, Mian, 2014. "A note on the identifiability of nonparametric and semiparametric mixtures of GLMs," Statistics & Probability Letters, Elsevier, vol. 93(C), pages 41-45.
    8. Sijia Xiang & Weixin Yao, 2018. "Semiparametric mixtures of nonparametric regressions," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 70(1), pages 131-154, February.
    9. Wraith, Darren & Forbes, Florence, 2015. "Location and scale mixtures of Gaussians with flexible tail behaviour: Properties, inference and application to multivariate clustering," Computational Statistics & Data Analysis, Elsevier, vol. 90(C), pages 61-73.
    10. Yuzhu Tian & Manlai Tang & Maozai Tian, 2016. "A class of finite mixture of quantile regressions with its applications," Journal of Applied Statistics, Taylor & Francis Journals, vol. 43(7), pages 1240-1252, July.
    11. Xue, Jiacheng & Yao, Weixin, 2022. "Machine Learning Embedded Semiparametric Mixtures of Regressions with Covariate-Varying Mixing Proportions," Econometrics and Statistics, Elsevier, vol. 22(C), pages 159-171.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gianfranco DI VAIO & Michele BATTISTI, 2010. "A Spatially-Filtered Mixture of Beta-Convergence Regression for EU Regions, 1980-2002," Regional and Urban Modeling 284100013, EcoMod.
    2. Nalan Basturk & Richard Paap & Dick van Dijk, 2008. "Structural Differences in Economic Growth," Tinbergen Institute Discussion Papers 08-085/4, Tinbergen Institute.
    3. Di Vaio, Gianfranco & Enflo, Kerstin, 2011. "Did globalization drive convergence? Identifying cross-country growth regimes in the long run," European Economic Review, Elsevier, vol. 55(6), pages 832-844, August.
    4. Xiong, Yingge & Tobias, Justin L. & Mannering, Fred L., 2014. "The analysis of vehicle crash injury-severity data: A Markov switching approach with road-segment heterogeneity," Transportation Research Part B: Methodological, Elsevier, vol. 67(C), pages 109-128.
    5. Alegre, Joaquín & Mateo, Sara & Pou, Llorenç, 2011. "A latent class approach to tourists’ length of stay," Tourism Management, Elsevier, vol. 32(3), pages 555-563.
    6. Atefeh Zarei & Zahra Khodadadi & Mohsen Maleki & Karim Zare, 2023. "Robust mixture regression modeling based on two-piece scale mixtures of normal distributions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 17(1), pages 181-210, March.
    7. Michele Battisti & Gianfranco Vaio, 2009. "A spatially filtered mixture of β-convergence regressions for EU regions, 1980–2002," Studies in Empirical Economics, in: Giuseppe Arbia & Badi H. Baltagi (ed.), Spatial Econometrics, pages 105-121, Springer.
    8. Xiong, Yingge & Mannering, Fred L., 2013. "The heterogeneous effects of guardian supervision on adolescent driver-injury severities: A finite-mixture random-parameters approach," Transportation Research Part B: Methodological, Elsevier, vol. 49(C), pages 39-54.
    9. Gianfranco Di Vaio & Kerstin Enflo, 2009. "Did Globalization Lead to Segmentation? Identifying Cross-Country Growth Regimes in the Long-Run," Discussion Papers 09-08, University of Copenhagen. Department of Economics.
    10. Michele Battisti, 2013. "Reassessing Segmentation In The Labour Market: An Application For Italy 1995–2004," Bulletin of Economic Research, Wiley Blackwell, vol. 65, pages 38-55, May.
    11. Rainer Schlittgen, 2011. "A weighted least-squares approach to clusterwise regression," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 95(2), pages 205-217, June.
    12. Chungkham Singh & Laishram Ladusingh, 2010. "Inpatient length of stay: a finite mixture modeling analysis," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 11(2), pages 119-126, April.
    13. Gabriele Perrone & Gabriele Soffritti, 2023. "Seemingly unrelated clusterwise linear regression for contaminated data," Statistical Papers, Springer, vol. 64(3), pages 883-921, June.
    14. Ye He & Ling Zhou & Yingcun Xia & Huazhen Lin, 2023. "Center‐augmented ℓ2‐type regularization for subgroup learning," Biometrics, The International Biometric Society, vol. 79(3), pages 2157-2170, September.
    15. Shin-Fu Tsai, 2019. "Comparing Coefficients Across Subpopulations in Gaussian Mixture Regression Models," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 24(4), pages 610-633, December.
    16. Ang Shan & Fengkai Yang, 2021. "Bayesian Inference for Finite Mixture Regression Model Based on Non-Iterative Algorithm," Mathematics, MDPI, vol. 9(6), pages 1-13, March.
    17. Charlotte Articus & Jan Pablo Burgard, 2014. "A Finite Mixture Fay Herriot-type model for estimating regional rental prices in Germany," Research Papers in Economics 2014-14, University of Trier, Department of Economics.
    18. Yan Meng & Xueyan Zhao & Xibin Zhang & Jiti Gao, 2017. "A panel data analysis of hospital variations in length of stay for hip replacements: Private versus public," Monash Econometrics and Business Statistics Working Papers 20/17, Monash University, Department of Econometrics and Business Statistics.
    19. Luísa Novais & Susana Faria, 2021. "Comparison of the EM, CEM and SEM algorithms in the estimation of finite mixtures of linear mixed models: a simulation study," Computational Statistics, Springer, vol. 36(4), pages 2507-2533, December.
    20. Giuliano Galimberti & Lorenzo Nuzzi & Gabriele Soffritti, 2021. "Covariance matrix estimation of the maximum likelihood estimator in multivariate clusterwise linear regression," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(1), pages 235-268, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:54:y:2010:i:10:p:2253-2266. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.