IDEAS home Printed from https://ideas.repec.org/a/spr/jclass/v42y2025i2d10.1007_s00357-024-09497-9.html
   My bibliography  Save this article

Old but Gold or New and Shiny? Comparing Tree Ensembles for Ordinal Prediction with a Classic Parametric Approach

Author

Listed:
  • Philip Buczak

    (TU Dortmund University
    UA Ruhr)

  • Daniel Horn

    (TU Dortmund University)

  • Markus Pauly

    (TU Dortmund University
    UA Ruhr)

Abstract

Ordinal data are frequently encountered, e.g., in the life and social sciences. Predicting ordinal outcomes can inform important decisions, e.g., in medicine or education. Two methodological streams tackle prediction of ordinal outcomes: Traditional parametric models, e.g., the proportional odds model (POM), and machine learning-based tree ensemble (TE) methods. A promising TE approach involves selecting the best performing from sets of randomly generated numeric scores assigned to ordinal response categories (ordinal forest; Hornung, 2019). We propose a new method, the ordinal score optimization algorithm, that takes a similar approach but selects scores through non-linear optimization. We compare these and other TE methods with the computationally much less expensive POM. Despite selective efforts, the literature lacks an encompassing simulation-based comparison. Aiming to fill this gap, we find that while TE approaches outperform the POM for strong non-linear effects, the latter is competitive for small sample sizes even under medium non-linear effects.

Suggested Citation

  • Philip Buczak & Daniel Horn & Markus Pauly, 2025. "Old but Gold or New and Shiny? Comparing Tree Ensembles for Ordinal Prediction with a Classic Parametric Approach," Journal of Classification, Springer;The Classification Society, vol. 42(2), pages 364-390, July.
  • Handle: RePEc:spr:jclass:v:42:y:2025:i:2:d:10.1007_s00357-024-09497-9
    DOI: 10.1007/s00357-024-09497-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00357-024-09497-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00357-024-09497-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Wright, Marvin N. & Ziegler, Andreas, 2017. "ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 77(i01).
    2. Archer, Kellie J., 2010. "rpartOrdinal: An R Package for Deriving a Classification Tree for Predicting an Ordinal Response," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 34(i07).
    3. Galimberti, Giuliano & Soffritti, Gabriele & Maso, Matteo Di, 2012. "Classification Trees for Ordinal Responses in R: The rpartScore Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 47(i10).
    4. Gerhard Tutz & Moritz Berger, 2022. "Sparser Ordinal Regression Models Based on Parametric and Additive Location‐Shift Approaches," International Statistical Review, International Statistical Institute, vol. 90(2), pages 306-327, August.
    5. Faisal Maqbool Zahid & Gerhard Tutz, 2013. "Proportional Odds Models with High‐Dimensional Data Structure," International Statistical Review, International Statistical Institute, vol. 81(3), pages 388-406, December.
    6. Bercedis Peterson & Frank E. Harrell, 1990. "Partial Proportional Odds Models for Ordinal Response Variables," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 39(2), pages 205-217, June.
    7. Janitza, Silke & Tutz, Gerhard & Boulesteix, Anne-Laure, 2016. "Random forest for ordinal responses: Prediction and variable selection," Computational Statistics & Data Analysis, Elsevier, vol. 96(C), pages 57-73.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gerhard Tutz, 2022. "Ordinal Trees and Random Forests: Score-Free Recursive Partitioning and Improved Ensembles," Journal of Classification, Springer;The Classification Society, vol. 39(2), pages 241-263, July.
    2. repec:osf:osfxxx:h8t4p_v1 is not listed on IDEAS
    3. Yiwei Fan & Jiaqi Gu & Guosheng Yin, 2023. "Sparse concordance‐based ordinal classification," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 50(3), pages 934-961, September.
    4. Buczak, Philip & Horn, Daniel & Pauly, Markus, 2024. "Old but Gold or New and Shiny? Comparing Tree Ensembles for Ordinal Prediction with a Classic Parametric Approach," OSF Preprints v7bcf, Center for Open Science.
    5. repec:osf:osfxxx:v7bcf_v1 is not listed on IDEAS
    6. Roman Hornung, 2020. "Ordinal Forests," Journal of Classification, Springer;The Classification Society, vol. 37(1), pages 4-17, April.
    7. Adolfo Morrone & Alfonso Piscitelli & Antonio D’Ambrosio, 2019. "How Disadvantages Shape Life Satisfaction: An Alternative Methodological Approach," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 141(1), pages 477-502, January.
    8. Michael Lechner & Gabriel Okasa, 2025. "Random Forest estimation of the ordered choice model," Empirical Economics, Springer, vol. 68(1), pages 1-106, January.
    9. repec:osf:osfxxx:ny6we_v1 is not listed on IDEAS
    10. Angela Maria D’Uggento & Alfonso Piscitelli & Nunziata Ribecco & Germana Scepi, 2023. "Perceived climate change risk and global green activism among young people," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 32(4), pages 1167-1195, October.
    11. Ejike R. Ugba & Daniel Mörlein & Jan Gertheiss, 2021. "Smoothing in Ordinal Regression: An Application to Sensory Data," Stats, MDPI, vol. 4(3), pages 1-18, July.
    12. Buczak, Philip, 2024. "fabOF: A Novel Tree Ensemble Method for Ordinal Prediction," OSF Preprints h8t4p, Center for Open Science.
    13. Buczak, Philip, 2024. "Mixed-Effects Frequency-Adjusted Borders Ordinal Forest: A Tree Ensemble Method for Ordinal Prediction with Hierarchical Data," OSF Preprints ny6we, Center for Open Science.
    14. Riccardo Di Francesco, 2023. "Ordered Correlation Forest," Papers 2309.08755, arXiv.org.
    15. Gabriel Okasa, 2022. "Meta-Learners for Estimation of Causal Effects: Finite Sample Cross-Fit Performance," Papers 2201.12692, arXiv.org.
    16. D'Alberto, R. & Targetti, S. & Schaller, L. & Bartolini, F. & Eichhorn, T. & Haltia, E. & Harmanny, K. & Le Gloux, F. & Nikolov, D. & Runge, T. & Vergamini, D. & Viaggi, D., 2024. "A European perspective on acceptability of innovative agri-environment-climate contract solutions," Land Use Policy, Elsevier, vol. 141(C).
    17. Backer, David & Billing, Trey, 2024. "Forecasting the prevalence of child acute malnutrition using environmental and conflict conditions as leading indicators," World Development, Elsevier, vol. 176(C).
    18. Luis A Barboza & Shu-Wei Chou-Chen & Paola Vásquez & Yury E García & Juan G Calvo & Hugo G Hidalgo & Fabio Sanchez, 2023. "Assessing dengue fever risk in Costa Rica by using climate variables and machine learning techniques," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 17(1), pages 1-13, January.
    19. William Magee, 2023. "Earnings, Intersectional Earnings Inequality, Disappointment in One’s Life Achievements and Life (Dis)satisfaction," Journal of Happiness Studies, Springer, vol. 24(1), pages 373-396, January.
    20. Mariana Oliveira & Luís Torgo & Vítor Santos Costa, 2021. "Evaluation Procedures for Forecasting with Spatiotemporal Data," Mathematics, MDPI, vol. 9(6), pages 1-27, March.
    21. Hanna Dudek & Joanna Landmesser, 2012. "Income satisfaction and relative deprivation," Statistics in Transition new series, Główny Urząd Statystyczny (Polska), vol. 13(2), pages 321-334, June.
    22. Bokelmann, Björn & Lessmann, Stefan, 2024. "Improving uplift model evaluation on randomized controlled trial data," European Journal of Operational Research, Elsevier, vol. 313(2), pages 691-707.
    23. Joel Podgorski & Oliver Kracht & Luis Araguas-Araguas & Stefan Terzer-Wassmuth & Jodie Miller & Ralf Straub & Rolf Kipfer & Michael Berg, 2024. "Groundwater vulnerability to pollution in Africa’s Sahel region," Nature Sustainability, Nature, vol. 7(5), pages 558-567, May.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jclass:v:42:y:2025:i:2:d:10.1007_s00357-024-09497-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.