IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v96y2016icp57-73.html

Random forest for ordinal responses: Prediction and variable selection

Author

Listed:
  • Janitza, Silke
  • Tutz, Gerhard
  • Boulesteix, Anne-Laure

Abstract

The random forest method is a commonly used tool for classification with high-dimensional data that is able to rank candidate predictors through its inbuilt variable importance measures. It can be applied to various kinds of regression problems including nominal, metric and survival response variables. While classification and regression problems using random forest methodology have been extensively investigated in the past, in the case of ordinal response there is no standard procedure. Extensive studies using random forest based on conditional inference trees are conducted to explore whether incorporating the ordering information yields any improvement in both prediction performance or variable selection. Two novel permutation variable importance measures are presented that are reasonable alternatives to the currently implemented importance measure which was developed for nominal response and makes no use of the ordering in the levels of an ordinal response variable. Results based on simulated and real data suggest that predictor rankings can be improved in some settings by using new permutation importance measures that explicitly use the ordering in the response levels in combination with ordinal regression trees. With respect to prediction accuracy, the performance of ordinal regression trees was similar to and in most settings even slightly better than that of classification trees.

Suggested Citation

  • Janitza, Silke & Tutz, Gerhard & Boulesteix, Anne-Laure, 2016. "Random forest for ordinal responses: Prediction and variable selection," Computational Statistics & Data Analysis, Elsevier, vol. 96(C), pages 57-73.
  • Handle: RePEc:eee:csdana:v:96:y:2016:i:c:p:57-73
    DOI: 10.1016/j.csda.2015.10.005
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947315002601
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2015.10.005?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Raffaella Piccarreta, 2001. "A new measure of nominal-ordinal association," Journal of Applied Statistics, Taylor & Francis Journals, vol. 28(1), pages 107-120.
    2. Hothorn, Torsten & Hornik, Kurt & van de Wiel, Mark A. & Zeileis, Achim, 2006. "A Lego System for Conditional Inference," The American Statistician, American Statistical Association, vol. 60, pages 257-263, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tomáš Želinský, 2015. "Nekonzistentnosť časových preferencií ľudí z arginalizovaných rómskych komunít [On inconsistency of time preferences of people from the marginalised roma communities]," Politická ekonomie, Prague University of Economics and Business, vol. 2015(2), pages 204-222.
    2. repec:osf:osfxxx:ha4cw_v1 is not listed on IDEAS
    3. repec:jss:jstsof:36:i02 is not listed on IDEAS
    4. repec:plo:pone00:0186285 is not listed on IDEAS
    5. Georgina Milne & Andrew William Byrne & Emma Campbell & Jordon Graham & John McGrath & Raymond Kirke & Wilma McMaster & Jesko Zimmermann & Adewale Henry Adenuga, 2022. "Quantifying Land Fragmentation in Northern Irish Cattle Enterprises," Land, MDPI, vol. 11(3), pages 1-16, March.
    6. Payton J. Jones & Patrick Mair & Thorsten Simon & Achim Zeileis, 2020. "Network Trees: A Method for Recursively Partitioning Covariance Structures," Psychometrika, Springer;The Psychometric Society, vol. 85(4), pages 926-945, December.
    7. Seibold Heidi & Zeileis Achim & Hothorn Torsten, 2016. "Model-Based Recursive Partitioning for Subgroup Analyses," The International Journal of Biostatistics, De Gruyter, vol. 12(1), pages 45-63, May.
    8. M. Perakis & P. Maravelakis & S. Psarakis & E. Xekalaki & J. Panaretos, 2005. "On Certain Indices for Ordinal Data with Unequally Weighted Classes," Quality & Quantity: International Journal of Methodology, Springer, vol. 39(5), pages 515-536, October.
    9. McGinlay, James & Parsons, David J. & Morris, Joe & Hubatova, Marie & Graves, Anil & Bradbury, Richard B. & Bullock, James M., 2017. "Do charismatic species groups generate more cultural ecosystem service benefits?," Ecosystem Services, Elsevier, vol. 27(PA), pages 15-24.
    10. Marc Ditzhaus & Arnold Janssen, 2020. "Bootstrap and permutation rank tests for proportional hazards under right censoring," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 26(3), pages 493-517, July.
    11. Elsäßer Amelie & Victor Anja & Hommel Gerhard, 2011. "Multiple Testing in Candidate Gene Situations: A Comparison of Classical, Discrete, and Resampling-Based Procedures," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 10(1), pages 1-21, November.
    12. Jeff Allen & Santiago Carbo-Valverde & Sujit Chakravorti & Francisco Rodriguez-Fernandez & Oya Pinar Ardic, 2022. "Assessing incentives to increase digital payment acceptance and usage: A machine learning approach," PLOS ONE, Public Library of Science, vol. 17(11), pages 1-29, November.
    13. Julia Hehner & Laura Schneider & Anna Woitalla & Benjamin Ott & Kim Chi Thi Vu & Anja Schöbel & Torsten Hain & Dominik Schwudke & Eva Herker, 2024. "Glycerophospholipid remodeling is critical for orthoflavivirus infection," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    14. repec:jss:jstsof:28:i08 is not listed on IDEAS
    15. Torsten Hothorn & Achim Zeileis, 2008. "Generalized Maximally Selected Statistics," Biometrics, The International Biometric Society, vol. 64(4), pages 1263-1269, December.
    16. Jorge M. Arevalillo & Hilario Navarro, 2021. "Skewness-Kurtosis Model-Based Projection Pursuit with Application to Summarizing Gene Expression Data," Mathematics, MDPI, vol. 9(9), pages 1-18, April.
    17. Jeff Allen & Santiago Carbo Valverde & Sujit Chakravorti & Francisco Rodriguez-Fernandez & Oya Pinar Ardic, 2022. "Assessing Incentives to Increase Digital Payment Acceptance and Usage," World Bank Publications - Books, The World Bank Group, number 37487, April.
    18. Henning Sommermeyer & Hanna Krauss & Zuzanna Chęcińska-Maciejewska & Marcin Pszczola & Jacek Piątek, 2020. "Infantile Colic—The Perspective of German and Polish Pediatricians in 2020," IJERPH, MDPI, vol. 17(19), pages 1-11, September.
    19. Raffaella Piccarreta, 2008. "Classification trees for ordinal variables," Computational Statistics, Springer, vol. 23(3), pages 407-427, July.
    20. Claire E. Otero & Sophia Petkova & Martin Ebermann & Husam Taher & Nessy John & Katja Hoffmann & Angel Davalos & Matilda J. Moström & Roxanne M. Gilbride & Courtney R. Papen & Aaron Barber-Axthelm & E, 2025. "Rhesus Cytomegalovirus-encoded Fcγ-binding glycoproteins facilitate viral evasion from IgG-mediated humoral immunity," Nature Communications, Nature, vol. 16(1), pages 1-16, December.
    21. Ioannis Segos & Jens Eeckhoven & Simon Berger & Nikhil Mishra & Eric J. Lambie & Barbara Conradt, 2025. "Unequal segregation of mitochondria during asymmetric cell division contributes to cell fate divergence in sister cells in vivo," Nature Communications, Nature, vol. 16(1), pages 1-18, December.
    22. Luis Javier Madrigal-Roca, 2024. "Assessing the predictive value of morphological traits on primary lifestyle of birds through the extreme gradient boosting algorithm," PLOS ONE, Public Library of Science, vol. 19(1), pages 1-18, January.
    23. Paweł Świsłowski & Małgorzata Rajfur, 2025. "A bibliometric analysis and review of the use of LEGO topic in the scientific literature," Quality & Quantity: International Journal of Methodology, Springer, vol. 59(5), pages 4223-4242, October.
    24. Chavez, Alex K. & Bicchieri, Cristina, 2013. "Third-party sanctioning and compensation behavior: Findings from the ultimatum game," Journal of Economic Psychology, Elsevier, vol. 39(C), pages 268-277.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:96:y:2016:i:c:p:57-73. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.