IDEAS home Printed from https://ideas.repec.org/a/spr/psycho/v88y2023i4d10.1007_s11336-022-09882-6.html
   My bibliography  Save this article

The Bradley–Terry Regression Trunk approach for Modeling Preference Data with Small Trees

Author

Listed:
  • Alessio Baldassarre

    (University of Cagliari)

  • Elise Dusseldorp

    (Leiden University)

  • Antonio D’Ambrosio

    (University of Naples Federico II)

  • Mark de Rooij

    (Leiden University)

  • Claudio Conversano

    (University of Cagliari)

Abstract

This paper introduces the Bradley–Terry regression trunk model, a novel probabilistic approach for the analysis of preference data expressed through paired comparison rankings. In some cases, it may be reasonable to assume that the preferences expressed by individuals depend on their characteristics. Within the framework of tree-based partitioning, we specify a tree-based model estimating the joint effects of subject-specific covariates over and above their main effects. We, therefore, combine a tree-based model and the log-linear Bradley-Terry model using the outcome of the comparisons as response variable. The proposed model provides a solution to discover interaction effects when no a-priori hypotheses are available. It produces a small tree, called trunk, that represents a fair compromise between a simple interpretation of the interaction effects and an easy to read partition of judges based on their characteristics and the preferences they have expressed. We present an application on a real dataset following two different approaches, and a simulation study to test the model’s performance. Simulations showed that the quality of the model performance increases when the number of rankings and objects increases. In addition, the performance is considerably amplified when the judges’ characteristics have a high impact on their choices.

Suggested Citation

  • Alessio Baldassarre & Elise Dusseldorp & Antonio D’Ambrosio & Mark de Rooij & Claudio Conversano, 2023. "The Bradley–Terry Regression Trunk approach for Modeling Preference Data with Small Trees," Psychometrika, Springer;The Psychometric Society, vol. 88(4), pages 1443-1465, December.
  • Handle: RePEc:spr:psycho:v:88:y:2023:i:4:d:10.1007_s11336-022-09882-6
    DOI: 10.1007/s11336-022-09882-6
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11336-022-09882-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11336-022-09882-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Anders Skrondal & Sophia Rabe-Hesketh, 2003. "Multilevel logistic regression for polytomous data and rankings," Psychometrika, Springer;The Psychometric Society, vol. 68(2), pages 267-287, June.
    2. Claudio Conversano & Elise Dusseldorp, 2017. "Modeling Threshold Interaction Effects Through the Logistic Classification Trunk," Journal of Classification, Springer;The Classification Society, vol. 34(3), pages 399-426, October.
    3. Achim Zeileis & Kurt Hornik, 2007. "Generalized M‐fluctuation tests for parameter instability," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 61(4), pages 488-508, November.
    4. Brian Francis & Regina Dittrich & Reinhold Hatzinger & Roger Penn, 2002. "Analysing partial ranks by using smoothed paired comparison methods: an investigation of value orientation in Europe," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 51(3), pages 319-336, July.
    5. Turner, Heather & Firth, David, 2012. "Bradley-Terry Models in R: The BradleyTerry2 Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 48(i09).
    6. Vicente Rodríguez Montequín & Joaquín Manuel Villanueva Balsera & Marina Díaz Piloñeta & César Álvarez Pérez, 2020. "A Bradley-Terry Model-Based Approach to Prioritize the Balance Scorecard Driving Factors: The Case Study of a Financial Software Factory," Mathematics, MDPI, vol. 8(2), pages 1-15, February.
    7. Carolin Strobl & Florian Wickelmaier & Achim Zeileis, 2011. "Accounting for Individual Differences in Bradley-Terry Models by Means of Recursive Partitioning," Journal of Educational and Behavioral Statistics, , vol. 36(2), pages 135-153, April.
    8. Antonio D’Ambrosio & Willem J. Heiser, 2016. "A Recursive Partitioning Method for the Prediction of Preference Rankings Based Upon Kemeny Distances," Psychometrika, Springer;The Psychometric Society, vol. 81(3), pages 774-794, September.
    9. Hatzinger, Reinhold & Dittrich, Regina, 2012. "prefmod: An R Package for Modeling Preferences Based on Paired Comparisons, Rankings, or Ratings," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 48(i10).
    10. Elise Dusseldorp & Jacqueline Meulman, 2004. "The regression trunk approach to discover treatment covariate interaction," Psychometrika, Springer;The Psychometric Society, vol. 69(3), pages 355-374, September.
    11. Frank Busing & Patrick Groenen & Willem Heiser, 2005. "Avoiding degeneracy in multidimensional unfolding by penalizing on the coefficient of variation," Psychometrika, Springer;The Psychometric Society, vol. 70(1), pages 71-98, March.
    12. Amodio, S. & D’Ambrosio, A. & Siciliano, R., 2016. "Accurate algorithms for identifying the median ranking when dealing with weak and partial rankings under the Kemeny axiomatic approach," European Journal of Operational Research, Elsevier, vol. 249(2), pages 667-676.
    13. Dittrich, Regina & Francis, Brian & Hatzinger, Reinhold & Katzenbeisser, Walter, 2006. "Modelling dependency in multivariate paired comparisons: A log-linear approach," Mathematical Social Sciences, Elsevier, vol. 52(2), pages 197-209, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Antonio D’Ambrosio & Willem J. Heiser, 2016. "A Recursive Partitioning Method for the Prediction of Preference Rankings Based Upon Kemeny Distances," Psychometrika, Springer;The Psychometric Society, vol. 81(3), pages 774-794, September.
    2. Yu-Shan Shih & Kuang-Hsun Liu, 2019. "Regression trees for detecting preference patterns from rank data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(3), pages 683-702, September.
    3. Antonio D’Ambrosio & Carmela Iorio & Michele Staiano & Roberta Siciliano, 2019. "Median constrained bucket order rank aggregation," Computational Statistics, Springer, vol. 34(2), pages 787-802, June.
    4. Weichen Wu & Nynke Niezink & Brian Junker, 2022. "A diagnostic framework for the Bradley–Terry model," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(S2), pages 461-484, December.
    5. Antonella Plaia & Simona Buscemi & Johannes Fürnkranz & Eneldo Loza Mencía, 2022. "Comparing Boosting and Bagging for Decision Trees of Rankings," Journal of Classification, Springer;The Classification Society, vol. 39(1), pages 78-99, March.
    6. Anna Gottard & Giorgio Calzolari, 2014. "Alternative estimating procedures for multiple membership logit models with mixed effects: indirect inference and data cloning," Econometrics Working Papers Archive 2014_07, Universita' degli Studi di Firenze, Dipartimento di Statistica, Informatica, Applicazioni "G. Parenti".
    7. Yoo, Yeawon & Escobedo, Adolfo R. & Skolfield, J. Kyle, 2020. "A new correlation coefficient for comparing and aggregating non-strict and incomplete rankings," European Journal of Operational Research, Elsevier, vol. 285(3), pages 1025-1041.
    8. Antonella Plaia & Simona Buscemi & Mariangela Sciandra, 2021. "Consensus among preference rankings: a new weighted correlation coefficient for linear and weak orderings," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 15(4), pages 1015-1037, December.
    9. Cascón, J.M. & González-Arteaga, T. & de Andrés Calle, R., 2022. "A new preference classification approach: The λ-dissensus cluster algorithm," Omega, Elsevier, vol. 111(C).
    10. Wickelmaier, Florian & Strobl, Carolin & Zeileis, Achim, 2012. "Psychoco: Psychometric Computing in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 48(i01).
    11. Jones, Payton J. & Mair, Patrick & Simon, Thorsten & Zeileis, Achim, 2019. "Network Model Trees," OSF Preprints ha4cw, Center for Open Science.
    12. Giulia Vannucci & Anna Gottard, 2023. "An evolutionary estimation procedure for generalized semilinear regression trees," Computational Statistics, Springer, vol. 38(4), pages 1927-1946, December.
    13. Marta Nai Ruscone & Daniel Fernández & Antonio D’Ambrosio, 2024. "Copula-Based Non-Metric Unfolding on Augmented Data Matrix," Journal of Classification, Springer;The Classification Society, vol. 41(3), pages 678-697, November.
    14. Carolin Strobl & Florian Wickelmaier & Achim Zeileis, 2011. "Accounting for Individual Differences in Bradley-Terry Models by Means of Recursive Partitioning," Journal of Educational and Behavioral Statistics, , vol. 36(2), pages 135-153, April.
    15. Gunther Schauberger & Andreas Groll & Gerhard Tutz, 2018. "Analysis of the importance of on-field covariates in the German Bundesliga," Journal of Applied Statistics, Taylor & Francis Journals, vol. 45(9), pages 1561-1578, July.
    16. Alwyn Lim & Shawn Pope, 2022. "What drives companies to do good? A “universal” ordering of corporate social responsibility motivations," Corporate Social Responsibility and Environmental Management, John Wiley & Sons, vol. 29(1), pages 233-255, January.
    17. Daniel Wochner, 2020. "Dynamic Factor Trees and Forests – A Theory-led Machine Learning Framework for Non-Linear and State-Dependent Short-Term U.S. GDP Growth Predictions," KOF Working papers 20-472, KOF Swiss Economic Institute, ETH Zurich.
    18. Rowland G. Seymour & David Sirl & Simon P. Preston & Ian L. Dryden & Madeleine J. A. Ellis & Bertrand Perrat & James Goulding, 2022. "The Bayesian Spatial Bradley–Terry model: Urban deprivation modelling in Tanzania," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(2), pages 288-308, March.
    19. Martin Kroh, 2008. "The Preadult Origins of Post-Materialism: A Longitudinal Sibling Study," Discussion Papers of DIW Berlin 797, DIW Berlin, German Institute for Economic Research.
    20. Martin Kroh, 2008. "The Preadult Origins of Post-Materialism: A Longitudinal Sibling Study," SOEPpapers on Multidisciplinary Panel Data Research 101, DIW Berlin, The German Socio-Economic Panel (SOEP).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:psycho:v:88:y:2023:i:4:d:10.1007_s11336-022-09882-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.