IDEAS home Printed from https://ideas.repec.org/a/spr/annopr/v335y2024i1d10.1007_s10479-023-05725-4.html
   My bibliography  Save this article

A data-driven optimization approach to baseball roster management

Author

Listed:
  • Sean Barnes

    (Netflix)

  • Margrét Bjarnadóttir

    (University of Maryland College Park)

  • Daniel Smolyak

    (University of Maryland College Park)

  • Aurélie Thiele

    (Southern Methodist University)

Abstract

Each year, major league baseball (MLB) teams face complex decisions about which players to retain and which players to recruit. In addition to operational, team and budget constraints, these decisions are further complicated by the fact that an athlete’s future performance and its impact on the team are both uncertain. In this paper, we combine prediction modeling with decision optimization to study the MLB free agent market. We develop optimization models for the allocation of a team’s recruitment budget using six different metrics that evaluate a player’s contributions to a team’s success. We consider both an ideal case, where each team can choose among all free agents, and a sequential case, where we assume that teams with stronger appeal (big market) are more successful in attracting talent, while teams with less pull must optimize their rosters over a much smaller pool of remaining players. Using the best-performing metric, which takes into account both players’ positions and their positional flexibility, we develop a series of quantitative tools that help teams, especially those with small budgets, identify (1) the players who deliver a key competitive advantage to their teams, appearing in both their ideal and sequential rosters and (2) the players who are in many ideal rosters and thus are likely to be hired by teams with big budgets, perhaps at a substantial salary premium. In order to gain and maintain an edge in the fiercely competitive free agent market, teams need to continuously adapt their strategies, and our models represent a first step towards prescriptive (not just predictive) analytics designed to help them do so. Further, our analysis indicates that a few players are in high demand from many teams (for instance, in every year of the period considered, the ten most in-demand players appear in the ideal rosters of at least seven teams), while most players appear in one ideal roster or none at all. Our models go beyond players’ individual performance metrics to help teams understand which players will be in high demand due to teams’ position needs in a given year. The results further emphasize the increasing importance of contract extensions as a strategy to bypass the free agent market.

Suggested Citation

  • Sean Barnes & Margrét Bjarnadóttir & Daniel Smolyak & Aurélie Thiele, 2024. "A data-driven optimization approach to baseball roster management," Annals of Operations Research, Springer, vol. 335(1), pages 33-58, April.
  • Handle: RePEc:spr:annopr:v:335:y:2024:i:1:d:10.1007_s10479-023-05725-4
    DOI: 10.1007/s10479-023-05725-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10479-023-05725-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10479-023-05725-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Albert James, 2006. "Pitching Statistics, Talent and Luck, and the Best Strikeout Seasons of All-Time," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 2(1), pages 1-32, January.
    2. Alexander Gross & Charles Link, 2017. "Does Option Theory Hold For Major League Baseball Contracts?," Economic Inquiry, Western Economic Association International, vol. 55(1), pages 425-433, January.
    3. Elitzur, Ramy, 2020. "Data analytics effects in major league baseball," Omega, Elsevier, vol. 90(C).
    4. Scully, Gerald W, 1974. "Pay and Performance in Major League Baseball," American Economic Review, American Economic Association, vol. 64(6), pages 915-930, December.
    5. Timothy C. Y. Chan & Douglas Fearing, 2019. "Process Flexibility in Baseball: The Value of Positional Flexibility," Management Science, INFORMS, vol. 65(4), pages 1642-1666, April.
    6. Jerry W. Kim & Brayden G King, 2014. "Seeing Stars: Matthew Effects and Status Bias in Major League Baseball Umpiring," Management Science, INFORMS, vol. 60(11), pages 2619-2644, November.
    7. Koop G., 2002. "Comparing the Performance of Baseball Players: A Multiple-Output Approach," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 710-720, September.
    8. Duane W. Rockerbie, 2009. "Strategic Free Agency in Baseball," Journal of Sports Economics, , vol. 10(3), pages 278-291, June.
    9. Breusch, T S & Pagan, A R, 1979. "A Simple Test for Heteroscedasticity and Random Coefficient Variation," Econometrica, Econometric Society, vol. 47(5), pages 1287-1294, September.
    10. Frederick Wiseman & Sangit Chatterjee, 2003. "Team payroll and team performance in major league baseball: 1985-2002," Economics Bulletin, AccessEcon, vol. 1(2), pages 1-10.
    11. Roger D. Blair & Brad R. Humphreys & Hyunwoong Pyun, 2017. "Monopsony Exploitation in Professional Sport: Evidence from Major League Baseball Position Players, 2000–2011," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 38(5), pages 676-688, July.
    12. Jahn K. Hakes & Raymond D. Sauer, 2006. "An Economic Evaluation of the Moneyball Hypothesis," Journal of Economic Perspectives, American Economic Association, vol. 20(3), pages 173-186, Summer.
    13. Doug J. Chung, 2017. "How Much Is a Win Worth? An Application to Intercollegiate Athletics," Management Science, INFORMS, vol. 63(2), pages 548-565, February.
    14. Lawrence M. Kahn, 1993. "Managerial Quality, Team Success, and Individual Player Performance in Major League Baseball," ILR Review, Cornell University, ILR School, vol. 46(3), pages 531-547, April.
    15. Clément Lesaege & Michael Poss, 2016. "The Partial Choice Recoverable Knapsack Problem," Lecture Notes in Economics and Mathematical Systems, in: Raquel J. Fonseca & Gerhard-Wilhelm Weber & João Telhada (ed.), Computational Management Science, edition 1, pages 189-194, Springer.
    16. repec:ebl:ecbull:v:1:y:2003:i:2:p:1-10 is not listed on IDEAS
    17. MacKinnon, James G. & White, Halbert, 1985. "Some heteroskedasticity-consistent covariance matrix estimators with improved finite sample properties," Journal of Econometrics, Elsevier, vol. 29(3), pages 305-325, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rodney Fort & Young Hoon Lee & Taeyeon Oh, 2019. "Quantile Insights on Market Structure and Worker Salaries: The Case of Major League Baseball," Journal of Sports Economics, , vol. 20(8), pages 1066-1087, December.
    2. Elitzur, Ramy, 2020. "Data analytics effects in major league baseball," Omega, Elsevier, vol. 90(C).
    3. Brian M. Mills, 2017. "Policy Changes In Major League Baseball: Improved Agent Behavior And Ancillary Productivity Outcomes," Economic Inquiry, Western Economic Association International, vol. 55(2), pages 1104-1118, April.
    4. Fort, Rodney & Maxcy, Joel & Diehl, Mark, 2016. "Uncertainty by regulation: Rottenberg׳s invariance principle," Research in Economics, Elsevier, vol. 70(3), pages 454-467.
    5. Joshua M. Congdon-Hohman & Jonathan A. Lanning, 2018. "Beyond Moneyball," Journal of Sports Economics, , vol. 19(7), pages 1046-1061, October.
    6. Carlo Bellavite Pellegrini & Raul Caruso & Marco Di Domizio, 2021. "Relative wages, payroll structure and performance in soccer. Evidence from Italian Serie A (2007-2019)," DISCE - Working Papers del Dipartimento di Politica Economica dipe0015, Università Cattolica del Sacro Cuore, Dipartimenti e Istituti di Scienze Economiche (DISCE).
    7. LE GALLO, Julie, 2000. "Econométrie spatiale 2 -Hétérogénéité spatiale," LATEC - Document de travail - Economie (1991-2003) 2001-01, LATEC, Laboratoire d'Analyse et des Techniques EConomiques, CNRS UMR 5118, Université de Bourgogne.
    8. Wen-Jhan Jane, 2013. "Overpayment and Reservation Salary in the Nippon Professional Baseball League," Journal of Sports Economics, , vol. 14(6), pages 563-583, December.
    9. Stefan Szymanski, 2010. "The Economic Design of Sporting Contests," Palgrave Macmillan Books, in: The Comparative Economics of Sport, chapter 1, pages 1-78, Palgrave Macmillan.
    10. Kubal, Jan & Kristoufek, Ladislav, 2022. "Exploring the relationship between Bitcoin price and network’s hashrate within endogenous system," International Review of Financial Analysis, Elsevier, vol. 84(C).
    11. Dufour, Jean-Marie & Khalaf, Lynda & Bernard, Jean-Thomas & Genest, Ian, 2004. "Simulation-based finite-sample tests for heteroskedasticity and ARCH effects," Journal of Econometrics, Elsevier, vol. 122(2), pages 317-347, October.
    12. repec:ver:wpaper:12/2012 is not listed on IDEAS
    13. Romano, Joseph P. & Wolf, Michael, 2017. "Resurrecting weighted least squares," Journal of Econometrics, Elsevier, vol. 197(1), pages 1-19.
    14. Jahn Hakes & Chad Turner, 2011. "Pay, productivity and aging in Major League Baseball," Journal of Productivity Analysis, Springer, vol. 35(1), pages 61-74, February.
    15. Robert Breunig & Bronwyn Garrett-Rumba & Mathieu Jardin & Yvon Rocaboy, 2014. "Wage dispersion and team performance: a theoretical model and evidence from baseball," Applied Economics, Taylor & Francis Journals, vol. 46(3), pages 271-281, January.
    16. Kenneth W. Clements & H. Y. Izan & Yihui Lan, 2009. "A Stochastic Measure of International Competitiveness," International Review of Finance, International Review of Finance Ltd., vol. 9(1‐2), pages 51-81, March.
    17. Chau, K.W. & Davies, Stephen N.G. & Lai, Lawrence W.C. & Lennon, H.T. Choy, 2023. "Museums for ex situ tangible heritage conservation: A neo-institutional analytical and empirical economic analysis," Land Use Policy, Elsevier, vol. 127(C).
    18. Marco Di Domizio & Carlo Bellavite Pellegrini & Raul Caruso, 2022. "Payroll dispersion and performance in soccer: A seasonal perspective analysis for Italian Serie A (2007–2021)," Contemporary Economic Policy, Western Economic Association International, vol. 40(3), pages 513-525, July.
    19. Adam Hoffer & Jared A. Pincin, 2019. "Quantifying NFL Players’ Value With the Help of Vegas Point Spreads Values," Journal of Sports Economics, , vol. 20(7), pages 959-974, October.
    20. Julie Le Gallo, 2000. "Spatial econometrics (2, Spatial heterogeneity) [Econométrie spatiale (2, Hétérogénéité spatiale)]," Working Papers hal-01526969, HAL.
    21. Perone, G.;, 2024. "Prioritizing investments in public healthcare to address the COVID-19 outbreak: Evidence from Europe and the South Caucasus," Health, Econometrics and Data Group (HEDG) Working Papers 24/05, HEDG, c/o Department of Economics, University of York.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:annopr:v:335:y:2024:i:1:d:10.1007_s10479-023-05725-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.