IDEAS home Printed from https://ideas.repec.org/a/bpj/jqsprt/v8y2012i4n4.html

An Exploratory Study of Minor League Baseball Statistics

Author

Listed:
  • Chandler Gabriel

    (Pomona College)

  • Stevens Guy

    (Pomona College '13)

Abstract

We consider the problem of projecting future success of Minor League baseball players at each level of the farm system. Using tree based methods, in particular random forests, we consider which statistics are most correlated with Major League success, how Major League teams use these statistics differently in handling prospects, and how prior belief in a players ability, measured through draft position, is used throughout a players Minor League career. We show that roughly the 18th round prospect corresponds to being draft neutral for a team, whereas teams essentially make decisions based strictly on performance. We use for our data all position players drafted between 1999 and 2002.

Suggested Citation

  • Chandler Gabriel & Stevens Guy, 2012. "An Exploratory Study of Minor League Baseball Statistics," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 8(4), pages 1-28, November.
  • Handle: RePEc:bpj:jqsprt:v:8:y:2012:i:4:n:4
    DOI: 10.1515/1559-0410.1445
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/1559-0410.1445
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.1515/1559-0410.1445?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Neil Longley & Glenn Wong, 2011. "The speed of human capital formation in the baseball industry: the information value of minor‐league performance in predicting major‐league performance," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 32(3), pages 193-204, April.
    2. Archer, Kellie J. & Kimes, Ryan V., 2008. "Empirical characterization of random forest variable importance measures," Computational Statistics & Data Analysis, Elsevier, vol. 52(4), pages 2249-2260, January.
    3. Jason A. Winfree & Christopher J. Molitor, 2007. "The Value of College," Journal of Sports Economics, , vol. 8(4), pages 378-393, August.
    4. John D. Burger & Stephen J. K. Walters, 2009. "Uncertain Prospects," Journal of Sports Economics, , vol. 10(5), pages 485-501, October.
    5. Stephen J. Spurr, 2000. "The Baseball Draft," Journal of Sports Economics, , vol. 1(1), pages 66-85, February.
    6. Stephen J. Spurr & William Barber, 1994. "The Effect of Performance on a Worker's Career: Evidence from Minor League Baseball," ILR Review, Cornell University, ILR School, vol. 47(4), pages 692-708, July.
    7. Mills Brian M. & Salaga Steven, 2011. "Using Tree Ensembles to Analyze National Baseball Hall of Fame Voting Patterns: An Application to Discrimination in BBWAA Voting," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 7(4), pages 1-32, October.
    8. Dennis Coates & Babatunde Oguntimein, 2010. "The Length and Success of NBA Careers: Does College Production Predict Professional Outcomes?," International Journal of Sport Finance, Fitness Information Technology, vol. 5(1), pages 4-26, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lock Dennis & Nettleton Dan, 2014. "Using random forests to estimate win probability before each play of an NFL game," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 10(2), pages 197-205, June.
    2. Gerber Eric A. E. & Craig Bruce A., 2021. "A mixed effects multinomial logistic-normal model for forecasting baseball performance," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 17(3), pages 221-239, September.
    3. N. David Pifer & Christopher M. McLeod & William J. Travis & Colten R. Castleberry, 2020. "Who Should Sign a Professional Baseball Contract? Quantifying the Financial Opportunity Costs of Major League Draftees," Journal of Sports Economics, , vol. 21(7), pages 746-780, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. N. David Pifer & Christopher M. McLeod & William J. Travis & Colten R. Castleberry, 2020. "Who Should Sign a Professional Baseball Contract? Quantifying the Financial Opportunity Costs of Major League Draftees," Journal of Sports Economics, , vol. 21(7), pages 746-780, October.
    2. Justin Sims & Vittorio Addona, 2016. "Hurdle Models and Age Effects in the Major League Baseball Draft," Journal of Sports Economics, , vol. 17(7), pages 672-687, October.
    3. Seth R. Gitter & Thomas A. Rhoads, 2011. "Top Prospects and Minor League Baseball Attendance," Journal of Sports Economics, , vol. 12(3), pages 341-351, June.
    4. Michael A. Roach, 2022. "Career concerns and personnel investment in the Major League Baseball player draft," Economic Inquiry, Western Economic Association International, vol. 60(1), pages 413-426, January.
    5. Heather Mitchell & Constantino Stavros & Mark F. Stewart, 2011. "Does the Australian Football League Draft Undervalue Indigenous Australian Footballers?," Journal of Sports Economics, , vol. 12(1), pages 36-54, February.
    6. Richard J. Paulsen, 2022. "Peer effects and human capital accumulation: Time spent in college and productivity in the National Basketball Association," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 43(8), pages 3611-3619, December.
    7. Christopher Garmon, 2013. "Major League Baseball’s First Year Player Draft," Journal of Sports Economics, , vol. 14(5), pages 451-478, October.
    8. Lamperti, Francesco & Roventini, Andrea & Sani, Amir, 2018. "Agent-based model calibration using machine learning surrogates," Journal of Economic Dynamics and Control, Elsevier, vol. 90(C), pages 366-389.
    9. Mohamed Zine & Fouzi Harrou & Mohammed Terbeche & Mohammed Bellahcene & Abdelkader Dairi & Ying Sun, 2023. "E-Learning Readiness Assessment Using Machine Learning Methods," Sustainability, MDPI, vol. 15(11), pages 1-22, June.
    10. Fred A. Bellemore, 2001. "Racial and Ethnic Employment Discrimination," Journal of Sports Economics, , vol. 2(4), pages 356-368, November.
    11. Yigit Aydede & Jan Ditzen, 2022. "Identifying the regional drivers of influenza-like illness in Nova Scotia with dominance analysis," Papers 2212.06684, arXiv.org.
    12. Ying, Jinhuika & Su, Huasheng & He, Shouchao & Qiu, Guchuan & Chen, Xuezhen, 2025. "Belief dispersion in credit markets: Evidence from CDS-Bond basis," Finance Research Letters, Elsevier, vol. 86(PA).
    13. De Bock, Koen W. & Coussement, Kristof & Van den Poel, Dirk, 2010. "Ensemble classification based on generalized additive models," Computational Statistics & Data Analysis, Elsevier, vol. 54(6), pages 1535-1546, June.
    14. Ollech, Daniel & Webel, Karsten, 2020. "A random forest-based approach to identifying the most informative seasonality tests," Discussion Papers 55/2020, Deutsche Bundesbank.
    15. Ilias Thomas & Alex M. Dickens & Jussi P. Posti & Endre Czeiter & Daniel Duberg & Tim Sinioja & Matilda Kråkström & Isabel R. A. Retel Helmrich & Kevin K. W. Wang & Andrew I. R. Maas & Ewout W. Steyer, 2022. "Serum metabolome associated with severity of acute traumatic brain injury," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    16. Lu, Xuefei & Baraldi, Piero & Zio, Enrico, 2020. "A data-driven framework for identifying important components in complex systems," Reliability Engineering and System Safety, Elsevier, vol. 204(C).
    17. Göke Stefan & Prinz Joachim & Weimar Daniel, 2014. "Diamonds are Forever: Job-Matching and Career Success of Young Workers," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 234(4), pages 450-473, August.
    18. Mahyar Jahaninasab & Ehsan Taheran & S. Alireza Zarabadi & Mohammadreza Aghaei & Ali Rajabpour, 2023. "A Novel Approach for Reducing Feature Space Dimensionality and Developing a Universal Machine Learning Model for Coated Tubes in Cross-Flow Heat Exchangers," Energies, MDPI, vol. 16(13), pages 1-13, July.
    19. Jianghong Xu & Wei Lu & Weixin Wang, 2024. "From “fragile smallholders” to “resilient smallholders”: measuring rural household resilience in China," Humanities and Social Sciences Communications, Palgrave Macmillan, vol. 11(1), pages 1-14, December.
    20. Mirza, Fuat Kaan & Baykaş, Tunçer & Hekimoğlu, Mustafa & Pekcan, Önder & Tunçay, Gönül Paçacı, 2025. "Decoding rhythmic complexity: A nonlinear dynamics approach via visibility graphs for classifying asymmetrical rhythmic frameworks of Turkish classical music," Applied Mathematics and Computation, Elsevier, vol. 507(C).

    More about this item

    Keywords

    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:jqsprt:v:8:y:2012:i:4:n:4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyterbrill.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.