IDEAS home Printed from https://ideas.repec.org/a/bpj/jqsprt/v14y2018i2p37-56n1.html
   My bibliography  Save this article

Estimating the effect of plate discipline using a causal inference framework: an application of the G-computation algorithm

Author

Listed:
  • Vock David Michael

    (Division of Biostatistics, University of Minnesota Twin Cities, 420 Delaware St SE, MMC 303, Minneapolis, MN 55455, USA)

  • Vock Laura Frances Boehm

    (Department of Mathematics, Computer Science, and Statistics, Gustavus Adolphus College, Saint Peter, MN, USA)

Abstract

Offensive performance in baseball depends on a number of correlated factors: the pitches the batter faces, the batter’s choice to swing, and the batter’s hitting ability. Recently a renewed focus on the effect of plate discipline on batter performance has emerged. Plate discipline has traditionally been summarized as the proportion of pitches inside and outside of the strike zone a player swings at; however, there have been few metrics proposed to assess the effect of plate discipline directly on batters’ outcomes. In this paper, we focus on estimating a batter’s performance if he were able to adopt a different plate discipline. Because we wish to assess the effect of a counterfactual plate discipline, we use a potential outcome framework and show how the G-computation algorithm can be used to isolate the effect of plate discipline separately from a batter’s hitting ability or the types of pitches the batter faces. As an example, we implement our approach using data collected with the PITCHf/x system over the 2012–2014 seasons to identify the improvement Starlin Castro would expect to see in offensive performance were he able to adopt Andrew McCutchen’s plate discipline. We estimate that had Castro adopted McCutchen’s discipline his batting average, on-base percentage, and slugging percentage would have increased 0.017 (se = 0.004), 0.040 (se = 0.006), and 0.028 (se = 0.008), respectively.

Suggested Citation

  • Vock David Michael & Vock Laura Frances Boehm, 2018. "Estimating the effect of plate discipline using a causal inference framework: an application of the G-computation algorithm," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 14(2), pages 37-56, June.
  • Handle: RePEc:bpj:jqsprt:v:14:y:2018:i:2:p:37-56:n:1
    DOI: 10.1515/jqas-2016-0029
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/jqas-2016-0029
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.1515/jqas-2016-0029?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Baumer Ben S, 2008. "Why On-Base Percentage is a Better Indicator of Future Performance than Batting Average: An Algebraic Proof," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 4(2), pages 1-13, April.
    2. Daniel Almirall & Thomas Ten Have & Susan A. Murphy, 2010. "Structural Nested Mean Models for Assessing Time-Varying Effect Moderation," Biometrics, The International Biometric Society, vol. 66(1), pages 131-139, March.
    3. Phillips David C, 2011. "You're Hurting My Game: Lineup Protection and Injuries in Major League Baseball," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 7(3), pages 1-31, July.
    4. Albert James, 2006. "Pitching Statistics, Talent and Luck, and the Best Strikeout Seasons of All-Time," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 2(1), pages 1-32, January.
    5. Cain Lauren E. & Robins James M. & Lanoy Emilie & Logan Roger & Costagliola Dominique & Hernán Miguel A., 2010. "When to Start Treatment? A Systematic Approach to the Comparison of Dynamic Regimes Using Observational Data," The International Journal of Biostatistics, De Gruyter, vol. 6(2), pages 1-26, April.
    6. Simon N. Wood, 2003. "Thin plate regression splines," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 65(1), pages 95-114, February.
    7. Schmotzer Brian J & Switchenko Jeff & Kilgo Patrick D, 2008. "Did Steroid Use Enhance the Performance of the Mitchell Batters? The Effect of Alleged Performance Enhancing Drug Use on Offensive Performance from 1995 to 2007," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 4(3), pages 1-17, July.
    8. Kvam Paul H, 2011. "Comparing Hall of Fame Baseball Players Using Most Valuable Player Ranks," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 7(3), pages 1-22, July.
    9. Acharya Rohit A & Ahmed Alexander J & D'Amour Alexander N & Lu Haibo & Morris Carl N & Oglevee Bradley D & Peterson Andrew W & Swift Robert N, 2008. "Improving Major League Baseball Park Factor Estimates," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 4(2), pages 1-18, April.
    10. Nieswiadomy Michael L. & Strazicich Mark C. & Clayton Stephen, 2012. "Was There a Structural Break in Barry Bonds's Bat?," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 8(3), pages 1-19, October.
    11. Baumer Benjamin S. & Jensen Shane T. & Matthews Gregory J., 2015. "openWAR: An open source system for evaluating overall player performance in major league baseball," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 11(2), pages 69-84, June.
    12. Simon N. Wood, 2004. "Stable and Efficient Multiple Smoothing Parameter Estimation for Generalized Additive Models," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 673-686, January.
    13. Kaplan David, 2006. "A Variance Decomposition of Individual Offensive Baseball Performance," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 2(3), pages 1-18, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jyh-How Huang & Yu-Chia Hsu, 2021. "A Multidisciplinary Perspective on Publicly Available Sports Data in the Era of Big Data: A Scoping Review of the Literature on Major League Baseball," SAGE Open, , vol. 11(4), pages 21582440211, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Brian M. Mills, 2017. "Policy Changes In Major League Baseball: Improved Agent Behavior And Ancillary Productivity Outcomes," Economic Inquiry, Western Economic Association International, vol. 55(2), pages 1104-1118, April.
    2. Longhi, Christian & Musolesi, Antonio & Baumont, Catherine, 2014. "Modeling structural change in the European metropolitan areas during the process of economic integration," Economic Modelling, Elsevier, vol. 37(C), pages 395-407.
    3. Strasak, Alexander M. & Umlauf, Nikolaus & Pfeiffer, Ruth M. & Lang, Stefan, 2011. "Comparing penalized splines and fractional polynomials for flexible modelling of the effects of continuous predictor variables," Computational Statistics & Data Analysis, Elsevier, vol. 55(4), pages 1540-1551, April.
    4. E. Zanini & E. Eastoe & M. J. Jones & D. Randell & P. Jonathan, 2020. "Flexible covariate representations for extremes," Environmetrics, John Wiley & Sons, Ltd., vol. 31(5), August.
    5. McShane Blakeley B. & Braunstein Alexander & Piette James & Jensen Shane T., 2011. "A Hierarchical Bayesian Variable Selection Approach to Major League Baseball Hitting Metrics," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 7(4), pages 1-26, October.
    6. Sun, Tianyu & Chand, Satish & Sharpe, Keiran, 2018. "Effect of aging on housing prices: evidence from a panel data," MPRA Paper 94418, University Library of Munich, Germany, revised 01 Mar 2019.
    7. Roman Matousek & Nickolaos G. Tzeremes, 2021. "The asymmetric impact of human capital on economic growth," Empirical Economics, Springer, vol. 60(3), pages 1309-1334, March.
    8. Philip T. Reiss & R. Todd Ogden, 2010. "Functional Generalized Linear Models with Images as Predictors," Biometrics, The International Biometric Society, vol. 66(1), pages 61-69, March.
    9. Fukuyama, Hirofumi & Matousek, Roman & Tzeremes, Nickolaos G., 2020. "A Nerlovian cost inefficiency two-stage DEA model for modeling banks’ production process: Evidence from the Turkish banking system," Omega, Elsevier, vol. 95(C).
    10. Øystein Sørensen & Anders M. Fjell & Kristine B. Walhovd, 2023. "Longitudinal Modeling of Age-Dependent Latent Traits with Generalized Additive Latent and Mixed Models," Psychometrika, Springer;The Psychometric Society, vol. 88(2), pages 456-486, June.
    11. Yurko Ronald & Ventura Samuel & Horowitz Maksim, 2019. "nflWAR: a reproducible method for offensive player evaluation in football," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 15(3), pages 163-183, September.
    12. Ferrara, Giancarlo & Vidoli, Francesco & Canello, Jacopo & Campagna, Arianna, 2013. "Labour-use Efficiency in the Italian Machinery Industry: a Non-parametric Stochastic Frontier Perspective," MPRA Paper 94359, University Library of Munich, Germany.
    13. Musolesi Antonio & Mazzanti Massimiliano, 2014. "Nonlinearity, heterogeneity and unobserved effects in the carbon dioxide emissions-economic development relation for advanced countries," Studies in Nonlinear Dynamics & Econometrics, De Gruyter, vol. 18(5), pages 1-21, December.
    14. Lan Zhou & Huijun Pan, 2014. "Smoothing noisy data for irregular regions using penalized bivariate splines on triangulations," Computational Statistics, Springer, vol. 29(1), pages 263-281, February.
    15. Scott Tainsky & Brian M. Mills & Jason A. Winfree, 2015. "Further Examination of Potential Discrimination Among MLB Umpires," Journal of Sports Economics, , vol. 16(4), pages 353-374, May.
    16. Nickolaos G. Tzeremes, 2018. "Financial Development and Countries’ Production Efficiency: A Nonparametric Analysis," JRFM, MDPI, vol. 11(3), pages 1-13, August.
    17. Giampiero Marra & Rosalba Radice & Till Bärnighausen & Simon N. Wood & Mark E. McGovern, 2017. "A Simultaneous Equation Approach to Estimating HIV Prevalence With Nonignorable Missing Responses," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 484-496, April.
    18. Stefan Sperlich & Raoul Theler, 2015. "Modeling heterogeneity: a praise for varying-coefficient models in causal analysis," Computational Statistics, Springer, vol. 30(3), pages 693-718, September.
    19. Christos Kollias & Suzanna Maria Paleologou & Panayiotis Tzeremes & Nickolaos Tzeremes, 2018. "The demand for military spending in Latin American countries," Latin American Economic Review, Springer;Centro de Investigaciòn y Docencia Económica (CIDE), vol. 27(1), pages 1-17, December.
    20. Emma M. V. Blomgren & Mohsen Banaei & Razgar Ebrahimy & Olof Samuelsson & Francesco D’Ettorre & Henrik Madsen, 2023. "Intensive Data-Driven Model for Real-Time Observability in Low-Voltage Radial DSO Grids," Energies, MDPI, vol. 16(11), pages 1-22, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:jqsprt:v:14:y:2018:i:2:p:37-56:n:1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.