IDEAS home Printed from https://ideas.repec.org/a/spr/aodasc/v10y2023i4d10.1007_s40745-021-00350-z.html
   My bibliography  Save this article

The Zero-Inflated Negative Binomial Semiparametric Regression Model: Application to Number of Failing Grades Data

Author

Listed:
  • Elton G. Aráujo

    (Universidade Federal de Mato Grosso do Sul)

  • Julio C. S. Vasconcelos

    (Universidade de São Paulo)

  • Denize P. Santos

    (Universidade de São Paulo)

  • Edwin M. M. Ortega

    (Universidade de São Paulo)

  • Dalton Souza

    (Universidade Federal de Mato Grosso do Sul)

  • João P. F. Zanetoni

    (Universidade Federal de Mato Grosso do Sul)

Abstract

In this paper we study the performance of college students, measured by the number of failing grades, considering various covariables that can positively or negatively influence this performance. The students in the sample were undergraduate business majors studying at night at a federal public university in the state of Mato Grosso do Sul, Brazil. Among the factors considered are covariables that had a linear and nonlinear relationship with the students’ performance. We also observed a high percentage of zeros, the reason we used a zero-inflated semiparametric regression model based on the negative binomial distribution to analyze our dataset.We used the penalized maximum likelihood method along with analysis of the residuals to verify the model’s assumptions. We present results, discussion and conclusions about the number of subjects failed by the students.

Suggested Citation

  • Elton G. Aráujo & Julio C. S. Vasconcelos & Denize P. Santos & Edwin M. M. Ortega & Dalton Souza & João P. F. Zanetoni, 2023. "The Zero-Inflated Negative Binomial Semiparametric Regression Model: Application to Number of Failing Grades Data," Annals of Data Science, Springer, vol. 10(4), pages 991-1006, August.
  • Handle: RePEc:spr:aodasc:v:10:y:2023:i:4:d:10.1007_s40745-021-00350-z
    DOI: 10.1007/s40745-021-00350-z
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s40745-021-00350-z
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s40745-021-00350-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Garay, Aldo M. & Hashimoto, Elizabeth M. & Ortega, Edwin M.M. & Lachos, Víctor H., 2011. "On estimation and influence diagnostics for zero-inflated negative binomial regression models," Computational Statistics & Data Analysis, Elsevier, vol. 55(3), pages 1304-1318, March.
    2. Feng Liu & Yong Shi, 2020. "Investigating Laws of Intelligence Based on AI IQ Research," Annals of Data Science, Springer, vol. 7(3), pages 399-416, September.
    3. Germán Ibacache-Pulgar & Gilberto Paula & Francisco Cysneiros, 2013. "Semiparametric additive models under symmetric distributions," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 22(1), pages 103-121, March.
    4. Xiaoli L. Etienne & Giancarlo Ferrara & Douglas Mugabe, 2019. "How efficient is maize production among smallholder farmers in Zimbabwe? A comparison of semiparametric and parametric frontier efficiency analyses," Applied Economics, Taylor & Francis Journals, vol. 51(26), pages 2855-2871, June.
    5. Md. Karimuzzaman & Nusrat Islam & Sabrina Afroz & Md. Moyazzem Hossain, 2021. "Predicting Stock Market Price of Bangladesh: A Comparative Study of Linear Classification Models," Annals of Data Science, Springer, vol. 8(1), pages 21-38, March.
    6. F. Belloc & A. Maruotti & L. Petrella, 2011. "How individual characteristics affect university students drop-out: a semiparametric mixed-effects model for an Italian case study," Journal of Applied Statistics, Taylor & Francis Journals, vol. 38(10), pages 2225-2239.
    7. James M. Tien, 2017. "Internet of Things, Real-Time Decision Making, and Artificial Intelligence," Annals of Data Science, Springer, vol. 4(2), pages 149-178, June.
    8. Hashimoto, Elizabeth M. & Ortega, Edwin M.M. & Paula, Gilberto A. & Barreto, Mauricio L., 2011. "Regression models for grouped survival data: Estimation and sensitivity analysis," Computational Statistics & Data Analysis, Elsevier, vol. 55(2), pages 993-1007, February.
    9. Tsu‐Tan Fu & Cliff J. Huang & Yung‐Lieh Yang, 2011. "Quality And Economies Of Scale In Higher Education: A Semiparametric Smooth Coefficient Estimation," Contemporary Economic Policy, Western Economic Association International, vol. 29(1), pages 138-149, January.
    10. R. A. Rigby & D. M. Stasinopoulos, 2005. "Generalized additive models for location, scale and shape," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 54(3), pages 507-554, June.
    11. Stasinopoulos, D. Mikis & Rigby, Robert A., 2007. "Generalized Additive Models for Location Scale and Shape (GAMLSS) in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 23(i07).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Luis Vanegas & Gilberto Paula, 2015. "A semiparametric approach for joint modeling of median and skewness," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 24(1), pages 110-135, March.
    2. Yixuan Wang & Jianzhu Li & Ping Feng & Rong Hu, 2015. "A Time-Dependent Drought Index for Non-Stationary Precipitation Series," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 29(15), pages 5631-5647, December.
    3. Panayi, Efstathios & Peters, Gareth W. & Danielsson, Jon & Zigrand, Jean-Pierre, 2018. "Designating market maker behaviour in limit order book markets," Econometrics and Statistics, Elsevier, vol. 5(C), pages 20-44.
    4. Gauss Cordeiro & Josemar Rodrigues & Mário Castro, 2012. "The exponential COM-Poisson distribution," Statistical Papers, Springer, vol. 53(3), pages 653-664, August.
    5. Christian Kleiber & Achim Zeileis, 2016. "Visualizing Count Data Regressions Using Rootograms," The American Statistician, Taylor & Francis Journals, vol. 70(3), pages 296-303, July.
    6. Matteo Malavasi & Gareth W. Peters & Pavel V. Shevchenko & Stefan Truck & Jiwook Jang & Georgy Sofronov, 2021. "Cyber Risk Frequency, Severity and Insurance Viability," Papers 2111.03366, arXiv.org, revised Mar 2022.
    7. Lucio Masserini & Matilde Bini & Monica Pratesi, 2017. "Effectiveness of non-selective evaluation test scores for predicting first-year performance in university career: a zero-inflated beta regression approach," Quality & Quantity: International Journal of Methodology, Springer, vol. 51(2), pages 693-708, March.
    8. Tong, Edward N.C. & Mues, Christophe & Thomas, Lyn, 2013. "A zero-adjusted gamma model for mortgage loan loss given default," International Journal of Forecasting, Elsevier, vol. 29(4), pages 548-562.
    9. Alexander Silbersdorff & Kai Sebastian Schneider, 2019. "Distributional Regression Techniques in Socioeconomic Research on the Inequality of Health with an Application on the Relationship between Mental Health and Income," IJERPH, MDPI, vol. 16(20), pages 1-28, October.
    10. Xueyan Xu & Fusheng Yu & Runjun Wan, 2023. "A Determining Degree-Based Method for Classification Problems with Interval-Valued Attributes," Annals of Data Science, Springer, vol. 10(2), pages 393-413, April.
    11. Tong, Edward N.C. & Mues, Christophe & Brown, Iain & Thomas, Lyn C., 2016. "Exposure at default models with and without the credit conversion factor," European Journal of Operational Research, Elsevier, vol. 252(3), pages 910-920.
    12. Micha{l} Narajewski & Florian Ziel, 2020. "Ensemble Forecasting for Intraday Electricity Prices: Simulating Trajectories," Papers 2005.01365, arXiv.org, revised Aug 2020.
    13. D. Chiru Naik & Sagar Rohidas Chavan & P. Sonali, 2023. "Incorporating the climate oscillations in the computation of meteorological drought over India," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 117(3), pages 2617-2646, July.
    14. Shuhui Guo & Lihua Xiong & Jie Chen & Shenglian Guo & Jun Xia & Ling Zeng & Chong-Yu Xu, 2023. "Nonstationary Regional Flood Frequency Analysis Based on the Bayesian Method," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 37(2), pages 659-681, January.
    15. Maike Hohberg & Katja Landau & Thomas Kneib & Stephan Klasen & Walter Zucchini, 2018. "Vulnerability to poverty revisited: Flexible modeling and better predictive performance," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 16(3), pages 439-454, September.
    16. Kuntz, Laura-Chloé, 2020. "Beta dispersion and market timing," Journal of Empirical Finance, Elsevier, vol. 59(C), pages 235-256.
    17. Epstein, Leonardo D. & Inostroza-Quezada, Ignacio E. & Goodstein, Ronald C. & Choi, S. Chan, 2021. "Dynamic effects of store promotions on purchase conversion: Expanding technology applications with innovative analytics," Journal of Business Research, Elsevier, vol. 128(C), pages 279-289.
    18. Serinaldi, Francesco, 2011. "Distributional modeling and short-term forecasting of electricity prices by Generalized Additive Models for Location, Scale and Shape," Energy Economics, Elsevier, vol. 33(6), pages 1216-1226.
    19. Yolanda M. Gómez & Diego I. Gallardo & Marcelo Bourguignon & Eduardo Bertolli & Vinicius F. Calsavara, 2023. "A general class of promotion time cure rate models with a new biological interpretation," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 29(1), pages 66-86, January.
    20. Christophe Croux & Irène Gijbels & Ilaria Prosdocimi, 2012. "Robust Estimation of Mean and Dispersion Functions in Extended Generalized Additive Models," Biometrics, The International Biometric Society, vol. 68(1), pages 31-44, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:aodasc:v:10:y:2023:i:4:d:10.1007_s40745-021-00350-z. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.