IDEAS home Printed from https://ideas.repec.org/p/pra/mprapa/97017.html
   My bibliography  Save this paper

Low sample size and regression: A Monte Carlo approach

Author

Listed:
  • Riveros Gavilanes, John Michael

Abstract

This article performs simulations with different small samples considering the regression techniques of OLS, Jackknife, Bootstrap, Lasso and Robust Regression in order to stablish the best approach in terms of lower bias and statistical significance with a pre-specified data generating process -DGP-. The methodology consists of a DGP with 5 variables and 1 constant parameter which was regressed among the simulations with a set of random normally distributed variables considering samples sizes of 6, 10, 20 and 500. Using the expected values discriminated by each sample size, the accuracy of the estimators was calculated in terms of the relative bias for each technique. The results indicate that Jackknife approach is more suitable for lower sample sizes as it was stated by Speed (1994), Bootstrap approach reported to be sensitive to a lower sample size indicating that it might not be suitable for stablish significant relationships in the regressions. The Monte Carlo simulations also reflected that when a significant relationship is found in small samples, this relationship will also tend to remain significant when the sample size is increased.

Suggested Citation

  • Riveros Gavilanes, John Michael, 2019. "Low sample size and regression: A Monte Carlo approach," MPRA Paper 97017, University Library of Munich, Germany.
  • Handle: RePEc:pra:mprapa:97017
    as

    Download full text from publisher

    File URL: https://mpra.ub.uni-muenchen.de/97017/7/MPRA_paper_97017.pdf
    File Function: original version
    Download Restriction: no

    File URL: https://mpra.ub.uni-muenchen.de/99465/1/MPRA_paper_99465.pdf
    File Function: revised version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Mingfeng Lin & Henry C. Lucas & Galit Shmueli, 2013. "Research Commentary ---Too Big to Fail: Large Samples and the p -Value Problem," Information Systems Research, INFORMS, vol. 24(4), pages 906-917, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. John Michael, Riveros Gavilanes, 2002. "Una consideración empírica preliminar del Coronavirus en Colombia [An empiric preliminary consideration of the Coronavirus en Colombia]," MPRA Paper 99291, University Library of Munich, Germany.
    2. Peter Mako & Andrej Dávid & Patrik Böhm & Sorin Savu, 2021. "Sustainable Transport in the Danube Region," Sustainability, MDPI, vol. 13(12), pages 1-21, June.
    3. Ashraf Zaghwan & Indra Gunawan, 2021. "Energy Loss Impact in Electrical Smart Grid Systems in Australia," Sustainability, MDPI, vol. 13(13), pages 1-34, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hsin-Han Chen & Hui-Ling Chen & Yi-Tien Lin & Chaou-Wen Lin & Chien-Chang Ho & Hsueh-Yi Lin & Po-Fu Lee, 2020. "The Associations between Functional Fitness Test Performance and Abdominal Obesity in Healthy Elderly People: Results from the National Physical Fitness Examination Survey in Taiwan," IJERPH, MDPI, vol. 18(1), pages 1-14, December.
    2. Yen-Chun Chou & Howard Hao-Chun Chuang, 2018. "A predictive investigation of first-time customer retention in online reservation services," Service Business, Springer;Pan-Pacific Business Association, vol. 12(4), pages 685-699, December.
    3. Tang, Kayu & Parsons, David J. & Jude, Simon, 2019. "Comparison of automatic and guided learning for Bayesian networks to analyse pipe failures in the water distribution system," Reliability Engineering and System Safety, Elsevier, vol. 186(C), pages 24-36.
    4. Claire Teunenbroek & René Bekkers & Bianca Beersma, 2021. "They ought to do it too: Understanding effects of social information on donation behavior and mood," International Review on Public and Nonprofit Marketing, Springer;International Association of Public and Non-Profit Marketing, vol. 18(2), pages 229-253, June.
    5. Khalilzadeh, Jalayer & Tasci, Asli D.A., 2017. "Large sample size, significance level, and the effect size: Solutions to perils of using big data for academic research," Tourism Management, Elsevier, vol. 62(C), pages 89-96.
    6. Daniel Homocianu, 2020. "A Methodology of Discovering Comparable Models. The Case of Investing in Retirement Accounts when Considering Age, Main Residence and Education before 1989 vs. Globalization," Scientific Annals of Economics and Business (continues Analele Stiintifice), Alexandru Ioan Cuza University, Faculty of Economics and Business Administration, vol. 67(4), pages 19-31, December.
    7. Kaizhi Yu & Yun Zhang & Hong Zou & Chenchen Wang, 2019. "Absolute Income, Income Inequality and the Subjective Well-Being of Migrant Workers in China: Toward an Understanding of the Relationship and Its Psychological Mechanisms," IJERPH, MDPI, vol. 16(14), pages 1-27, July.
    8. Irfan Kanat & Yili Hong & T. S. Raghu, 2018. "Surviving in Global Online Labor Markets for IT Services: A Geo-Economic Analysis," Information Systems Research, INFORMS, vol. 29(4), pages 893-909, December.
    9. Shrestha, Keshab & Subramaniam, Ravichandran & Rassiah, Puspavathy, 2017. "Pure martingale and joint normality tests for energy futures contracts," Energy Economics, Elsevier, vol. 63(C), pages 174-184.
    10. Viengkham, Doris & Baumann, Chris & Winzar, Hume & Dahana, Wirawan Dony, 2022. "Toward understanding Convergence and Divergence: Inter-ocular testing of traditional philosophies, economic orientation, and religiosity/spirituality," Journal of Business Research, Elsevier, vol. 139(C), pages 1335-1352.
    11. Notheisen, Benedikt & Marino, Vincenzo & Englert, Daniel & Weinhardt, Christof, 2019. "Trading stocks on blocks: The quality of decentralized markets," Working Paper Series in Economics 129, Karlsruhe Institute of Technology (KIT), Department of Economics and Management.
    12. Galit Shmueli, 2020. "Discussion on “Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer‐Lemeshow test” by Giovanni Nattino, Michael L. Pennell, and Stanley Lemeshow," Biometrics, The International Biometric Society, vol. 76(2), pages 561-563, June.
    13. Anthony G. Stacey, 2021. "Ages of cited references and growth of scientific knowledge: an explication of the gamma distribution in business and management disciplines," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(1), pages 619-640, January.
    14. Agnese Vitali & Romina Fraboni, 2022. "Pooling of Wealth in Marriage: The Role of Premarital Cohabitation," European Journal of Population, Springer;European Association for Population Studies, vol. 38(4), pages 721-754, October.
    15. Sergio Jimenez & Youlin Avila & George Dueñas & Alexander Gelbukh, 2020. "Automatic prediction of citability of scientific articles by stylometry of their titles and abstracts," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 3187-3232, December.
    16. Ana Fern'andez Vilas & Rebeca D'iaz Redondo & Ant'on Lorenzo Garc'ia, 2023. "The irruption of cryptocurrencies into Twitter cashtags: a classifying solution," Papers 2312.11531, arXiv.org.
    17. Michael Scholz & Markus Franz & Oliver Hinz, 2016. "The Ambiguous Identifier Clustering Technique," Electronic Markets, Springer;IIM University of St. Gallen, vol. 26(2), pages 143-156, May.
    18. Arenas-Márquez, F.J. & Martínez-Torres, M.R. & Toral, S.L., 2021. "How can trustworthy influencers be identified in electronic word-of-mouth communities?," Technological Forecasting and Social Change, Elsevier, vol. 166(C).
    19. Eduardo Oliveira & Vera L. Miguéis & José L. Borges, 2023. "Automatic root cause analysis in manufacturing: an overview & conceptualization," Journal of Intelligent Manufacturing, Springer, vol. 34(5), pages 2061-2078, June.
    20. Zhanfei Lei & Dezhi Yin & Han Zhang, 2021. "Focus Within or On Others: The Impact of Reviewers’ Attentional Focus on Review Helpfulness," Information Systems Research, INFORMS, vol. 32(3), pages 801-819, September.

    More about this item

    Keywords

    Small sample size; Statistical significance; Regression; Simulations; Bias;
    All these keywords.

    JEL classification:

    • C15 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Statistical Simulation Methods: General
    • C19 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Other
    • C63 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Computational Techniques

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pra:mprapa:97017. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Joachim Winter (email available below). General contact details of provider: https://edirc.repec.org/data/vfmunde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.