IDEAS home Printed from https://ideas.repec.org/p/pra/mprapa/97017.html

Low sample size and regression: A Monte Carlo approach

Author

Listed:
  • Riveros Gavilanes, John Michael

Abstract

This article performs simulations with different small samples considering the regression techniques of OLS, Jackknife, Bootstrap, Lasso and Robust Regression in order to stablish the best approach in terms of lower bias and statistical significance with a pre-specified data generating process -DGP-. The methodology consists of a DGP with 5 variables and 1 constant parameter which was regressed among the simulations with a set of random normally distributed variables considering samples sizes of 6, 10, 20 and 500. Using the expected values discriminated by each sample size, the accuracy of the estimators was calculated in terms of the relative bias for each technique. The results indicate that Jackknife approach is more suitable for lower sample sizes as it was stated by Speed (1994), Bootstrap approach reported to be sensitive to a lower sample size indicating that it might not be suitable for stablish significant relationships in the regressions. The Monte Carlo simulations also reflected that when a significant relationship is found in small samples, this relationship will also tend to remain significant when the sample size is increased.

Suggested Citation

  • Riveros Gavilanes, John Michael, 2019. "Low sample size and regression: A Monte Carlo approach," MPRA Paper 97017, University Library of Munich, Germany.
  • Handle: RePEc:pra:mprapa:97017
    as

    Download full text from publisher

    File URL: https://mpra.ub.uni-muenchen.de/97017/7/MPRA_paper_97017.pdf
    File Function: original version
    Download Restriction: no

    File URL: https://mpra.ub.uni-muenchen.de/99465/1/MPRA_paper_99465.pdf
    File Function: revised version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Mingfeng Lin & Henry C. Lucas & Galit Shmueli, 2013. "Research Commentary ---Too Big to Fail: Large Samples and the p -Value Problem," Information Systems Research, INFORMS, vol. 24(4), pages 906-917, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. John Michael, Riveros Gavilanes, 2002. "Una consideración empírica preliminar del Coronavirus en Colombia [An empiric preliminary consideration of the Coronavirus en Colombia]," MPRA Paper 99291, University Library of Munich, Germany.
    2. Peter Mako & Andrej Dávid & Patrik Böhm & Sorin Savu, 2021. "Sustainable Transport in the Danube Region," Sustainability, MDPI, vol. 13(12), pages 1-21, June.
    3. Ashraf Zaghwan & Indra Gunawan, 2021. "Energy Loss Impact in Electrical Smart Grid Systems in Australia," Sustainability, MDPI, vol. 13(13), pages 1-34, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hsin-Han Chen & Hui-Ling Chen & Yi-Tien Lin & Chaou-Wen Lin & Chien-Chang Ho & Hsueh-Yi Lin & Po-Fu Lee, 2020. "The Associations between Functional Fitness Test Performance and Abdominal Obesity in Healthy Elderly People: Results from the National Physical Fitness Examination Survey in Taiwan," IJERPH, MDPI, vol. 18(1), pages 1-14, December.
    2. Thomas Görzen, 2019. "Can Experience be Trusted? Investigating the Effect of Experience on Decision Biases in Crowdworking Platforms," Working Papers Dissertations 55, Paderborn University, Faculty of Business Administration and Economics.
    3. Yi Yang & Kunpeng Zhang & Yangyang Fan, 2023. "sDTM: A Supervised Bayesian Deep Topic Model for Text Analytics," Information Systems Research, INFORMS, vol. 34(1), pages 137-156, March.
    4. Evangelista, Rui & Ramalho, Esmeralda A. & Andrade e Silva, João, 2020. "On the use of hedonic regression models to measure the effect of energy efficiency on residential property transaction prices: Evidence for Portugal and selected data issues," Energy Economics, Elsevier, vol. 86(C).
    5. Yen-Chun Chou & Howard Hao-Chun Chuang, 2018. "A predictive investigation of first-time customer retention in online reservation services," Service Business, Springer;Pan-Pacific Business Association, vol. 12(4), pages 685-699, December.
    6. Tang, Kayu & Parsons, David J. & Jude, Simon, 2019. "Comparison of automatic and guided learning for Bayesian networks to analyse pipe failures in the water distribution system," Reliability Engineering and System Safety, Elsevier, vol. 186(C), pages 24-36.
    7. Sujin Park & Ali Tafti & Galit Shmueli, 2024. "Transporting Causal Effects Across Populations Using Structural Causal Modeling: An Illustration to Work-from-Home Productivity," Information Systems Research, INFORMS, vol. 35(2), pages 686-705, June.
    8. Claire Teunenbroek & René Bekkers & Bianca Beersma, 2021. "They ought to do it too: Understanding effects of social information on donation behavior and mood," International Review on Public and Nonprofit Marketing, Springer;International Association of Public and Non-Profit Marketing, vol. 18(2), pages 229-253, June.
    9. Khalilzadeh, Jalayer & Tasci, Asli D.A., 2017. "Large sample size, significance level, and the effect size: Solutions to perils of using big data for academic research," Tourism Management, Elsevier, vol. 62(C), pages 89-96.
    10. Juan Manuel Pérez-Salamero González & Marta Regúlez-Castillo & Carlos Vidal-Meliá, 2017. "The continuous sample of working lives: improving its representativeness," SERIEs: Journal of the Spanish Economic Association, Springer;Spanish Economic Association, vol. 8(1), pages 43-95, March.
    11. Thomas Niemand & Sascha Kraus & Martin Angerer & Ferdinand Thies & Alicia Mas-Tur, 2019. "More is not always better—non-linear effects in crowdfunding," International Journal of Quality Innovation, Springer, vol. 5(1), pages 1-10, December.
    12. Pei-Yu Chen & Yili Hong & Ying Liu, 2018. "The Value of Multidimensional Rating Systems: Evidence from a Natural Experiment and Randomized Experiments," Management Science, INFORMS, vol. 64(10), pages 4629-4647, October.
    13. Pusch, Natasha, 2024. "Estimating the prevalence and frequency of adolescent substance use using zero-inflated models and variables associated with social learning, social bond, and opportunity," Journal of Criminal Justice, Elsevier, vol. 94(C).
    14. Daniel Homocianu, 2020. "A Methodology of Discovering Comparable Models. The Case of Investing in Retirement Accounts when Considering Age, Main Residence and Education before 1989 vs. Globalization," Scientific Annals of Economics and Business (continues Analele Stiintifice), Alexandru Ioan Cuza University, Faculty of Economics and Business Administration, vol. 67(4), pages 19-31, December.
    15. Kaizhi Yu & Yun Zhang & Hong Zou & Chenchen Wang, 2019. "Absolute Income, Income Inequality and the Subjective Well-Being of Migrant Workers in China: Toward an Understanding of the Relationship and Its Psychological Mechanisms," IJERPH, MDPI, vol. 16(14), pages 1-27, July.
    16. Irfan Kanat & Yili Hong & T. S. Raghu, 2018. "Surviving in Global Online Labor Markets for IT Services: A Geo-Economic Analysis," Information Systems Research, INFORMS, vol. 29(4), pages 893-909, December.
    17. Jae H. Kim & Kamran Ahmed & Philip Inyeob Ji, 2018. "Significance Testing in Accounting Research: A Critical Evaluation Based on Evidence," Abacus, Accounting Foundation, University of Sydney, vol. 54(4), pages 524-546, December.
    18. Malek Simon Grimm & Ralf Wagner, 2024. "Challenging the linearity assumption of intra-brand image confusion," Journal of Marketing Analytics, Palgrave Macmillan, vol. 12(2), pages 355-374, June.
    19. Qiu-Hong Wang & Ruibin Geng & Seung Hyun Kim, 2024. "Chilling Effect of the Enforcement of Computer Misuse Act: Evidence from Publicly Accessible Hack Forums," Information Systems Research, INFORMS, vol. 35(3), pages 1195-1215, September.
    20. Shrestha, Keshab & Subramaniam, Ravichandran & Rassiah, Puspavathy, 2017. "Pure martingale and joint normality tests for energy futures contracts," Energy Economics, Elsevier, vol. 63(C), pages 174-184.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    JEL classification:

    • C15 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Statistical Simulation Methods: General
    • C19 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Other
    • C63 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Computational Techniques

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pra:mprapa:97017. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Joachim Winter (email available below). General contact details of provider: https://edirc.repec.org/data/vfmunde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.