Sample size implications when biases are modelled rather than ignored

Author

Paul Gustafson

Abstract

Summary. Realistic statistical modelling of observational data often suggests a statistical model which is not fully identified, owing to potential biases that are not under the control of study investigators. Bayesian inference can be implemented with such a model, ideally with the most precise prior knowledge that can be ascertained. However, as a consequence of the non‐identifiability, inference cannot be made arbitrarily accurate by choosing the sample size to be sufficiently large. In turn, this has consequences for sample size determination. The paper presents a sample size criterion that is based on a quantification of how much Bayesian learning can arise in a given non‐identified model. A global perspective is adopted, whereby choosing larger sample sizes for some studies necessarily implies that some other potentially worthwhile studies cannot be undertaken. This suggests that smaller sample sizes should be selected with non‐identified models, as larger sample sizes constitute a squandering of resources in making estimator variances very small compared with their biases. Particularly, consider two investigators planning the same study, one of whom admits to the potential biases at hand and consequently uses a non‐identified model, whereas the other pretends that there are no biases, leading to an identified but less realistic model. It is seen that the former investigator always selects a smaller sample size than the latter, with the difference being quite marked in some illustrative cases.
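The point that precision stops improving with sample size can be illustrated with a toy calculation. The sketch below is not the sample size criterion of the paper; it assumes a deliberately simple setting in which a sample mean estimates a target plus an unknown bias, the bias has a normal prior with standard deviation `tau`, and the target has a flat prior, so that the posterior standard deviation of the target is sqrt(sigma^2/n + tau^2). The function names and the tolerance rule are hypothetical, chosen only for illustration.

```python
import math


def posterior_sd(n, sigma, tau):
    """Posterior SD of the target in the toy model (an assumption, not the paper's model).

    sigma: sampling SD of a single observation
    tau:   prior SD of the unidentified bias term
    """
    return math.sqrt(sigma ** 2 / n + tau ** 2)


def smallest_n_within(sigma, tau, tol):
    """Smallest n whose posterior SD is within a factor (1 + tol) of the floor tau.

    Solves sigma^2 / n + tau^2 <= ((1 + tol) * tau)^2 for n.
    """
    if tau <= 0 or tol <= 0:
        raise ValueError("tau and tol must be positive")
    n = math.ceil(sigma ** 2 / (tau ** 2 * ((1 + tol) ** 2 - 1)))
    return max(n, 1)


if __name__ == "__main__":
    sigma, tau = 1.0, 0.1
    for n in (100, 1_000, 10_000, 100_000):
        print(n, round(posterior_sd(n, sigma, tau), 4))
    print("n for SD within 10% of floor:", smallest_n_within(sigma, tau, 0.10))
```

In this toy setting the posterior SD never falls below `tau`, however large `n` becomes, so observations beyond the point where the sampling term is small relative to `tau` buy very little. An investigator who sets `tau` to zero (pretending there is no bias) sees no such floor and would be led to a larger sample size.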

Suggested Citation

Paul Gustafson, 2006. "Sample size implications when biases are modelled rather than ignored," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 169(4), pages 865-881, October.

Handle: RePEc:bla:jorssa:v:169:y:2006:i:4:p:865-881
DOI: 10.1111/j.1467-985X.2006.00436.x

Citations

Cited by:

Gustafson Paul, 2010. "Bayesian Inference for Partially Identified Models," The International Journal of Biostatistics, De Gruyter, vol. 6(2), pages 1-20, March.
P. Gustafson & L. C. McCandless & A. R. Levy & S. Richardson, 2010. "Simplified Bayesian Sensitivity Analysis for Mismeasured and Unobserved Confounders," Biometrics, The International Biometric Society, vol. 66(4), pages 1129-1137, December.
Wang Dongxu & Shen Tian & Gustafson Paul, 2012. "Partial Identification arising from Nondifferential Exposure Misclassification: How Informative are Data on the Unlikely, Maybe, and Likely Exposed?," The International Journal of Biostatistics, De Gruyter, vol. 8(1), pages 1-27, November.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Sander Greenland, 2005. "Multiple‐bias modelling for analysis of observational data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 168(2), pages 267-306, March.
Paul Gustafson & Sander Greenland, 2006. "The Performance of Random Coefficient Regression in Accounting for Residual Confounding," Biometrics, The International Biometric Society, vol. 62(3), pages 760-768, September.
Sander Greenland, 2005. "Discussion on "Statistical Issues Arising in the Women's Health Initiative"," Biometrics, The International Biometric Society, vol. 61(4), pages 920-921, December.
Zhuoyu Wang & Nandini Dendukuri & Madhukar Pai & Lawrence Joseph, 2017. "Taking Costs and Diagnostic Test Accuracy into Account When Designing Prevalence Studies: An Application to Childhood Tuberculosis Prevalence," Medical Decision Making, vol. 37(8), pages 922-929, November.
Paul Gustafson, 2007. "Measurement error modelling with an approximate instrumental variable," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(5), pages 797-815, November.
Stamey, James & Gerlach, Richard, 2007. "Bayesian sample size determination for case-control studies with misclassification," Computational Statistics & Data Analysis, Elsevier, vol. 51(6), pages 2982-2992, March.
Nandini Dendukuri & Elham Rahme & Patrick Bélisle & Lawrence Joseph, 2004. "Bayesian Sample Size Determination for Prevalence and Diagnostic Test Studies in the Absence of a Gold Standard Test," Biometrics, The International Biometric Society, vol. 60(2), pages 388-397, June.
Paul Gustafson & Lawrence C. McCandless, 2010. "Probabilistic Approaches to Better Quantifying the Results of Epidemiologic Studies," IJERPH, MDPI, vol. 7(4), pages 1-20, April.
Martin Ladouceur & Elham Rahme & Christian A. Pineau & Lawrence Joseph, 2007. "Robustness of Prevalence Estimates Derived from Misclassified Data from Administrative Databases," Biometrics, The International Biometric Society, vol. 63(1), pages 272-279, March.
Lawrence C. McCandless & Sylvia Richardson & Nicky Best, 2012. "Adjustment for Missing Confounders Using External Validation Data and Propensity Scores," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(497), pages 40-51, March.
Julian P. T. Higgins & Simon G. Thompson & David J. Spiegelhalter, 2009. "A re‐evaluation of random‐effects meta‐analysis," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 172(1), pages 137-159, January.
Yeojin Chung & Sophia Rabe-Hesketh & Vincent Dorie & Andrew Gelman & Jingchen Liu, 2013. "A Nondegenerate Penalized Likelihood Estimator for Variance Parameters in Multilevel Models," Psychometrika, Springer;The Psychometric Society, vol. 78(4), pages 685-709, October.
Al-Kandari Noriah M. & Lahiri Partha, 2016. "Prediction of a Function of Misclassified Binary Data," Statistics in Transition New Series, Statistics Poland, vol. 17(3), pages 429-447, September.
Douglas E. Schaubel & Guanghui Wei, 2011. "Double Inverse-Weighted Estimation of Cumulative Treatment Effects Under Nonproportional Hazards and Dependent Censoring," Biometrics, The International Biometric Society, vol. 67(1), pages 29-38, March.
Paul Gustafson & Nhu D. Le, 2002. "Comparing the Effects of Continuous and Discrete Covariate Mismeasurement, with Emphasis on the Dichotomization of Mismeasured Predictors," Biometrics, The International Biometric Society, vol. 58(4), pages 878-887, December.
Carlos Díaz-Venegas, 2014. "Identifying the Confounders of Marginalization and Mortality in Mexico, 2003–2007," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 118(2), pages 851-875, September.
A. Goubar & A. E. Ades & D. De Angelis & C. A. McGarrigle & C. H. Mercer & P. A. Tookey & K. Fenton & O. N. Gill, 2008. "Estimates of human immunodeficiency virus prevalence and proportion diagnosed based on Bayesian multiparameter synthesis of surveillance data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 171(3), pages 541-580, June.
McCandless Lawrence C., 2012. "Meta-Analysis of Observational Studies with Unmeasured Confounders," The International Journal of Biostatistics, De Gruyter, vol. 8(2), pages 1-31, January.
Maria Gheorghe & Susan Picavet & Monique Verschuren & Werner B. F. Brouwer & Pieter H. M. Baal, 2017. "Health losses at the end of life: a Bayesian mixed beta regression approach," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 180(3), pages 723-749, June.
Martijn van Hasselt & Christopher R. Bollinger & Jeremy W. Bray, 2022. "A Bayesian approach to account for misclassification in prevalence and trend estimation," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(2), pages 351-367, March.
- Martijn van Hasselt & Christopher Bollinger & Jeremy Bray, 2019. "A Bayesian Approach to Account for Misclassification in Prevalence and Trend Estimation," UNCG Economics Working Papers 19-13, University of North Carolina at Greensboro, Department of Economics.
