IDEAS home Printed from https://ideas.repec.org/p/may/mayecw/n274-16.pdf.html
   My bibliography  Save this paper

-Adjusted p-values for genome-wide regression analysis with non-normally distributed quantitative phenotypes

Author

Listed:
  • Gregory Connor

    (Department of Economics, Finance and Accounting, Maynooth University.)

  • Michael O?Neill

    (School of Business, University College, Dublin)

Abstract

This paper provides a small-sample adjustment for Bonferonni- corrected p-values in multiple univariate regressions of a quantitative phenotype (such as a social trait) on individual genome markers. The p-value estimator conventionally used in existing genome-wide asso- ciation (GWA) regressions assumes a normally-distributed dependent variable, or relies on a central limit theorem based approximation. We show that the central limit theorem approximation is unreliable for GWA regression Bonferonni-corrected p-values except in very large samples. We note that measured phenotypes (particularly in the case of social traits) often have markedly non-normal distributions. We propose a mixed normal distribution to better ?t observed pheno- typic variables, and derive exact small-sample p-values for the stan- dard GWA regression under this distributional assumption.

Suggested Citation

  • Gregory Connor & Michael O?Neill, 2016. "-Adjusted p-values for genome-wide regression analysis with non-normally distributed quantitative phenotypes," Economics Department Working Paper Series n274-16.pdf, Department of Economics, National University of Ireland - Maynooth.
  • Handle: RePEc:may:mayecw:n274-16.pdf
    as

    Download full text from publisher

    File URL: http://repec.maynoothuniversity.ie/mayecw-files/N274-16.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Benaglia, Tatiana & Chauveau, Didier & Hunter, David R. & Young, Derek S., 2009. "mixtools: An R Package for Analyzing Mixture Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 32(i06).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ana Pinto & Tong Yin & Marion Reichenbach & Raghavendra Bhatta & Pradeep Kumar Malik & Eva Schlecht & Sven König, 2020. "Enteric Methane Emissions of Dairy Cattle Considering Breed Composition, Pasture Management, Housing Conditions and Feeding Characteristics along a Rural-Urban Gradient in a Rising Megacity," Agriculture, MDPI, vol. 10(12), pages 1-18, December.
    2. Ozonder, Gozde & Miller, Eric J., 2021. "Longitudinal investigation of skeletal activity episode timing decisions – A copula approach," Journal of choice modelling, Elsevier, vol. 40(C).
    3. Minjung Kyung & Ju-Hyun Park & Ji Yeh Choi, 2022. "Bayesian Mixture Model of Extended Redundancy Analysis," Psychometrika, Springer;The Psychometric Society, vol. 87(3), pages 946-966, September.
    4. Xue, Jiacheng & Yao, Weixin, 2022. "Machine Learning Embedded Semiparametric Mixtures of Regressions with Covariate-Varying Mixing Proportions," Econometrics and Statistics, Elsevier, vol. 22(C), pages 159-171.
    5. Meng Li & Sijia Xiang & Weixin Yao, 2016. "Robust estimation of the number of components for mixtures of linear regression models," Computational Statistics, Springer, vol. 31(4), pages 1539-1555, December.
    6. Derek S. Young & Xi Chen & Dilrukshi C. Hewage & Ricardo Nilo-Poyanco, 2019. "Finite mixture-of-gamma distributions: estimation, inference, and model-based clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(4), pages 1053-1082, December.
    7. Héctor Nájera & David Gordon, 2023. "A Monte Carlo Study of Some Empirical Methods to Find the Optimal Poverty Line in Multidimensional Poverty Measurement," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 167(1), pages 391-419, June.
    8. Doering, Kenji & Sendelbach, Luke & Steinschneider, Scott & Lindsay Anderson, C., 2021. "The effects of wind generation and other market determinants on price spikes," Applied Energy, Elsevier, vol. 300(C).
    9. Zihao Wang & Wenxi Wang & Xiaoming Xie & Yongfa Wang & Zhengzhao Yang & Huiru Peng & Mingming Xin & Yingyin Yao & Zhaorong Hu & Jie Liu & Zhenqi Su & Chaojie Xie & Baoyun Li & Zhongfu Ni & Qixin Sun &, 2022. "Dispersed emergence and protracted domestication of polyploid wheat uncovered by mosaic ancestral haploblock inference," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    10. Dalla Valle, Luciana & De Giuli, Maria Elena & Tarantola, Claudia & Manelli, Claudio, 2016. "Default probability estimation via pair copula constructions," European Journal of Operational Research, Elsevier, vol. 249(1), pages 298-311.
    11. Shusaku Tsumoto & Tomohiro Kimura & Shoji Hirano, 2021. "Mining Clinical Pathways Using Dual Clustering," The Review of Socionetwork Strategies, Springer, vol. 15(2), pages 287-307, November.
    12. Zhang, Liyuan & Zhang, Huihui & Han, Wenting & Niu, Yaxiao & Chávez, José L. & Ma, Weitong, 2021. "The mean value of gaussian distribution of excess green index: A new crop water stress indicator," Agricultural Water Management, Elsevier, vol. 251(C).
    13. Ivana Malá, 2013. "Použití konečných směsí logaritmicko-normálních rozdělení pro modelování příjmů českých domácností [The Use of Finite Mixtures of Lognormal Distribution for the Modelling of Household Income Distri," Politická ekonomie, Prague University of Economics and Business, vol. 2013(3), pages 356-372.
    14. Christian Schellhase & Göran Kauermann, 2012. "Density estimation and comparison with a penalized mixture approach," Computational Statistics, Springer, vol. 27(4), pages 757-777, December.
    15. Chauveau, Didier & Hoang, Vy Thuy Lynh, 2016. "Nonparametric mixture models with conditionally independent multivariate component densities," Computational Statistics & Data Analysis, Elsevier, vol. 103(C), pages 1-16.
    16. Daan de Waal & Tristan Harris & Alta de Waal & Jocelyn Mazarura, 2022. "Modelling Bimodal Data Using a Multivariate Triangular-Linked Distribution," Mathematics, MDPI, vol. 10(14), pages 1-20, July.
    17. Hu, Hao & Yao, Weixin & Wu, Yichao, 2017. "The robust EM-type algorithms for log-concave mixtures of regression models," Computational Statistics & Data Analysis, Elsevier, vol. 111(C), pages 14-26.
    18. Antonello Maruotti & Antonio Punzo, 2021. "Initialization of Hidden Markov and Semi‐Markov Models: A Critical Evaluation of Several Strategies," International Statistical Review, International Statistical Institute, vol. 89(3), pages 447-480, December.
    19. Cheng Peng & Stanislav Uryasev, 2023. "Factor Model of Mixtures," Papers 2301.13843, arXiv.org, revised Mar 2023.
    20. Zulfiqar Ali & Asad Ellahi & Ijaz Hussain & Amna Nazeer & Sadia Qamar & Guangheng Ni & Muhammad Faisal, 2021. "Reduction of Errors in Hydrological Drought Monitoring – A Novel Statistical Framework for Spatio-Temporal Assessment of Drought," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 35(13), pages 4363-4380, October.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:may:mayecw:n274-16.pdf. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/demayie.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.