Hierarchical correction of p-values via an ultrametric tree running Ornstein-Uhlenbeck process

Author

Antoine Bichat (LaMME, Université d’Évry val d’Essonne; Enterome)
Christophe Ambroise (LaMME, Université d’Évry val d’Essonne)
Mahendra Mariadassou (MaIAGE, INRAE, Université Paris-Saclay)

Abstract

Statistical testing is classically used as an exploratory tool to search for associations between a phenotype and many possible explanatory variables. This approach often leads to multiple testing under dependence. We assume a hierarchical structure between tests via an Ornstein-Uhlenbeck process on a tree. The correlation structure of the process is used to smooth the p-values. We design a penalized estimation of the mean of the Ornstein-Uhlenbeck process for p-value computation. The performance of the algorithm is assessed via simulations. Its ability to discover new associations is demonstrated on a metagenomic dataset. The corresponding R package is available from https://github.com/abichat/zazou.
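As a rough illustration of the kind of correlation structure an Ornstein-Uhlenbeck process induces on the leaves of an ultrametric tree, the sketch below uses the textbook stationary form Cov(X_i, X_j) = σ²/(2α) · exp(−α · d_ij), where d_ij is the tree distance between leaves i and j. This is not the zazou package's code. The function name `ou_covariance`, the three-leaf tree, and the parameter values are hypothetical, and a stationary root is assumed.

```python
import math


def ou_covariance(distances, alpha, sigma):
    """Stationary OU covariance between leaves (assumed form, not taken from the paper).

    distances: symmetric matrix of pairwise tree distances between leaves
    alpha:     selection strength, controls how fast correlation decays
    sigma:     diffusion scale of the process
    """
    stationary_var = sigma ** 2 / (2.0 * alpha)
    return [
        [stationary_var * math.exp(-alpha * d) for d in row]
        for row in distances
    ]


# Hypothetical ultrametric tree of height 1 with three leaves:
# A and B share an ancestor at depth 0.6, so d(A, B) = 2 * 0.4 = 0.8;
# C joins them at the root, so d(A, C) = d(B, C) = 2.0.
distances = [
    [0.0, 0.8, 2.0],
    [0.8, 0.0, 2.0],
    [2.0, 2.0, 0.0],
]

cov = ou_covariance(distances, alpha=1.0, sigma=1.0)

for row in cov:
    print(["%.4f" % value for value in row])
```

Leaves that are close in the tree (A and B) get a larger covariance than distant ones (A and C). Under this assumption, that is what lets information be shared between neighbouring tests.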

Suggested Citation

Antoine Bichat & Christophe Ambroise & Mahendra Mariadassou, 2022. "Hierarchical correction of p-values via an ultrametric tree running Ornstein-Uhlenbeck process," Computational Statistics, Springer, vol. 37(3), pages 995-1013, July.

Handle: RePEc:spr:compst:v:37:y:2022:i:3:d:10.1007_s00180-021-01148-6
DOI: 10.1007/s00180-021-01148-6
