IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v10y2019i1d10.1038_s41467-019-13225-y.html
   My bibliography  Save this article

Accurate, scalable and integrative haplotype estimation

Author

Listed:
  • Olivier Delaneau

    (University of Lausanne, Génopode
    University of Lausanne, Quartier Sorge – Batiment Amphipole)

  • Jean-François Zagury

    (HESAM Université)

  • Matthew R. Robinson

    (University of Lausanne, Génopode
    University of Lausanne, Quartier Sorge – Batiment Amphipole)

  • Jonathan L. Marchini

    (University of Oxford)

  • Emmanouil T. Dermitzakis

    (University of Geneva Medical School
    University of Geneva
    University of Geneva Medical School)

Abstract

The number of human genomes being genotyped or sequenced increases exponentially and efficient haplotype estimation methods able to handle this amount of data are now required. Here we present a method, SHAPEIT4, which substantially improves upon other methods to process large genotype and high coverage sequencing datasets. It notably exhibits sub-linear running times with sample size, provides highly accurate haplotypes and allows integrating external phasing information such as large reference panels of haplotypes, collections of pre-phased variants and long sequencing reads. We provide SHAPEIT4 in an open source format and demonstrate its performance in terms of accuracy and running times on two gold standard datasets: the UK Biobank data and the Genome In A Bottle.

Suggested Citation

  • Olivier Delaneau & Jean-François Zagury & Matthew R. Robinson & Jonathan L. Marchini & Emmanouil T. Dermitzakis, 2019. "Accurate, scalable and integrative haplotype estimation," Nature Communications, Nature, vol. 10(1), pages 1-10, December.
  • Handle: RePEc:nat:natcom:v:10:y:2019:i:1:d:10.1038_s41467-019-13225-y
    DOI: 10.1038/s41467-019-13225-y
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-019-13225-y
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-019-13225-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lijing Tang & Benjamin Swedlund & Sébastien Dupont & Chad Harland & Gabriel Costa Monteiro Moreira & Keith Durkin & Maria Artesi & Eric Mullaart & Arnaud Sartelet & Latifa Karim & Wouter Coppieters & , 2024. "GWAS reveals determinants of mobilization rate and dynamics of an active endogenous retrovirus of cattle," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    2. Robin J. Hofmeister & Simone Rubinacci & Diogo M. Ribeiro & Alfonso Buil & Zoltán Kutalik & Olivier Delaneau, 2022. "Parent-of-Origin inference for biobanks," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    3. Xinkai Tong & Dong Chen & Jianchao Hu & Shiyao Lin & Ziqi Ling & Huashui Ai & Zhiyan Zhang & Lusheng Huang, 2023. "Accurate haplotype construction and detection of selection signatures enabled by high quality pig genome sequences," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    4. Bárbara Sousa da Mota & Simone Rubinacci & Diana Ivette Cruz Dávalos & Carlos Eduardo G. Amorim & Martin Sikora & Niels N. Johannsen & Marzena H. Szmyt & Piotr Włodarczak & Anita Szczepanek & Marcin M, 2023. "Imputation of ancient human genomes," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    5. Parker C. Wilson & Yoshiharu Muto & Haojia Wu & Anil Karihaloo & Sushrut S. Waikar & Benjamin D. Humphreys, 2022. "Multimodal single cell sequencing implicates chromatin accessibility and genetic background in diabetic kidney disease progression," Nature Communications, Nature, vol. 13(1), pages 1-20, December.
    6. Seppe Goovaerts & Hanne Hoskens & Ryan J. Eller & Noah Herrick & Anthony M. Musolf & Cristina M. Justice & Meng Yuan & Sahin Naqvi & Myoung Keun Lee & Dirk Vandermeulen & Heather L. Szabo-Rogers & Pau, 2023. "Joint multi-ancestry and admixed GWAS reveals the complex genetics behind human cranial vault shape," Nature Communications, Nature, vol. 14(1), pages 1-21, December.
    7. Gabriel E. Rech & Santiago Radío & Sara Guirao-Rico & Laura Aguilera & Vivien Horvath & Llewellyn Green & Hannah Lindstadt & Véronique Jamilloux & Hadi Quesneville & Josefa González, 2022. "Population-scale long-read sequencing uncovers transposable elements associated with gene expression variation and adaptive signatures in Drosophila," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    8. Junhui Yuan & Sanjie Jiang & Jianbo Jian & Mingyu Liu & Zhen Yue & Jiabao Xu & Juan Li & Chunyan Xu & Lihong Lin & Yi Jing & Xiaoxiao Zhang & Haixin Chen & Linjuan Zhang & Tao Fu & Shuiyan Yu & Zhangy, 2022. "Genomic basis of the giga-chromosomes and giga-genome of tree peony Paeonia ostii," Nature Communications, Nature, vol. 13(1), pages 1-16, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:10:y:2019:i:1:d:10.1038_s41467-019-13225-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.