IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v5y2014i1d10.1038_ncomms6695.html
   My bibliography  Save this article

High-quality genome (re)assembly using chromosomal contact data

Author

Listed:
  • Hervé Marie-Nelly

    (Institut Pasteur, Groupe Régulation Spatiale des Génomes
    CNRS, UMR 3525
    Institut Pasteur, Unité Imagerie et Modélisation
    CNRS, URA 2582)

  • Martial Marbouty

    (Institut Pasteur, Groupe Régulation Spatiale des Génomes
    CNRS, UMR 3525)

  • Axel Cournac

    (Institut Pasteur, Groupe Régulation Spatiale des Génomes
    CNRS, UMR 3525)

  • Jean-François Flot

    (Max Planck Institute for Dynamics and Self-Organization, Group Biological Physics and Evolutionary Dynamics)

  • Gianni Liti

    (Institute for Research on Cancer and Ageing of Nice (IRCAN), CNRS UMR 7284—INSERM U108, Université de Nice Sophia Antipolis)

  • Dante Poggi Parodi

    (Sorbonne Universités, UPMC Univ Paris06, IFD
    IFP Energies Nouvelles)

  • Sylvie Syan

    (Institut Pasteur, Unité Cell Biology of Parasitism)

  • Nancy Guillén

    (Institut Pasteur, Unité Cell Biology of Parasitism)

  • Antoine Margeot

    (IFP Energies Nouvelles)

  • Christophe Zimmer

    (Institut Pasteur, Unité Imagerie et Modélisation
    CNRS, URA 2582)

  • Romain Koszul

    (Institut Pasteur, Groupe Régulation Spatiale des Génomes
    CNRS, UMR 3525)

Abstract

Closing gaps in draft genome assemblies can be costly and time-consuming, and published genomes are therefore often left ‘unfinished.’ Here we show that genome-wide chromosome conformation capture (3C) data can be used to overcome these limitations, and present a computational approach rooted in polymer physics that determines the most likely genome structure using chromosomal contact data. This algorithm—named GRAAL—generates high-quality assemblies of genomes in which repeated and duplicated regions are accurately represented and offers a direct probabilistic interpretation of the computed structures. We first validated GRAAL on the reference genome of Saccharomyces cerevisiae, as well as other yeast isolates, where GRAAL recovered both known and unknown complex chromosomal structural variations. We then applied GRAAL to the finishing of the assembly of Trichoderma reesei and obtained a number of contigs congruent with the know karyotype of this species. Finally, we showed that GRAAL can accurately reconstruct human chromosomes from either fragments generated in silico or contigs obtained from de novo assembly. In all these applications, GRAAL compared favourably to recently published programmes implementing related approaches.

Suggested Citation

  • Hervé Marie-Nelly & Martial Marbouty & Axel Cournac & Jean-François Flot & Gianni Liti & Dante Poggi Parodi & Sylvie Syan & Nancy Guillén & Antoine Margeot & Christophe Zimmer & Romain Koszul, 2014. "High-quality genome (re)assembly using chromosomal contact data," Nature Communications, Nature, vol. 5(1), pages 1-10, December.
  • Handle: RePEc:nat:natcom:v:5:y:2014:i:1:d:10.1038_ncomms6695
    DOI: 10.1038/ncomms6695
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/ncomms6695
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/ncomms6695?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Simone Mozzachiodi & Lorenzo Tattini & Agnes Llored & Agurtzane Irizar & Neža Škofljanc & Melania D’Angiolo & Matteo De Chiara & Benjamin P. Barré & Jia-Xing Yue & Angela Lutazi & Sophie Loeillet & Ra, 2021. "Aborting meiosis allows recombination in sterile diploid yeast hybrids," Nature Communications, Nature, vol. 12(1), pages 1-13, December.
    2. Robert Schöpflin & Uirá Souto Melo & Hossein Moeinzadeh & David Heller & Verena Laupert & Jakob Hertzberg & Manuel Holtgrewe & Nico Alavi & Marius-Konstantin Klever & Julius Jungnitsch & Emel Comak & , 2022. "Integration of Hi-C with short and long-read genome sequencing reveals the structure of germline rearranged genomes," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    3. Jacques Serizay & Cyril Matthey-Doret & Amaury Bignaud & Lyam Baudry & Romain Koszul, 2024. "Orchestrating chromosome conformation capture analysis with Bioconductor," Nature Communications, Nature, vol. 15(1), pages 1-9, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:5:y:2014:i:1:d:10.1038_ncomms6695. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.