IDEAS home Printed from https://ideas.repec.org/a/nat/nature/v489y2012i7414d10.1038_nature11212.html
   My bibliography  Save this article

An expansive human regulatory lexicon encoded in transcription factor footprints

Author

Listed:
  • Shane Neph

    (University of Washington)

  • Jeff Vierstra

    (University of Washington)

  • Andrew B. Stergachis

    (University of Washington)

  • Alex P. Reynolds

    (University of Washington)

  • Eric Haugen

    (University of Washington)

  • Benjamin Vernot

    (University of Washington)

  • Robert E. Thurman

    (University of Washington)

  • Sam John

    (University of Washington)

  • Richard Sandstrom

    (University of Washington)

  • Audra K. Johnson

    (University of Washington)

  • Matthew T. Maurano

    (University of Washington)

  • Richard Humbert

    (University of Washington)

  • Eric Rynes

    (University of Washington)

  • Hao Wang

    (University of Washington)

  • Shinny Vong

    (University of Washington)

  • Kristen Lee

    (University of Washington)

  • Daniel Bates

    (University of Washington)

  • Morgan Diegel

    (University of Washington)

  • Vaughn Roach

    (University of Washington)

  • Douglas Dunn

    (University of Washington)

  • Jun Neri

    (University of Washington)

  • Anthony Schafer

    (University of Washington)

  • R. Scott Hansen

    (University of Washington
    University of Washington)

  • Tanya Kutyavin

    (University of Washington)

  • Erika Giste

    (University of Washington)

  • Molly Weaver

    (University of Washington)

  • Theresa Canfield

    (University of Washington)

  • Peter Sabo

    (University of Washington)

  • Miaohua Zhang

    (Fred Hutchison Cancer Research Center)

  • Gayathri Balasundaram

    (Fred Hutchison Cancer Research Center)

  • Rachel Byron

    (Fred Hutchison Cancer Research Center)

  • Michael J. MacCoss

    (University of Washington)

  • Joshua M. Akey

    (University of Washington)

  • M. A. Bender

    (Fred Hutchison Cancer Research Center
    University of Washington)

  • Mark Groudine

    (Fred Hutchison Cancer Research Center
    University of Washington)

  • Rajinder Kaul

    (University of Washington
    University of Washington)

  • John A. Stamatoyannopoulos

    (University of Washington
    University of Washington)

Abstract

Regulatory factor binding to genomic DNA protects the underlying sequence from cleavage by DNase I, leaving nucleotide-resolution footprints. Using genomic DNase I footprinting across 41 diverse cell and tissue types, we detected 45 million transcription factor occupancy events within regulatory regions, representing differential binding to 8.4 million distinct short sequence elements. Here we show that this small genomic sequence compartment, roughly twice the size of the exome, encodes an expansive repertoire of conserved recognition sequences for DNA-binding proteins that nearly doubles the size of the human cis–regulatory lexicon. We find that genetic variants affecting allelic chromatin states are concentrated in footprints, and that these elements are preferentially sheltered from DNA methylation. High-resolution DNase I cleavage patterns mirror nucleotide-level evolutionary conservation and track the crystallographic topography of protein–DNA interfaces, indicating that transcription factor structure has been evolutionarily imprinted on the human genome sequence. We identify a stereotyped 50-base-pair footprint that precisely defines the site of transcript origination within thousands of human promoters. Finally, we describe a large collection of novel regulatory factor recognition motifs that are highly conserved in both sequence and function, and exhibit cell-selective occupancy patterns that closely parallel major regulators of development, differentiation and pluripotency.

Suggested Citation

  • Shane Neph & Jeff Vierstra & Andrew B. Stergachis & Alex P. Reynolds & Eric Haugen & Benjamin Vernot & Robert E. Thurman & Sam John & Richard Sandstrom & Audra K. Johnson & Matthew T. Maurano & Richar, 2012. "An expansive human regulatory lexicon encoded in transcription factor footprints," Nature, Nature, vol. 489(7414), pages 83-90, September.
  • Handle: RePEc:nat:nature:v:489:y:2012:i:7414:d:10.1038_nature11212
    DOI: 10.1038/nature11212
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/nature11212
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1038/nature11212?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Carlos Company & Matthias Jürgen Schmitt & Yuliia Dramaretska & Michela Serresi & Sonia Kertalli & Ben Jiang & Jiang-An Yin & Adriano Aguzzi & Iros Barozzi & Gaetano Gargiulo, 2024. "Logical design of synthetic cis-regulatory DNA for genetic tracing of cell identities and state changes," Nature Communications, Nature, vol. 15(1), pages 1-20, December.
    2. Shengen Shawn Hu & Lin Liu & Qi Li & Wenjing Ma & Michael J. Guertin & Clifford A. Meyer & Ke Deng & Tingting Zhang & Chongzhi Zang, 2022. "Intrinsic bias estimation for improved analysis of bulk and single-cell chromatin accessibility profiles using SELMA," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    3. Yuyan Cheng & Yuqin Yin & Alice Zhang & Alexander M. Bernstein & Riki Kawaguchi & Kun Gao & Kyra Potter & Hui-Ya Gilbert & Yan Ao & Jing Ou & Catherine J. Fricano-Kugler & Jeffrey L. Goldberg & Zhigan, 2022. "Transcription factor network analysis identifies REST/NRSF as an intrinsic regulator of CNS regeneration in mice," Nature Communications, Nature, vol. 13(1), pages 1-22, December.
    4. Pedro Madrigal & Siwei Deng & Yuliang Feng & Stefania Militi & Kim Jee Goh & Reshma Nibhani & Rodrigo Grandy & Anna Osnato & Daniel Ortmann & Stephanie Brown & Siim Pauklin, 2023. "Epigenetic and transcriptional regulations prime cell fate before division during human pluripotent stem cell differentiation," Nature Communications, Nature, vol. 14(1), pages 1-23, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nature:v:489:y:2012:i:7414:d:10.1038_nature11212. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.