IDEAS home Printed from https://ideas.repec.org/a/nat/nature/v639y2025i8054d10.1038_s41586-024-08430-9.html
   My bibliography  Save this article

Massively parallel characterization of transcriptional regulatory elements

Author

Listed:
  • Vikram Agarwal

    (University of Washington
    Sanofi)

  • Fumitaka Inoue

    (University of California, San Francisco
    University of California, San Francisco
    Kyoto University)

  • Max Schubach

    (Berlin Institute of Health at Charité-Universitätsmedizin Berlin)

  • Dmitry Penzar

    (Russian Academy of Sciences
    Russian Academy of Sciences
    Pirogov Russian National Research Medical University)

  • Beth K. Martin

    (University of Washington)

  • Pyaree Mohan Dash

    (Berlin Institute of Health at Charité-Universitätsmedizin Berlin)

  • Pia Keukeleire

    (University of Lübeck)

  • Zicong Zhang

    (Kyoto University)

  • Ajuni Sohota

    (University of California, San Francisco
    University of California, San Francisco)

  • Jingjing Zhao

    (University of California, San Francisco
    University of California, San Francisco)

  • Ilias Georgakopoulos-Soares

    (The Pennsylvania State University College of Medicine)

  • William S. Noble

    (University of Washington
    University of Washington)

  • Galip Gürkan Yardımcı

    (University of Washington
    Oregon Health and Science University
    Oregon Health and Science University)

  • Ivan V. Kulakovskiy

    (Russian Academy of Sciences
    Russian Academy of Sciences
    Life Improvement by Future Technologies (LIFT) Center)

  • Martin Kircher

    (Berlin Institute of Health at Charité-Universitätsmedizin Berlin
    University of Lübeck)

  • Jay Shendure

    (University of Washington
    Howard Hughes Medical Institute
    University of Washington
    Seattle)

  • Nadav Ahituv

    (University of California, San Francisco
    University of California, San Francisco)

Abstract

The human genome contains millions of candidate cis-regulatory elements (cCREs) with cell-type-specific activities that shape both health and many disease states1. However, we lack a functional understanding of the sequence features that control the activity and cell-type-specific features of these cCREs. Here we used lentivirus-based massively parallel reporter assays (lentiMPRAs) to test the regulatory activity of more than 680,000 sequences, representing an extensive set of annotated cCREs among three cell types (HepG2, K562 and WTC11), and found that 41.7% of these sequences were active. By testing sequences in both orientations, we find promoters to have strand-orientation biases and their 200-nucleotide cores to function as non-cell-type-specific ‘on switches’ that provide similar expression levels to their associated gene. By contrast, enhancers have weaker orientation biases, but increased tissue-specific characteristics. Utilizing our lentiMPRA data, we develop sequence-based models to predict cCRE function and variant effects with high accuracy, delineate regulatory motifs and model their combinatorial effects. Testing a lentiMPRA library encompassing 60,000 cCREs in all three cell types further identified factors that determine cell-type specificity. Collectively, our work provides an extensive catalogue of functional CREs in three widely used cell lines and showcases how large-scale functional measurements can be used to dissect regulatory grammar.

Suggested Citation

  • Vikram Agarwal & Fumitaka Inoue & Max Schubach & Dmitry Penzar & Beth K. Martin & Pyaree Mohan Dash & Pia Keukeleire & Zicong Zhang & Ajuni Sohota & Jingjing Zhao & Ilias Georgakopoulos-Soares & Willi, 2025. "Massively parallel characterization of transcriptional regulatory elements," Nature, Nature, vol. 639(8054), pages 411-420, March.
  • Handle: RePEc:nat:nature:v:639:y:2025:i:8054:d:10.1038_s41586-024-08430-9
    DOI: 10.1038/s41586-024-08430-9
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41586-024-08430-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1038/s41586-024-08430-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nature:v:639:y:2025:i:8054:d:10.1038_s41586-024-08430-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.