IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1003227.html
   My bibliography  Save this article

GINI: From ISH Images to Gene Interaction Networks

Author

Listed:
  • Kriti Puniyani
  • Eric P Xing

Abstract

Accurate inference of molecular and functional interactions among genes, especially in multicellular organisms such as Drosophila, often requires statistical analysis of correlations not only between the magnitudes of gene expressions, but also between their temporal-spatial patterns. The ISH (in-situ-hybridization)-based gene expression micro-imaging technology offers an effective approach to perform large-scale spatial-temporal profiling of whole-body mRNA abundance. However, analytical tools for discovering gene interactions from such data remain an open challenge due to various reasons, including difficulties in extracting canonical representations of gene activities from images, and in inference of statistically meaningful networks from such representations. In this paper, we present GINI, a machine learning system for inferring gene interaction networks from Drosophila embryonic ISH images. GINI builds on a computer-vision-inspired vector-space representation of the spatial pattern of gene expression in ISH images, enabled by our recently developed system; and a new multi-instance-kernel algorithm that learns a sparse Markov network model, in which, every gene (i.e., node) in the network is represented by a vector-valued spatial pattern rather than a scalar-valued gene intensity as in conventional approaches such as a Gaussian graphical model. By capturing the notion of spatial similarity of gene expression, and at the same time properly taking into account the presence of multiple images per gene via multi-instance kernels, GINI is well-positioned to infer statistically sound, and biologically meaningful gene interaction networks from image data. Using both synthetic data and a small manually curated data set, we demonstrate the effectiveness of our approach in network building. Furthermore, we report results on a large publicly available collection of Drosophila embryonic ISH images from the Berkeley Drosophila Genome Project, where GINI makes novel and interesting predictions of gene interactions. Software for GINI is available at http://sailing.cs.cmu.edu/Drosophila_ISH_images/Author Summary: As high-throughput technologies for molecular abundance profiling are becoming more inexpensive and accessible, computational inference of gene interaction networks from such data based on well-founded statistical principles is imperative to advance the understanding of regulatory mechanisms in various biological systems. Reverse engineering of gene networks has traditionally relied on analysis of whole-genome microarray data; here we present a new method, GINI, to infer gene networks from ISH images, thereby enabling exploration of spatial characteristics of gene expressions for network inference. Our method generates a Markov network, which encapsulates globally meaningful statistical-dependencies from vector-valued gene spatial patterns. In other words, we advance the state-of-art in both the usage of richer forms of expression data, and the employment of principled statistical methodology for sound network inference on such new form of data. Our results show that analyzing the spatial distribution of gene expression enables us to capture information not available from microarray data. Such an analysis is especially important in analyzing genes involved in embryonic development of Drosophila to reveal specific spatial patterning that determines the development of the 14 segments of the adult fly.

Suggested Citation

  • Kriti Puniyani & Eric P Xing, 2013. "GINI: From ISH Images to Gene Interaction Networks," PLOS Computational Biology, Public Library of Science, vol. 9(10), pages 1-15, October.
  • Handle: RePEc:plo:pcbi00:1003227
    DOI: 10.1371/journal.pcbi.1003227
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003227
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1003227&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1003227?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Maria Stella Carro & Wei Keat Lim & Mariano Javier Alvarez & Robert J. Bollo & Xudong Zhao & Evan Y. Snyder & Erik P. Sulman & Sandrine L. Anne & Fiona Doetsch & Howard Colman & Anna Lasorella & Ken A, 2010. "The transcriptional network for mesenchymal transformation of brain tumours," Nature, Nature, vol. 463(7279), pages 318-325, January.
    2. Dobra, Adrian & Hans, Chris & Jones, Beatrix & Nevins, J.R.Joseph R. & Yao, Guang & West, Mike, 2004. "Sparse graphical models for exploring gene expression data," Journal of Multivariate Analysis, Elsevier, vol. 90(1), pages 196-212, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Giraud Christophe & Huet Sylvie & Verzelen Nicolas, 2012. "Graph Selection with GGMselect," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(3), pages 1-52, February.
    2. Paci, Lucia & Consonni, Guido, 2020. "Structural learning of contemporaneous dependencies in graphical VAR models," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    3. Wessel N. van Wieringen & Carel F. W. Peeters & Renee X. de Menezes & Mark A. van de Wiel, 2018. "Testing for pathway (in)activation by using Gaussian graphical models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 67(5), pages 1419-1436, November.
    4. Byol Kim & Song Liu & Mladen Kolar, 2021. "Two‐sample inference for high‐dimensional Markov networks," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(5), pages 939-962, November.
    5. Agius Phaedra & Ying Yiming & Campbell Colin, 2009. "Bayesian Unsupervised Learning with Multiple Data Types," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-27, June.
    6. Wang, Hao, 2010. "Sparse seemingly unrelated regression modelling: Applications in finance and econometrics," Computational Statistics & Data Analysis, Elsevier, vol. 54(11), pages 2866-2877, November.
    7. Daniel Felix Ahelegbey & Luis Carvalho & Eric D. Kolaczyk, 2020. "A Bayesian Covariance Graph And Latent Position Model For Multivariate Financial Time Series," DEM Working Papers Series 181, University of Pavia, Department of Economics and Management.
    8. Christine Peterson & Francesco C. Stingo & Marina Vannucci, 2015. "Bayesian Inference of Multiple Gaussian Graphical Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 159-174, March.
    9. Aliaksandr Hubin & Geir Storvik, 2024. "Sparse Bayesian Neural Networks: Bridging Model and Parameter Uncertainty through Scalable Variational Inference," Mathematics, MDPI, vol. 12(6), pages 1-28, March.
    10. Laurenţiu Cătălin Hinoveanu & Fabrizio Leisen & Cristiano Villa, 2020. "A loss‐based prior for Gaussian graphical models," Australian & New Zealand Journal of Statistics, Australian Statistical Publishing Association Inc., vol. 62(4), pages 444-466, December.
    11. Xiaodong Cai & Juan Andrés Bazerque & Georgios B Giannakis, 2013. "Inference of Gene Regulatory Networks with Sparse Structural Equation Models Exploiting Genetic Perturbations," PLOS Computational Biology, Public Library of Science, vol. 9(5), pages 1-13, May.
    12. Chun, Hyonho & Lee, Myung Hee & Fleet, James C. & Oh, Ji Hwan, 2016. "Graphical models via joint quantile regression with component selection," Journal of Multivariate Analysis, Elsevier, vol. 152(C), pages 162-171.
    13. Ashish Arora & Michelle Gittelman & Sarah Kaplan & John Lynch & Will Mitchell & Nicolaj Siggelkow & Mei Li & Ying Lin & Shuai Huang & Craig Crossland, 2016. "The use of sparse inverse covariance estimation for relationship detection and hypothesis generation in strategic management," Strategic Management Journal, Wiley Blackwell, vol. 37(1), pages 86-97, January.
    14. Villers Fanny & Schaeffer Brigitte & Bertin Caroline & Huet Sylvie, 2008. "Assessing the Validity Domains of Graphical Gaussian Models in Order to Infer Relationships among Components of Complex Biological Systems," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 7(2), pages 1-37, September.
    15. Zhixiang Lin & Tao Wang & Can Yang & Hongyu Zhao, 2017. "On joint estimation of Gaussian graphical models for spatial and temporal data," Biometrics, The International Biometric Society, vol. 73(3), pages 769-779, September.
    16. Chen, Shyh-Huei & Ip, Edward H. & Wang, Yuchung J., 2011. "Gibbs ensembles for nearly compatible and incompatible conditional models," Computational Statistics & Data Analysis, Elsevier, vol. 55(4), pages 1760-1769, April.
    17. Codazzi, Laura & Colombi, Alessandro & Gianella, Matteo & Argiento, Raffaele & Paci, Lucia & Pini, Alessia, 2022. "Gaussian graphical modeling for spectrometric data analysis," Computational Statistics & Data Analysis, Elsevier, vol. 174(C).
    18. Yiming Hu & Hongyu Zhao, 2016. "CCor: A whole genome network‐based similarity measure between two genes," Biometrics, The International Biometric Society, vol. 72(4), pages 1216-1225, December.
    19. Nikolaos Petrakis & Stefano Peluso & Dimitris Fouskakis & Guido Consonni, 2020. "Objective methods for graphical structural learning," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 74(3), pages 420-438, August.
    20. Donatello Telesca & Peter Müller & Steven M. Kornblau & Marc A. Suchard & Yuan Ji, 2012. "Modeling Protein Expression and Protein Signaling Pathways," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(500), pages 1372-1384, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1003227. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.