IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0068664.html
   My bibliography  Save this article

Comparison of Profile Similarity Measures for Genetic Interaction Networks

Author

Listed:
  • Raamesh Deshpande
  • Benjamin VanderSluis
  • Chad L Myers

Abstract

Analysis of genetic interaction networks often involves identifying genes with similar profiles, which is typically indicative of a common function. While several profile similarity measures have been applied in this context, they have never been systematically benchmarked. We compared a diverse set of correlation measures, including measures commonly used by the genetic interaction community as well as several other candidate measures, by assessing their utility in extracting functional information from genetic interaction data. We find that the dot product, one of the simplest vector operations, outperforms most other measures over a large range of gene pairs. More generally, linear similarity measures such as the dot product, Pearson correlation or cosine similarity perform better than set overlap measures such as Jaccard coefficient. Similarity measures that involve L2-normalization of the profiles tend to perform better for the top-most similar pairs but perform less favorably when a larger set of gene pairs is considered or when the genetic interaction data is thresholded. Such measures are also less robust to the presence of noise and batch effects in the genetic interaction data. Overall, the dot product measure performs consistently among the best measures under a variety of different conditions and genetic interaction datasets.

Suggested Citation

  • Raamesh Deshpande & Benjamin VanderSluis & Chad L Myers, 2013. "Comparison of Profile Similarity Measures for Genetic Interaction Networks," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-11, July.
  • Handle: RePEc:plo:pone00:0068664
    DOI: 10.1371/journal.pone.0068664
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0068664
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0068664&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0068664?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Bernd‐Jürgen Falkowski, 1998. "On certain generalizations of inner product similarity measures," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 49(9), pages 854-858.
    2. Loet Leydesdorff, 2008. "On the normalization and visualization of author co‐citation data: Salton's Cosine versus the Jaccard index," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 59(1), pages 77-85, January.
    3. Schechtman, E. & Yitzhaki, S., 1999. "On the proper bounds of the Gini correlation," Economics Letters, Elsevier, vol. 63(2), pages 133-138, May.
    4. Jesse Gillis & Paul Pavlidis, 2011. "The Impact of Multifunctional Genes on "Guilt by Association" Analysis," PLOS ONE, Public Library of Science, vol. 6(2), pages 1-16, February.
    5. Leo Egghe & Loet Leydesdorff, 2009. "The relation between Pearson's correlation coefficient r and Salton's cosine measure," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 60(5), pages 1027-1036, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yi Ge & Wen Dou & Jianping Dai, 2017. "A New Approach to Identify Social Vulnerability to Climate Change in the Yangtze River Delta," Sustainability, MDPI, vol. 9(12), pages 1-19, December.
    2. Stephanie Chang & Jackie Yip & Shona Zijll de Jong & Rebecca Chaster & Ashley Lowcock, 2015. "Using vulnerability indicators to develop resilience networks: a similarity approach," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 78(3), pages 1827-1841, September.
    3. Chihyun Park & JungRim Kim & Jeongwoo Kim & Sanghyun Park, 2018. "Machine learning-based identification of genetic interactions from heterogeneous gene expression profiles," PLOS ONE, Public Library of Science, vol. 13(7), pages 1-15, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Serfling, Robert & Xiao, Peng, 2007. "A contribution to multivariate L-moments: L-comoment matrices," Journal of Multivariate Analysis, Elsevier, vol. 98(9), pages 1765-1781, October.
    2. Yoel Finkel & Yevgeny Artsev & Shlomo Yitzhaki, 2006. "Inequality measurement and the time structure of household income in Israel," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 4(2), pages 153-179, August.
    3. Joachim R. Frick & Jan Goebel & Edna Schechtman & Gert G. Wagner & Shlomo Yitzhaki, 2006. "Using Analysis of Gini (ANOGI) for Detecting Whether Two Subsamples Represent the Same Universe," Sociological Methods & Research, , vol. 34(4), pages 427-468, May.
    4. Jun-Ping Qiu & Ke Dong & Hou-Qiang Yu, 2014. "Comparative study on structure and correlation among author co-occurrence networks in bibliometrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(2), pages 1345-1360, November.
    5. Georg Groh & Christoph Fuchs, 2011. "Multi-modal social networks for modeling scientific fields," Scientometrics, Springer;Akadémiai Kiadó, vol. 89(2), pages 569-590, November.
    6. Th I Götz & G Lahmer & V Strnad & Ch Bert & B Hensel & A M Tomé & E W Lang, 2017. "A tool to automatically analyze electromagnetic tracking data from high dose rate brachytherapy of breast cancer patients," PLOS ONE, Public Library of Science, vol. 12(9), pages 1-31, September.
    7. Masih Hosseinzadeh & Hossein Mashhadimoslem & Farid Maleki & Ali Elkamel, 2022. "Prediction of Solid Conversion Process in Direct Reduction Iron Oxide Using Machine Learning," Energies, MDPI, vol. 15(24), pages 1-25, December.
    8. Yang Li & Frank Neffke, 2022. "Relatedness in regional development: in search of the right specification," Papers in Evolutionary Economic Geography (PEEG) 2208, Utrecht University, Department of Human Geography and Spatial Planning, Group Economic Geography, revised Apr 2022.
    9. Mercedes Campi & Marco Duenas & Giorgio Fagiolo, 2019. "How do countries specialize in food production? A complex-network analysis of the global agricultural product space," LEM Papers Series 2019/37, Laboratory of Economics and Management (LEM), Sant'Anna School of Advanced Studies, Pisa, Italy.
    10. Lu Huang & Yijie Cai & Erdong Zhao & Shengting Zhang & Yue Shu & Jiao Fan, 2022. "Measuring the interdisciplinarity of Information and Library Science interactions using citation analysis and semantic analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6733-6761, November.
    11. Joachim Frick & Jan Goebel, 2008. "Regional Income Stratification in Unified Germany Using a Gini Decomposition Approach," Regional Studies, Taylor & Francis Journals, vol. 42(4), pages 555-577.
    12. Jean-Pierre Lachaud, 1999. "Envois de fonds, inégalité et pauvreté au Burkina Faso," Revue Tiers Monde, Programme National Persée, vol. 40(160), pages 793-827.
    13. Loet Leydesdorff, 2011. "“Structuration” by intellectual organization: the configuration of knowledge in relations among structural components in networks of science," Scientometrics, Springer;Akadémiai Kiadó, vol. 88(2), pages 499-520, August.
    14. Carsten Schröder & Yolanda Golan & Shlomo Yitzhaki, 2014. "Inequality and the time structure of earnings: evidence from Germany," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 12(3), pages 393-409, September.
    15. Majid Asadi & Somayeh Zarezadeh, 2020. "A unified approach to constructing correlation coefficients between random variables," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 83(6), pages 657-676, August.
    16. Stéphane Mussard & Fattouma Souissi-Benrejab, 2019. "Gini-PLS Regressions," Journal of Quantitative Economics, Springer;The Indian Econometric Society (TIES), vol. 17(3), pages 477-512, September.
    17. Domenica Panzera & Alfredo Cartone & Paolo Postiglione, 2022. "New evidence on measuring the geographical concentration of economic activities," Papers in Regional Science, Wiley Blackwell, vol. 101(1), pages 59-79, February.
    18. Rakas, Marija & Hain, Daniel S., 2019. "The state of innovation system research: What happens beneath the surface?," Research Policy, Elsevier, vol. 48(9), pages 1-1.
    19. Campi, Mercedes & Dueñas, Marco, 2022. "Clusters and Resilience during the COVID–19 Crisis: Evidence from Colombian Exporting Firms," IDB Publications (Working Papers) 12527, Inter-American Development Bank.
    20. Wolfram, Dietmar & Zhao, Yuehua, 2014. "A comparison of journal similarity across six disciplines using citing discipline analysis," Journal of Informetrics, Elsevier, vol. 8(4), pages 840-853.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0068664. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.