IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1006616.html
   My bibliography  Save this article

SFPEL-LPI: Sequence-based feature projection ensemble learning for predicting LncRNA-protein interactions

Author

Listed:
  • Wen Zhang
  • Xiang Yue
  • Guifeng Tang
  • Wenjian Wu
  • Feng Huang
  • Xining Zhang

Abstract

LncRNA-protein interactions play important roles in post-transcriptional gene regulation, poly-adenylation, splicing and translation. Identification of lncRNA-protein interactions helps to understand lncRNA-related activities. Existing computational methods utilize multiple lncRNA features or multiple protein features to predict lncRNA-protein interactions, but features are not available for all lncRNAs or proteins; most of existing methods are not capable of predicting interacting proteins (or lncRNAs) for new lncRNAs (or proteins), which don’t have known interactions. In this paper, we propose the sequence-based feature projection ensemble learning method, “SFPEL-LPI”, to predict lncRNA-protein interactions. First, SFPEL-LPI extracts lncRNA sequence-based features and protein sequence-based features. Second, SFPEL-LPI calculates multiple lncRNA-lncRNA similarities and protein-protein similarities by using lncRNA sequences, protein sequences and known lncRNA-protein interactions. Then, SFPEL-LPI combines multiple similarities and multiple features with a feature projection ensemble learning frame. In computational experiments, SFPEL-LPI accurately predicts lncRNA-protein associations and outperforms other state-of-the-art methods. More importantly, SFPEL-LPI can be applied to new lncRNAs (or proteins). The case studies demonstrate that our method can find out novel lncRNA-protein interactions, which are confirmed by literature. Finally, we construct a user-friendly web server, available at http://www.bioinfotech.cn/SFPEL-LPI/.Author summary: LncRNA-protein interactions play important roles in post-transcriptional gene regulation, poly-adenylation, splicing and translation. Identification of lncRNA-protein interactions helps to understand lncRNA-related activities. In this paper, we propose a novel computational method “SFPEL-LPI” to predict lncRNA-protein interactions. SFPEL-LPI makes use of lncRNA sequences, protein sequences and known lncRNA-protein associations to extract features and calculate similarities for lncRNAs and proteins, and then combines them with a feature projection ensemble learning frame. SFPEL-LPI can predict unobserved interactions between lncRNAs and proteins, and also can make predictions for new lncRNAs (or proteins), which have no interactions with any proteins (or lncRNAs). SFPEL-LPI produces high-accuracy performances on the benchmark dataset when evaluated by five-fold cross validation, and outperforms state-of-the-art methods. The case studies demonstrate that SFPEL-LPI can find out novel associations, which are confirmed by literature. To facilitate the lncRNA-protein interaction prediction, we develop a user-friendly web server, available at http://www.bioinfotech.cn/SFPEL-LPI/.

Suggested Citation

  • Wen Zhang & Xiang Yue & Guifeng Tang & Wenjian Wu & Feng Huang & Xining Zhang, 2018. "SFPEL-LPI: Sequence-based feature projection ensemble learning for predicting LncRNA-protein interactions," PLOS Computational Biology, Public Library of Science, vol. 14(12), pages 1-21, December.
  • Handle: RePEc:plo:pcbi00:1006616
    DOI: 10.1371/journal.pcbi.1006616
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1006616
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1006616&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1006616?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. U Martin Singh-Blom & Nagarajan Natarajan & Ambuj Tewari & John O Woods & Inderjit S Dhillon & Edward M Marcotte, 2013. "Prediction and Validation of Gene-Disease Associations Using Methods Inspired by Social Network Analyses," PLOS ONE, Public Library of Science, vol. 8(5), pages 1-17, May.
    2. Yunan Luo & Xinbin Zhao & Jingtian Zhou & Jinglin Yang & Yanqing Zhang & Wenhua Kuang & Jian Peng & Ligong Chen & Jianyang Zeng, 2017. "A network integration approach for drug-target interaction prediction and computational drug repositioning from heterogeneous information," Nature Communications, Nature, vol. 8(1), pages 1-13, December.
    3. Yue-Ping Liu & Hai-Yan Wu & Xiang Yang & Han-Qing Xu & Yong-Chuan Li & Da-Chuan Shi & Jun-Fu Huang & Qing Huang & Wei-Ling Fu, 2015. "Association between Thiopurine S-methyltransferase Polymorphisms and Thiopurine-Induced Adverse Drug Reactions in Patients with Inflammatory Bowel Disease: A Meta-Analysis," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-12, March.
    4. Longqiang Luo & Dingfang Li & Wen Zhang & Shikui Tu & Xiaopeng Zhu & Gang Tian, 2016. "Accurate Prediction of Transposon-Derived piRNAs by Integrating Various Sequential and Physicochemical Features," PLOS ONE, Public Library of Science, vol. 11(4), pages 1-13, April.
    5. Wen Zhang & Yanqing Niu & Hua Zou & Longqiang Luo & Qianchao Liu & Weijian Wu, 2015. "Accurate Prediction of Immunogenic T-Cell Epitopes from Epitope Sequences Using the Genetic Algorithm-Based Ensemble Learning," PLOS ONE, Public Library of Science, vol. 10(5), pages 1-14, May.
    6. Wen Zhang & Yanqing Niu & Yi Xiong & Meng Zhao & Rongwei Yu & Juan Liu, 2012. "Computational Prediction of Conformational B-Cell Epitopes from Antigen Primary Structures by Ensemble Learning," PLOS ONE, Public Library of Science, vol. 7(8), pages 1-9, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Longqiang Luo & Dingfang Li & Wen Zhang & Shikui Tu & Xiaopeng Zhu & Gang Tian, 2016. "Accurate Prediction of Transposon-Derived piRNAs by Integrating Various Sequential and Physicochemical Features," PLOS ONE, Public Library of Science, vol. 11(4), pages 1-13, April.
    2. Yue-Ping Liu & Han-Qing Xu & Ming Li & Xiang Yang & Shu Yu & Wei-Ling Fu & Qing Huang, 2015. "Association between Thiopurine S-Methyltransferase Polymorphisms and Azathioprine-Induced Adverse Drug Reactions in Patients with Autoimmune Diseases: A Meta-Analysis," PLOS ONE, Public Library of Science, vol. 10(12), pages 1-14, December.
    3. Mingxuan Che & Kui Yao & Chao Che & Zhangwei Cao & Fanchen Kong, 2021. "Knowledge-Graph-Based Drug Repositioning against COVID-19 by Graph Convolutional Network with Attention Mechanism," Future Internet, MDPI, vol. 13(1), pages 1-10, January.
    4. Akram Vasighizaker & Alok Sharma & Abdollah Dehzangi, 2019. "A novel one-class classification approach to accurately predict disease-gene association in acute myeloid leukemia cancer," PLOS ONE, Public Library of Science, vol. 14(12), pages 1-12, December.
    5. Cui, Ying & Cai, Meng & Stanley, H. Eugene, 2018. "Discovering disease-associated genes in weighted protein–protein interaction networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 496(C), pages 53-61.
    6. Yuxuan Wang & Ying Xia & Junchi Yan & Ye Yuan & Hong-Bin Shen & Xiaoyong Pan, 2023. "ZeroBind: a protein-specific zero-shot predictor with subgraph matching for drug-target interactions," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    7. Qing Ye & Chang-Yu Hsieh & Ziyi Yang & Yu Kang & Jiming Chen & Dongsheng Cao & Shibo He & Tingjun Hou, 2021. "A unified drug–target interaction prediction framework based on knowledge graph and recommendation system," Nature Communications, Nature, vol. 12(1), pages 1-12, December.
    8. Wen Zhang & Yanqing Niu & Hua Zou & Longqiang Luo & Qianchao Liu & Weijian Wu, 2015. "Accurate Prediction of Immunogenic T-Cell Epitopes from Epitope Sequences Using the Genetic Algorithm-Based Ensemble Learning," PLOS ONE, Public Library of Science, vol. 10(5), pages 1-14, May.
    9. Xiaomin Liang & Daifeng Li & Min Song & Andrew Madden & Ying Ding & Yi Bu, 2019. "Predicting biomedical relationships using the knowledge and graph embedding cascade model," PLOS ONE, Public Library of Science, vol. 14(6), pages 1-23, June.
    10. Jianzhao Gao & Wei Cui & Yajun Sheng & Jishou Ruan & Lukasz Kurgan, 2016. "PSIONplus: Accurate Sequence-Based Predictor of Ion Channels and Their Types," PLOS ONE, Public Library of Science, vol. 11(4), pages 1-18, April.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1006616. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.