IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0011900.html
   My bibliography  Save this article

Prediction of Deleterious Non-Synonymous SNPs Based on Protein Interaction Network and Hybrid Properties

Author

Listed:
  • Tao Huang
  • Ping Wang
  • Zhi-Qiang Ye
  • Heng Xu
  • Zhisong He
  • Kai-Yan Feng
  • LeLe Hu
  • WeiRen Cui
  • Kai Wang
  • Xiao Dong
  • Lu Xie
  • Xiangyin Kong
  • Yu-Dong Cai
  • Yixue Li

Abstract

Non-synonymous SNPs (nsSNPs), also known as Single Amino acid Polymorphisms (SAPs) account for the majority of human inherited diseases. It is important to distinguish the deleterious SAPs from neutral ones. Most traditional computational methods to classify SAPs are based on sequential or structural features. However, these features cannot fully explain the association between a SAP and the observed pathophysiological phenotype. We believe the better rationale for deleterious SAP prediction should be: If a SAP lies in the protein with important functions and it can change the protein sequence and structure severely, it is more likely related to disease. So we established a method to predict deleterious SAPs based on both protein interaction network and traditional hybrid properties. Each SAP is represented by 472 features that include sequential features, structural features and network features. Maximum Relevance Minimum Redundancy (mRMR) method and Incremental Feature Selection (IFS) were applied to obtain the optimal feature set and the prediction model was Nearest Neighbor Algorithm (NNA). In jackknife cross-validation, 83.27% of SAPs were correctly predicted when the optimized 263 features were used. The optimized predictor with 263 features was also tested in an independent dataset and the accuracy was still 80.00%. In contrast, SIFT, a widely used predictor of deleterious SAPs based on sequential features, has a prediction accuracy of 71.05% on the same dataset. In our study, network features were found to be most important for accurate prediction and can significantly improve the prediction performance. Our results suggest that the protein interaction context could provide important clues to help better illustrate SAP's functional association. This research will facilitate the post genome-wide association studies.

Suggested Citation

  • Tao Huang & Ping Wang & Zhi-Qiang Ye & Heng Xu & Zhisong He & Kai-Yan Feng & LeLe Hu & WeiRen Cui & Kai Wang & Xiao Dong & Lu Xie & Xiangyin Kong & Yu-Dong Cai & Yixue Li, 2010. "Prediction of Deleterious Non-Synonymous SNPs Based on Protein Interaction Network and Hybrid Properties," PLOS ONE, Public Library of Science, vol. 5(7), pages 1-7, July.
  • Handle: RePEc:plo:pone00:0011900
    DOI: 10.1371/journal.pone.0011900
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0011900
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0011900&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0011900?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Tao Huang & WeiRen Cui & LeLe Hu & KaiYan Feng & Yi-Xue Li & Yu-Dong Cai, 2009. "Prediction of Pharmacological and Xenobiotic Responses to Drugs Based on Time Course Gene Expression Profiles," PLOS ONE, Public Library of Science, vol. 4(12), pages 1-7, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Tao Huang & Lei Chen & Yu-Dong Cai & Kuo-Chen Chou, 2011. "Classification and Analysis of Regulatory Pathways Using Graph Property, Biochemical and Physicochemical Property, and Functional Property," PLOS ONE, Public Library of Science, vol. 6(9), pages 1-11, September.
    2. Yu-Dong Cai & Tao Huang & Kai-Yan Feng & Lele Hu & Lu Xie, 2010. "A Unified 35-Gene Signature for both Subtype Classification and Survival Prediction in Diffuse Large B-Cell Lymphomas," PLOS ONE, Public Library of Science, vol. 5(9), pages 1-8, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jianhua Guan & Zuguo Yu & Yongan Liao & Runbin Tang & Ming Duan & Guosheng Han, 2024. "Predicting Critical Path of Labor Dispute Resolution in Legal Domain by Machine Learning Models Based on SHapley Additive exPlanations and Soft Voting Strategy," Mathematics, MDPI, vol. 12(2), pages 1-17, January.
    2. Bi-Qing Li & Tao Huang & Jian Zhang & Ning Zhang & Guo-Hua Huang & Lei Liu & Yu-Dong Cai, 2013. "An Ensemble Prognostic Model for Colorectal Cancer," PLOS ONE, Public Library of Science, vol. 8(5), pages 1-8, May.
    3. Lu-Lu Zheng & Shen Niu & Pei Hao & KaiYan Feng & Yu-Dong Cai & Yixue Li, 2011. "Prediction of Protein Modification Sites of Pyrrolidone Carboxylic Acid Using mRMR Feature Selection and Analysis," PLOS ONE, Public Library of Science, vol. 6(12), pages 1-11, December.
    4. Yu-Dong Cai & Tao Huang & Kai-Yan Feng & Lele Hu & Lu Xie, 2010. "A Unified 35-Gene Signature for both Subtype Classification and Survival Prediction in Diffuse Large B-Cell Lymphomas," PLOS ONE, Public Library of Science, vol. 5(9), pages 1-8, September.
    5. Lei Chen & Chen Chu & Xiangyin Kong & Guohua Huang & Tao Huang & Yu-Dong Cai, 2015. "A Hybrid Computational Method for the Discovery of Novel Reproduction-Related Genes," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-15, March.
    6. Tao Huang & Lei Chen & Yu-Dong Cai & Kuo-Chen Chou, 2011. "Classification and Analysis of Regulatory Pathways Using Graph Property, Biochemical and Physicochemical Property, and Functional Property," PLOS ONE, Public Library of Science, vol. 6(9), pages 1-11, September.
    7. Le-Le Hu & Shen Niu & Tao Huang & Kai Wang & Xiao-He Shi & Yu-Dong Cai, 2010. "Prediction and Analysis of Protein Hydroxyproline and Hydroxylysine," PLOS ONE, Public Library of Science, vol. 5(12), pages 1-8, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0011900. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.