IDEAS home Printed from https://ideas.repec.org/a/spr/aodasc/v11y2024i3d10.1007_s40745-024-00530-7.html
   My bibliography  Save this article

Predicting the Functional Changes in Protein Mutations Through the Application of BiLSTM and the Self-Attention Mechanism

Author

Listed:
  • Zixuan Fan

    (University of Science and Technology Beijing)

  • Yan Xu

    (University of Science and Technology Beijing)

Abstract

In the field of bioinformatics, changes in protein functionality are mainly influenced by protein mutations. Accurately predicting these functional changes can enhance our understanding of evolutionary mechanisms, promote developments in protein engineering-related fields, and accelerate progress in medical research. In this study, we introduced two different models: one based on bidirectional long short-term memory (BiLSTM), and the other based on self-attention. These models were integrated using a weighted fusion method to predict protein functional changes associated with mutation sites. The findings indicate that the model's predictive precision matches that of the current model, along with its capacity for generalization. Furthermore, the ensemble model surpasses the performance of the single models, highlighting the value of utilizing their synergistic capabilities. This finding may improve the accuracy of predicting protein functional changes associated with mutations and has potential applications in protein engineering and drug research. We evaluated the efficacy of our models under different scenarios by comparing the predicted results of protein functional changes across various numbers of mutation sites. As the number of mutation sites increases, the prediction accuracy decreases significantly, highlighting the inherent limitations of these models in handling cases involving more mutation sites.

Suggested Citation

  • Zixuan Fan & Yan Xu, 2024. "Predicting the Functional Changes in Protein Mutations Through the Application of BiLSTM and the Self-Attention Mechanism," Annals of Data Science, Springer, vol. 11(3), pages 1077-1094, June.
  • Handle: RePEc:spr:aodasc:v:11:y:2024:i:3:d:10.1007_s40745-024-00530-7
    DOI: 10.1007/s40745-024-00530-7
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s40745-024-00530-7
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s40745-024-00530-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Siquan Hu & Ruixiong Ma & Haiou Wang, 2019. "An improved deep learning method for predicting DNA-binding proteins based on contextual features in amino acid sequences," PLOS ONE, Public Library of Science, vol. 14(11), pages 1-21, November.
    2. Yunan Luo & Guangde Jiang & Tianhao Yu & Yang Liu & Lam Vo & Hantian Ding & Yufeng Su & Wesley Wei Qian & Huimin Zhao & Jian Peng, 2021. "ECNet is an evolutionary context-integrated deep learning framework for protein engineering," Nature Communications, Nature, vol. 12(1), pages 1-14, December.
    3. Benedetta Bolognesi & Andre J. Faure & Mireia Seuma & Jörn M. Schmiedel & Gian Gaetano Tartaglia & Ben Lehner, 2019. "The mutational landscape of a prion-like domain," Nature Communications, Nature, vol. 10(1), pages 1-12, December.
    4. Karen S. Sarkisyan & Dmitry A. Bolotin & Margarita V. Meer & Dinara R. Usmanova & Alexander S. Mishin & George V. Sharonov & Dmitry N. Ivankov & Nina G. Bozhanova & Mikhail S. Baranov & Onuralp Soylem, 2016. "Local fitness landscape of the green fluorescent protein," Nature, Nature, vol. 533(7603), pages 397-401, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ziyi Zhou & Liang Zhang & Yuanxi Yu & Banghao Wu & Mingchen Li & Liang Hong & Pan Tan, 2024. "Enhancing efficiency of protein language models with minimal wet-lab data through few-shot learning," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    2. Lisa Van den Broeck & Dinesh Kiran Bhosale & Kuncheng Song & Cássio Flavio Fonseca de Lima & Michael Ashley & Tingting Zhu & Shanshuo Zhu & Brigitte Van De Cotte & Pia Neyt & Anna C. Ortiz & Tiffany R, 2023. "Functional annotation of proteins for signaling network inference in non-model species," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    3. Alicia Sanchez-Gorostiaga & Djordje Bajić & Melisa L Osborne & Juan F Poyatos & Alvaro Sanchez, 2019. "High-order interactions distort the functional landscape of microbial consortia," PLOS Biology, Public Library of Science, vol. 17(12), pages 1-34, December.
    4. Mireia Seuma & Ben Lehner & Benedetta Bolognesi, 2022. "An atlas of amyloid aggregation: the impact of substitutions, insertions, deletions and truncations on amyloid beta fibril nucleation," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    5. Yinghui Chen & Yunxin Xu & Di Liu & Yaoguang Xing & Haipeng Gong, 2024. "An end-to-end framework for the prediction of protein structure and fitness from single sequence," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    6. Avichai Tendler & Anat Zimmer & Avi Mayo & Uri Alon, 2019. "Noise-precision tradeoff in predicting combinations of mutations and drugs," PLOS Computational Biology, Public Library of Science, vol. 15(5), pages 1-17, May.
    7. Nan Zheng & Yongchao Cai & Zehua Zhang & Huimin Zhou & Yu Deng & Shuang Du & Mai Tu & Wei Fang & Xiaole Xia, 2025. "Tailoring industrial enzymes for thermostability and activity evolution by the machine learning-based iCASE strategy," Nature Communications, Nature, vol. 16(1), pages 1-13, December.
    8. Rosa Antón & Miguel Á. Treviño & David Pantoja-Uceda & Sara Félix & María Babu & Eurico J. Cabrita & Markus Zweckstetter & Philip Tinnefeld & Andrés M. Vera & Javier Oroz, 2024. "Alternative low-populated conformations prompt phase transitions in polyalanine repeat expansions," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    9. Kerr Ding & Jiaqi Luo & Yunan Luo, 2024. "Leveraging conformal prediction to annotate enzyme function space with limited false positives," PLOS Computational Biology, Public Library of Science, vol. 20(5), pages 1-21, May.
    10. Jaime Carrasco & Rosa Antón & Alejandro Valbuena & David Pantoja-Uceda & Mayur Mukhi & Rubén Hervás & Douglas V. Laurents & María Gasset & Javier Oroz, 2023. "Metamorphism in TDP-43 prion-like domain determines chaperone recognition," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    11. Miller, Craig R. & Van Leuven, James T. & Wichman, Holly A. & Joyce, Paul, 2018. "Selecting among three basic fitness landscape models: Additive, multiplicative and stickbreaking," Theoretical Population Biology, Elsevier, vol. 122(C), pages 97-109.
    12. Kerr Ding & Michael Chin & Yunlong Zhao & Wei Huang & Binh Khanh Mai & Huanan Wang & Peng Liu & Yang Yang & Yunan Luo, 2024. "Machine learning-guided co-optimization of fitness and diversity facilitates combinatorial library design in enzyme engineering," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    13. Hongyi Wu & Loo Chien Wang & Belle M. Sow & Damien Leow & Jin Zhu & Kathryn M. Gallo & Kathleen Wilsbach & Roshni Gupta & Lyle W. Ostrow & Crystal J. J. Yeo & Radoslaw M. Sobota & Rong Li, 2024. "TDP43 aggregation at ER-exit sites impairs ER-to-Golgi transport," Nature Communications, Nature, vol. 15(1), pages 1-22, December.
    14. Takahiro Nemoto & Tommaso Ocari & Arthur Planul & Muge Tekinsoy & Emilia A. Zin & Deniz Dalkara & Ulisse Ferrari, 2023. "ACIDES: on-line monitoring of forward genetic screens for protein engineering," Nature Communications, Nature, vol. 14(1), pages 1-11, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:aodasc:v:11:y:2024:i:3:d:10.1007_s40745-024-00530-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.