IDEAS home Printed from https://ideas.repec.org/a/spr/infosf/v27y2025i1d10.1007_s10796-023-10432-3.html
   My bibliography  Save this article

Synthesizing Knowledge through A Data Analytics-Based Systematic Literature Review Protocol

Author

Listed:
  • Rachael Ruizhu Xiong

    (Kansas State University)

  • Charles Zhechao Liu

    (University of Texas at San Antonio)

  • Kim-Kwang Raymond Choo

    (University of Texas at San Antonio)

Abstract

Systematic literature reviews (SLR) are commonly undertaken by researchers to stay informed of the latest development in a particular topic, but this manual process is demanding and can only locate and analyze a limited number of articles. We propose a data analytic-based SLR protocol and a set of semi-automated tools to leverage the latest advances in data analytics and facilitate a more effective, objective, and comprehensive SLR process. Our protocol incorporates scraping tools to collect articles from seven bibliographic databases, and text analytics, social network analysis, natural language processing, citation analysis, and main path analysis to analyze a large number of articles. To demonstrate its utility of, we apply the protocol on the topic of “information diffusion in social networks”. The results reveal 11 latent topics under this broad domain along with the most critical articles for each topic, and the connections among the associated 1,229 articles and their references.

Suggested Citation

  • Rachael Ruizhu Xiong & Charles Zhechao Liu & Kim-Kwang Raymond Choo, 2025. "Synthesizing Knowledge through A Data Analytics-Based Systematic Literature Review Protocol," Information Systems Frontiers, Springer, vol. 27(1), pages 235-258, February.
  • Handle: RePEc:spr:infosf:v:27:y:2025:i:1:d:10.1007_s10796-023-10432-3
    DOI: 10.1007/s10796-023-10432-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10796-023-10432-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10796-023-10432-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Daniel D. Lee & H. Sebastian Seung, 1999. "Learning the parts of objects by non-negative matrix factorization," Nature, Nature, vol. 401(6755), pages 788-791, October.
    2. Greene, Derek & Cross, James P., 2017. "Exploring the Political Agenda of the European Parliament Using a Dynamic Topic Modeling Approach," Political Analysis, Cambridge University Press, vol. 25(1), pages 77-94, January.
    3. Guy Paré & Mary Tate & David Johnstone & Spyros Kitsiou, 2016. "Contextualizing the twin concepts of systematicity and transparency in information systems literature reviews," European Journal of Information Systems, Taylor & Francis Journals, vol. 25(6), pages 493-508, November.
    4. Rowe, Gene & Wright, George, 1999. "The Delphi technique as a forecasting tool: issues and analysis," International Journal of Forecasting, Elsevier, vol. 15(4), pages 353-375, October.
    5. Ghassan Beydoun & Babak Abedin & José M. Merigó & Melanie Vera, 2019. "Twenty Years of Information Systems Frontiers," Information Systems Frontiers, Springer, vol. 21(2), pages 485-494, April.
    6. J. Piet Hausberg & Sabrina Korreck, 2020. "Business incubators and accelerators: a co-citation analysis-based, systematic literature review," The Journal of Technology Transfer, Springer, vol. 45(1), pages 151-176, February.
    7. Arun Varghese & Michelle Cawley & Tao Hong, 2018. "Supervised clustering for automated document classification and prioritization: a case study using toxicological abstracts," Environment Systems and Decisions, Springer, vol. 38(3), pages 398-414, September.
    8. Chyi-Kwei Yau & Alan Porter & Nils Newman & Arho Suominen, 2014. "Clustering scientific documents with topic modeling," Scientometrics, Springer;Akadémiai Kiadó, vol. 100(3), pages 767-786, September.
    9. Vahe Tshitoyan & John Dagdelen & Leigh Weston & Alexander Dunn & Ziqin Rong & Olga Kononova & Kristin A. Persson & Gerbrand Ceder & Anubhav Jain, 2019. "Unsupervised word embeddings capture latent knowledge from materials science literature," Nature, Nature, vol. 571(7763), pages 95-98, July.
    10. John S. Liu & Chung-Huei Kuan, 2016. "A new approach for main path analysis: Decay in knowledge diffusion," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 67(2), pages 465-476, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ancil Crayton, 2018. "Central Bank Communication and the Yield Curve: A Semi-Automatic Approach using Non-Negative Matrix Factorization," Papers 1809.08718, arXiv.org.
    2. Amir Hossein Azadnia & Simon Stephens & Pezhman Ghadimi & George Onofrei, 2022. "A comprehensive performance measurement framework for business incubation centres: Empirical evidence in an Irish context," Business Strategy and the Environment, Wiley Blackwell, vol. 31(5), pages 2437-2455, July.
    3. Marko Orošnjak & Branko Štrbac & Srđan Vulanović & Biserka Runje & Amalija Horvatić Novak & Andrej Razumić, 2024. "RCE (rationale–cogency–extent) criterion unravels features affecting citation impact of top-ranked systematic literature reviews: leaving the impression…is all you need," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(3), pages 1891-1947, March.
    4. Simon Jehnen & Joaqu'in Ordieres-Mer'e & Javier Villalba-D'iez, 2025. "FinTextSim: Enhancing Financial Text Analysis with BERTopic," Papers 2504.15683, arXiv.org.
    5. Jia-Min Lu & Hui-Feng Wang & Qi-Hang Guo & Jian-Wei Wang & Tong-Tong Li & Ke-Xin Chen & Meng-Ting Zhang & Jian-Bo Chen & Qian-Nuan Shi & Yi Huang & Shao-Wen Shi & Guang-Yong Chen & Jian-Zhang Pan & Zh, 2024. "Roboticized AI-assisted microfluidic photocatalytic synthesis and screening up to 10,000 reactions per day," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    6. Rafael Teixeira & Mário Antunes & Diogo Gomes & Rui L. Aguiar, 2024. "Comparison of Semantic Similarity Models on Constrained Scenarios," Information Systems Frontiers, Springer, vol. 26(4), pages 1307-1330, August.
    7. Tingcan Ma & Ruinan Li & Guiyan Ou & Mingliang Yue, 2018. "Topic based research competitiveness evaluation," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(2), pages 789-803, November.
    8. Prommer, Lisa & Tiberius, Victor & Kraus, Sascha, 2020. "Exploring the future of startup leadership development," Journal of Business Venturing Insights, Elsevier, vol. 14(C).
    9. Del Corso, Gianna M. & Romani, Francesco, 2019. "Adaptive nonnegative matrix factorization and measure comparisons for recommender systems," Applied Mathematics and Computation, Elsevier, vol. 354(C), pages 164-179.
    10. Bas Kolen & Matthijs Kok & Ira Helsloot & Bob Maaskant, 2013. "EvacuAid: A Probabilistic Model to Determine the Expected Loss of Life for Different Mass Evacuation Strategies During Flood Threats," Risk Analysis, John Wiley & Sons, vol. 33(7), pages 1312-1333, July.
    11. P Fogel & C Geissler & P Cotte & G Luta, 2022. "Applying separative non-negative matrix factorization to extra-financial data," Working Papers hal-03689774, HAL.
    12. Xiao-Bai Li & Jialun Qin, 2017. "Anonymizing and Sharing Medical Text Records," Information Systems Research, INFORMS, vol. 28(2), pages 332-352, June.
    13. Meissner, Philip & Brands, Christian & Wulf, Torsten, 2017. "Quantifiying blind spots and weak signals in executive judgment: A structured integration of expert judgment into the scenario development process," International Journal of Forecasting, Elsevier, vol. 33(1), pages 244-253.
    14. Chao Min & Qingyu Chen & Erjia Yan & Yi Bu & Jianjun Sun, 2021. "Citation cascade and the evolution of topic relevance," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 72(1), pages 110-127, January.
    15. Ananthan Nambiar & Tobias Rubel & James McCaull & Jon deVries & Mark Bedau, 2021. "Dropping diversity of products of large US firms: Models and measures," Papers 2110.08367, arXiv.org.
    16. Fabio Salamanca-Buentello & Mary V Seeman & Abdallah S Daar & Ross E G Upshur, 2020. "The ethical, social, and cultural dimensions of screening for mental health in children and adolescents of the developing world," PLOS ONE, Public Library of Science, vol. 15(8), pages 1-25, August.
    17. Prianto Budi Saptono & Gustofan Mahmud & Intan Pratiwi & Dwi Purwanto & Ismail Khozen & Muhamad Akbar Aditama & Siti Khodijah & Maria Eurelia Wayan & Rina Yuliastuty Asmara & Ferry Jie, 2023. "Development of Climate-Related Disclosure Indicators for Application in Indonesia: A Delphi Method Study," Sustainability, MDPI, vol. 15(14), pages 1-25, July.
    18. János Abonyi & Ádám Ipkovich & Gyula Dörgő & Károly Héberger, 2023. "Matrix factorization-based multi-objective ranking–What makes a good university?," PLOS ONE, Public Library of Science, vol. 18(4), pages 1-30, April.
    19. Naiyang Guan & Lei Wei & Zhigang Luo & Dacheng Tao, 2013. "Limited-Memory Fast Gradient Descent Method for Graph Regularized Nonnegative Matrix Factorization," PLOS ONE, Public Library of Science, vol. 8(10), pages 1-10, October.
    20. Amiri, Babak & Karimianghadim, Ramin, 2024. "A novel text clustering model based on topic modelling and social network analysis," Chaos, Solitons & Fractals, Elsevier, vol. 181(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:infosf:v:27:y:2025:i:1:d:10.1007_s10796-023-10432-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.