IDEAS home Printed from https://ideas.repec.org/a/igg/jhisi0/v14y2019i4p1-20.html
   My bibliography  Save this article

Exploring Big Data Analytic Approaches to Cancer Blog Text Analysis

Author

Listed:
  • Viju Raghupathi

    (Koppelman School of Business, Brooklyn College of the City University of New York, Brooklyn, USA)

  • Yilu Zhou

    (Gabelli School of Business, Fordham University, New York, USA)

  • Wullianallur Raghupathi

    (Gabelli School of Business, Fordham University, New York, USA)

Abstract

In this article, the authors explore the potential of a big data analytics approach to unstructured text analytics of cancer blogs. The application is developed using Cloudera platform's Hadoop MapReduce framework. It uses several text analytics algorithms, including word count, word association, clustering, and classification, to identify and analyze the patterns and keywords in cancer blog postings. This article establishes an exploratory approach to involving big data analytics methods in developing text analytics applications for the analysis of cancer blogs. Additional insights are extracted through various means, including the development of categories or keywords contained in the blogs, the development of a taxonomy, and the examination of relationships among the categories. The application has the potential for generalizability and implementation with health content in other blogs and social media. It can provide insight and decision support for cancer management and facilitate efficient and relevant searches for information related to cancer.

Suggested Citation

  • Viju Raghupathi & Yilu Zhou & Wullianallur Raghupathi, 2019. "Exploring Big Data Analytic Approaches to Cancer Blog Text Analysis," International Journal of Healthcare Information Systems and Informatics (IJHISI), IGI Global, vol. 14(4), pages 1-20, October.
  • Handle: RePEc:igg:jhisi0:v:14:y:2019:i:4:p:1-20
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJHISI.2019100101
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Wullianallur Raghupathi & Dominik Molitor & Viju Raghupathi & Aditya Saharia, 2023. "Identifying Key Issues in Climate Change Litigation: A Machine Learning Text Analytic Approach," Sustainability, MDPI, vol. 15(23), pages 1-30, December.
    2. Viju Raghupathi & Jie Ren & Wullianallur Raghupathi, 2020. "Identifying Corporate Sustainability Issues by Analyzing Shareholder Resolutions: A Machine-Learning Text Analytics Approach," Sustainability, MDPI, vol. 12(11), pages 1-24, June.
    3. Viju Raghupathi & Jie Ren & Wullianallur Raghupathi, 2020. "Studying Public Perception about Vaccination: A Sentiment Analysis of Tweets," IJERPH, MDPI, vol. 17(10), pages 1-23, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jhisi0:v:14:y:2019:i:4:p:1-20. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.