IDEAS home Printed from https://ideas.repec.org/a/spr/infosf/v20y2018i5d10.1007_s10796-018-9830-2.html
   My bibliography  Save this article

Weakly Supervised and Online Learning of Word Models for Classification to Detect Disaster Reporting Tweets

Author

Listed:
  • Girish Keshav Palshikar

    (Tata Consultancy Services Limited)

  • Manoj Apte

    (Tata Consultancy Services Limited)

  • Deepak Pandita

    (University of Rochester)

Abstract

Social media has quickly established itself as an important means that people, NGOs and governments use to spread information during natural or man-made disasters, mass emergencies and crisis situations. Given this important role, real-time analysis of social media contents to locate, organize and use valuable information for disaster management is crucial. In this paper, we propose self-learning algorithms that, with minimal supervision, construct a simple bag-of-words model of information expressed in the news about various natural disasters. Such a model is human-understandable, human-modifiable and usable in a real-time scenario. Since tweets are a different category of documents than news, we next propose a model transfer algorithm, which essentially refines the model learned from news by analyzing a large unlabeled corpus of tweets. We show empirically that model transfer improves the predictive accuracy of the model. We demonstrate empirically that our model learning algorithm is better than several state of the art semi-supervised learning algorithms. Finally, we present an online algorithm that learns the weights for words in the model and demonstrate the efficacy of the model with word weights.

Suggested Citation

  • Girish Keshav Palshikar & Manoj Apte & Deepak Pandita, 2018. "Weakly Supervised and Online Learning of Word Models for Classification to Detect Disaster Reporting Tweets," Information Systems Frontiers, Springer, vol. 20(5), pages 949-959, October.
  • Handle: RePEc:spr:infosf:v:20:y:2018:i:5:d:10.1007_s10796-018-9830-2
    DOI: 10.1007/s10796-018-9830-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10796-018-9830-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10796-018-9830-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhen Zhao & Zongmin Ma & Li Yan, 2021. "An Efficient Classification of Fuzzy XML Documents Based on Kernel ELM," Information Systems Frontiers, Springer, vol. 23(3), pages 515-530, June.
    2. A. Geethapriya & S. Valli, 2021. "An Enhanced Approach to Map Domain-Specific Words in Cross-Domain Sentiment Analysis," Information Systems Frontiers, Springer, vol. 23(3), pages 791-805, June.
    3. Guizhe Song & Degen Huang, 2021. "A Sentiment-Aware Contextual Model for Real-Time Disaster Prediction Using Twitter Data," Future Internet, MDPI, vol. 13(7), pages 1-15, June.
    4. Lin-Chih Chen, 0. "Interactive Topic Search System Based on Topic Cluster Technology," Information Systems Frontiers, Springer, vol. 0, pages 1-17.
    5. Abhinav Kumar & Jyoti Prakash Singh & Nripendra P. Rana & Yogesh K. Dwivedi, 2023. "Multi-Channel Convolutional Neural Network for the Identification of Eyewitness Tweets of Disaster," Information Systems Frontiers, Springer, vol. 25(4), pages 1589-1604, August.
    6. Yanxin Wang & Jian Li & Xi Zhao & Gengzhong Feng & Xin (Robert) Luo, 2020. "Using Mobile Phone Data for Emergency Management: a Systematic Literature Review," Information Systems Frontiers, Springer, vol. 22(6), pages 1539-1559, December.
    7. Lin-Chih Chen, 2021. "Interactive Topic Search System Based on Topic Cluster Technology," Information Systems Frontiers, Springer, vol. 23(5), pages 1227-1243, September.
    8. Saptarshi Ghosh & Kripabandhu Ghosh & Debasis Ganguly & Tanmoy Chakraborty & Gareth J. F. Jones & Marie-Francine Moens & Muhammad Imran, 2018. "Exploitation of Social Media for Emergency Relief and Preparedness: Recent Research and Trends," Information Systems Frontiers, Springer, vol. 20(5), pages 901-907, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:infosf:v:20:y:2018:i:5:d:10.1007_s10796-018-9830-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.