IDEAS home Printed from https://ideas.repec.org/a/kap/jgeosy/v24y2022i2d10.1007_s10109-022-00375-9.html
   My bibliography  Save this article

Chinese toponym recognition with variant neural structures from social media messages based on BERT methods

Author

Listed:
  • Kai Ma

    (China Three Gorges University)

  • YongJian Tan

    (China Three Gorges University)

  • Zhong Xie

    (National Engineering Research Center of Geographic Information System
    China University of Geosciences)

  • Qinjun Qiu

    (National Engineering Research Center of Geographic Information System
    China University of Geosciences
    China University of Geosciences)

  • Siqiong Chen

    (China University of Geosciences)

Abstract

Many natural language tasks related to geographic information retrieval (GIR) require toponym recognition, and identifying Chinese toponyms from social media messages to share real-time information is a critical problem for many practical applications, such as natural disaster response and geolocating. In this article, we focused on toponym recognition from social media messages in Chinese. While existing off-the-shelf Chinese named entity recognition (NER) tools could be applied to identify toponyms, these approaches cannot address a variety of language irregularities taken from social media messages, including location name abbreviations, informal sentence structures and combination toponyms. We present a deep neural network named BERT-BiLSTM-CRF, which extends a basic bidirectional recurrent neural network model (BiLSTM) with the pretraining bidirectional encoder representation from transformers (BERT) representation to handle the toponym recognition task in Chinese text. Using three datasets taken from lists of alternative location names, the experimental results showed that the proposed model can significantly outperform previous Chinese NER models/algorithms and a set of state-of-the-art deep learning models.

Suggested Citation

  • Kai Ma & YongJian Tan & Zhong Xie & Qinjun Qiu & Siqiong Chen, 2022. "Chinese toponym recognition with variant neural structures from social media messages based on BERT methods," Journal of Geographical Systems, Springer, vol. 24(2), pages 143-169, April.
  • Handle: RePEc:kap:jgeosy:v:24:y:2022:i:2:d:10.1007_s10109-022-00375-9
    DOI: 10.1007/s10109-022-00375-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10109-022-00375-9
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s10109-022-00375-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Robin Lovelace, 2021. "Open source tools for geographic analysis in transport planning," Journal of Geographical Systems, Springer, vol. 23(4), pages 547-578, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Higgins, Christopher D. & Páez, Antonio & Kim, Gyoorie & Wang, Jue, 2021. "Changes in accessibility to emergency and community food services during COVID-19 and implications for low income populations in Hamilton, Ontario," Social Science & Medicine, Elsevier, vol. 291(C).
    2. Antonio Páez, 2021. "Open spatial sciences: an introduction," Journal of Geographical Systems, Springer, vol. 23(4), pages 467-476, October.
    3. Elise Desjardins & Christopher D. Higgins & Darren M. Scott & Emma Apatu & Antonio Páez, 2022. "Correlates of bicycling trip flows in Hamilton, Ontario: fastest, quietest, or balanced routes?," Transportation, Springer, vol. 49(3), pages 867-895, June.
    4. Aston, Laura & Currie, Graham & Kamruzzaman, Md. & Delbosc, Alexa & Brands, Ties & van Oort, Niels & Teller, David, 2021. "Multi-city exploration of built environment and transit mode use: Comparison of Melbourne, Amsterdam and Boston," Journal of Transport Geography, Elsevier, vol. 95(C).
    5. Soomauroo, Zakia & Blechinger, Philipp & Creutzig, Felix, 2023. "Electrifying public transit benefits public finances in small island developing states," Transport Policy, Elsevier, vol. 138(C), pages 45-59.
    6. Pereira, Rafael H. M. & Andrade, Pedro R. & Bazzo Vieira, João Pedro, 2022. "Exploring the time geography of public transport networks with the gtfs2gps package," SocArXiv qydr6, Center for Open Science.
    7. Vanessa Brum-Bastos & Antonio Páez, 2023. "Hägerstrand meets big data: time-geography in the age of mobility analytics," Journal of Geographical Systems, Springer, vol. 25(3), pages 327-336, July.
    8. Rafael H. M. Pereira & Pedro R. Andrade & João Pedro Bazzo Vieira, 2023. "Exploring the time geography of public transport networks with the gtfs2gps package," Journal of Geographical Systems, Springer, vol. 25(3), pages 453-466, July.
    9. Laura Alessandretti & Luis Guillermo Natera Orozco & Meead Saberi & Michael Szell & Federico Battiston, 2023. "Multimodal urban mobility and multilayer transport networks," Environment and Planning B, , vol. 50(8), pages 2038-2070, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:jgeosy:v:24:y:2022:i:2:d:10.1007_s10109-022-00375-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.