
Analysis of co-occurrence toponyms in web pages based on complex networks

Author

Listed:
  • Zhong, Xiang
  • Liu, Jiajun
  • Gao, Yong
  • Wu, Lun

Abstract

Web pages and other documents contain a large number of toponyms, which provide abundant geographical resources for GIS. Toponyms commonly co-occur within the same document. To investigate the relations among geographic entities that such co-occurrence implies, a novel complex network model for co-occurring toponyms is proposed. Twelve toponym co-occurrence networks are then constructed from the toponym sets extracted from the People’s Daily Paper documents of 2010. Two toponyms are found to have a high co-occurrence probability if they are at the same administrative level or if they have a part-whole relationship. Applying complex network analysis methods to the toponym co-occurrence networks reveals the following characteristics. (1) The navigation vertices of the co-occurrence networks can be identified by degree centrality analysis. (2) The networks exhibit strong clustering, and only a few steps are needed to reach one vertex from another, implying that the networks are small-world graphs. (3) The degree distribution follows a power law with an exponent of 1.7, so the networks are scale-free. (4) The networks are disassortative and have similar assortative modes, with assortative exponents of approximately 0.18 and assortative indexes less than 0. (5) The frequency of toponym co-occurrence is weakly negatively correlated with geographic distance but more strongly negatively correlated with administrative hierarchical distance. Considering both toponym frequencies and co-occurrence relationships, a novel method based on link analysis is presented to extract the core toponyms from web pages. This method is suitable and effective for geographical information retrieval.
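As a rough illustration of the kinds of measures the abstract names (degree centrality, clustering, path length, degree assortativity), the sketch below builds a toponym co-occurrence network from a toy corpus and computes each one with the Python standard library. It is not the authors' model or data: the `DOCS` corpus, the function names, and the choice of one undirected edge per co-occurring pair are all assumptions made for illustration.

```python
from collections import Counter, deque
from itertools import combinations

# Hypothetical toy corpus: each set holds the toponyms found in one document.
DOCS = [
    {"China", "Beijing", "Shanghai"},
    {"China", "Beijing"},
    {"China", "Shanghai", "Guangzhou"},
    {"Beijing", "Haidian"},
    {"China", "Guangzhou"},
]

def build_network(docs):
    """Return (co-occurrence counts per pair, adjacency sets per toponym)."""
    weights = Counter()
    for toponyms in docs:
        # Every unordered pair in the same document co-occurs once.
        for a, b in combinations(sorted(toponyms), 2):
            weights[(a, b)] += 1
    adj = {}
    for a, b in weights:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)
    return weights, adj

def degree_centrality(adj):
    """Degree divided by the maximum possible degree (n - 1)."""
    n = len(adj)
    return {v: len(nb) / (n - 1) for v, nb in adj.items()}

def average_clustering(adj):
    """Mean local clustering coefficient; vertices of degree < 2 count as 0."""
    total = 0.0
    for nb in adj.values():
        k = len(nb)
        if k < 2:
            continue
        links = sum(1 for a, b in combinations(nb, 2) if b in adj[a])
        total += 2 * links / (k * (k - 1))
    return total / len(adj)

def average_path_length(adj):
    """Mean shortest-path length over reachable ordered pairs (BFS)."""
    total = pairs = 0
    for src in adj:
        dist = {src: 0}
        queue = deque([src])
        while queue:
            v = queue.popleft()
            for w in adj[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs

def degree_assortativity(adj):
    """Pearson correlation of the degrees at the two ends of each edge."""
    ends = [(len(adj[v]), len(adj[w])) for v in adj for w in adj[v]]
    m = len(ends)
    mean = sum(x for x, _ in ends) / m
    cov = sum((x - mean) * (y - mean) for x, y in ends) / m
    var = sum((x - mean) ** 2 for x, _ in ends) / m
    return cov / var

if __name__ == "__main__":
    weights, adj = build_network(DOCS)
    print("degree centrality:", degree_centrality(adj))
    print("average clustering:", average_clustering(adj))
    print("average path length:", average_path_length(adj))
    print("degree assortativity:", degree_assortativity(adj))
```

High average clustering combined with a short average path length is the usual small-world signature, and a negative assortativity coefficient indicates a disassortative network. A five-document corpus is far too small to say anything about a power-law degree distribution, so that measure is left out of the sketch.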

Suggested Citation

  • Zhong, Xiang & Liu, Jiajun & Gao, Yong & Wu, Lun, 2017. "Analysis of co-occurrence toponyms in web pages based on complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 466(C), pages 462-475.
  • Handle: RePEc:eee:phsmap:v:466:y:2017:i:c:p:462-475
    DOI: 10.1016/j.physa.2016.09.024

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437116306409
    Download Restriction: Full text for ScienceDirect subscribers only. The journal offers the option of making the article available online on ScienceDirect for a fee of $3,000.


    References listed on IDEAS

    1. Barabási, Albert-László & Ravasz, Erzsébet & Vicsek, Tamás, 2001. "Deterministic scale-free networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 299(3), pages 559-564.
    2. Jun-Ping Qiu & Ke Dong & Hou-Qiang Yu, 2014. "Comparative study on structure and correlation among author co-occurrence networks in bibliometrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(2), pages 1345-1360, November.
    3. Scott, John, 1988. "Social Network Analysis and Intercorporate Relations," Hitotsubashi Journal of Commerce and Management, Hitotsubashi University, vol. 23(1), pages 53-68, December.
    4. Henry Small, 1973. "Co‐citation in the scientific literature: A new measure of the relationship between two documents," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 24(4), pages 265-269, July.
    5. Liang, Wei & Wang, Yanli & Shi, Yuming & Chen, Guanrong, 2015. "Co-occurrence network analysis of modern Chinese poems," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 420(C), pages 284-293.
    6. Liang, Wei & Shi, Yuming & Tse, Chi K. & Liu, Jing & Wang, Yanli & Cui, Xunqiang, 2009. "Comparison of co-occurrence networks of the Chinese and English languages," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 388(23), pages 4901-4909.
    7. Howard D. White & Belver C. Griffith, 1981. "Author cocitation: A literature measure of intellectual structure," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 32(3), pages 163-171, May.

    Citations



    Cited by:

    1. Shakibian, Hadi & Charkari, Nasrollah Moghadam, 2018. "Statistical similarity measures for link prediction in heterogeneous complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 501(C), pages 248-263.
    2. Ma, Jun-Chao & Wang, Li & Jiang, Zhi-Qiang & Yan, Wanfeng & Zhou, Wei-Xing, 2021. "City logistics networks based on online freight orders in China," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 583(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gaviria-Marin, Magaly & Merigó, José M. & Baier-Fuentes, Hugo, 2019. "Knowledge management: A global examination based on bibliometric analysis," Technological Forecasting and Social Change, Elsevier, vol. 140(C), pages 194-220.
    2. Pamela E. Sandstrom, 2001. "Scholarly communication as a socioecological system," Scientometrics, Springer;Akadémiai Kiadó, vol. 51(3), pages 573-605, July.
    3. Jianhua Hou, 2017. "Exploration into the evolution and historical roots of citation analysis by referenced publication year spectroscopy," Scientometrics, Springer;Akadémiai Kiadó, vol. 110(3), pages 1437-1452, March.
    4. Yu-Wei Chang & Mu-Hsuan Huang & Chiao-Wen Lin, 2015. "Evolution of research subjects in library and information science based on keyword, bibliographical coupling, and co-citation analyses," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(3), pages 2071-2087, December.
    5. Ying Huang & Wolfgang Glänzel & Lin Zhang, 2021. "Tracing the development of mapping knowledge domains," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 6201-6224, July.
    6. João J. M. Ferreira & Cristina I. Fernandes & Sascha Kraus, 2019. "Entrepreneurship research: mapping intellectual structures and research trends," Review of Managerial Science, Springer, vol. 13(1), pages 181-205, February.
    7. Masaki Eto, 2013. "Evaluations of context-based co-citation searching," Scientometrics, Springer;Akadémiai Kiadó, vol. 94(2), pages 651-673, February.
    8. Georg Groh & Christoph Fuchs, 2011. "Multi-modal social networks for modeling scientific fields," Scientometrics, Springer;Akadémiai Kiadó, vol. 89(2), pages 569-590, November.
    9. Xuerong Li & Han Qiao & Shouyang Wang, 2017. "Exploring evolution and emerging trends in business model study: a co-citation analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 869-887, May.
    10. Perianes-Rodriguez, Antonio & Waltman, Ludo & van Eck, Nees Jan, 2016. "Constructing bibliometric networks: A comparison between full and fractional counting," Journal of Informetrics, Elsevier, vol. 10(4), pages 1178-1195.
    11. João Paulo Coelho Ribeiro & Fábio Duarte & Ana Paula Matias Gama, 2022. "Does microfinance foster the development of its clients? A bibliometric analysis and systematic literature review," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-35, December.
    12. José Luis Ortega Priego, 2003. "A Vector Space Model as a methodological approach to the Triple Helix dimensionality: A comparative study of Biology and Biomedicine Centres of two European National Research Councils from a Webometri," Scientometrics, Springer;Akadémiai Kiadó, vol. 58(2), pages 429-443, October.
    13. Paúl Carrión-Mero & Néstor Montalván-Burbano & Fernando Morante-Carballo & Adolfo Quesada-Román & Boris Apolo-Masache, 2021. "Worldwide Research Trends in Landslide Science," IJERPH, MDPI, vol. 18(18), pages 1-24, September.
    14. Ruth Zárate-Rueda & Yolima Ivonne Beltrán-Villamizar & Daniella Murallas-Sánchez, 2021. "Social representations of socioenvironmental dynamics in extractive ecosystems and conservation practices with sustainable development: a bibliometric analysis," Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development, Springer, vol. 23(11), pages 16428-16453, November.
    15. Floriana Fusco & Marta Marsilio & Chiara Guglielmetti, 2018. "La co-production in sanità: un'analisi bibliometrica," MECOSAN, FrancoAngeli Editore, vol. 2018(108), pages 35-54.
    16. Muaz Niazi & Amir Hussain, 2011. "Agent-based computing from multi-agent systems to agent-based models: a visual survey," Scientometrics, Springer;Akadémiai Kiadó, vol. 89(2), pages 479-499, November.
    17. Antonio Rafael Ramos-Rodriguez & Salustiano Martinez-Fierro & Jose Aurelio Medina-Garrido & Jose Ruiz-Navarro, 2023. "Global Entrepreneurship Monitor versus Panel Study of Entrepreneurial Dynamics: comparing their intellectual structures," Papers 2401.13684, arXiv.org.
    18. Boyack, Kevin W. & Klavans, Richard, 2014. "Including cited non-source items in a large-scale map of science: What difference does it make?," Journal of Informetrics, Elsevier, vol. 8(3), pages 569-580.
    19. Markus Gmür, 2003. "Co-citation analysis and the search for invisible colleges: A methodological evaluation," Scientometrics, Springer;Akadémiai Kiadó, vol. 57(1), pages 27-57, January.
    20. Saurav Chandra Talukder & Zoltán Lakner, 2023. "Exploring the Landscape of Social Entrepreneurship and Crowdfunding: A Bibliometric Analysis," Sustainability, MDPI, vol. 15(12), pages 1-22, June.
