IDEAS home Printed from https://ideas.repec.org/a/spr/infotm/v17y2016i2d10.1007_s10799-015-0238-0.html
   My bibliography  Save this article

Chinese trending search terms popularity rank prediction

Author

Listed:
  • Soyeon Caren Han

    (University of Tasmania)

  • Yulu Liang

    (University of Tasmania)

  • Hyunsuk Chung

    (University of Tasmania)

  • Hyejin Kim

    (Sungshin Women’s University)

  • Byeong Ho Kang

    (University of Tasmania)

Abstract

Baidu, the most popular Chinese search engine, monitors what their users are currently searching and provides top 50 search terms, called trending search terms, in descending order of popularity ranking. The paper focused on predicting the popularity ranking trends of this top trending search terms in Baidu. Based on the data analysis, two issues were identified that could affect accuracy of using the ranking data for predicting the popularity of trending searched terms. Firstly, all trending terms are disappeared from the top 50 terms list when the popularity is getting lower. However, there are several trending terms that reappear to the top 50 terms list after they disappeared. New distinct search terms can be differentiated from reappearances of old terms so we proposed the term distinction model by using the related news articles of a trending search term provided by Baidu. Secondly, it is necessary to handle the missing value when the term is out of the trending term list. To achieve the goal of this paper, we collected top 50 trending search terms from Baidu engine and its related news articles hourly for 6 months (from 1st March 2013 to 31th August 2013). Based on the proposed model, we found that the optimal disappearing interval can be 9 h, and using rank 51 for the missing values was the most successful. We conducted evaluations by using 3 months data (from 1st September 2013 to 30th November 2013), and four machine learning techniques where compared to evaluate the most accurate for predicting the popularity rank of trending search terms. Feed Forward Neural Network was achieved 78.81 % the most highest prediction accuracy, and achieved 85.55 % accuracy in ±3 error range.

Suggested Citation

  • Soyeon Caren Han & Yulu Liang & Hyunsuk Chung & Hyejin Kim & Byeong Ho Kang, 2016. "Chinese trending search terms popularity rank prediction," Information Technology and Management, Springer, vol. 17(2), pages 133-139, June.
  • Handle: RePEc:spr:infotm:v:17:y:2016:i:2:d:10.1007_s10799-015-0238-0
    DOI: 10.1007/s10799-015-0238-0
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10799-015-0238-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10799-015-0238-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Simeon Vosen & Torsten Schmidt, 2011. "Forecasting private consumption: survey‐based indicators vs. Google trends," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 30(6), pages 565-578, September.
    2. Kesten Green & J. Scott Armstrong & Andreas Graefe, 2007. "Methods to Elicit Forecasts from Groups: Delphi and Prediction Markets Compared," Foresight: The International Journal of Applied Forecasting, International Institute of Forecasters, issue 8, pages 17-20, Fall.
    3. Nesreen Ahmed & Amir Atiya & Neamat El Gayar & Hisham El-Shishiny, 2010. "An Empirical Comparison of Machine Learning Models for Time Series Forecasting," Econometric Reviews, Taylor & Francis Journals, vol. 29(5-6), pages 594-621.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Martin Obschonka & Mingjie Zhou & Yixin Zhou & Jianxin Zhang & Rainer K. Silbereisen, 2019. "“Confucian” traits, entrepreneurial personality, and entrepreneurship in China: a regional analysis," Small Business Economics, Springer, vol. 53(4), pages 961-979, December.
    2. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    3. Yang, Xin & Pan, Bing & Evans, James A. & Lv, Benfu, 2015. "Forecasting Chinese tourist volume with search engine data," Tourism Management, Elsevier, vol. 46(C), pages 386-397.
    4. Jacques Bughin, 2015. "Google searches and twitter mood: nowcasting telecom sales performance," Netnomics, Springer, vol. 16(1), pages 87-105, August.
    5. Fu, Chun & Miller, Clayton, 2022. "Using Google Trends as a proxy for occupant behavior to predict building energy consumption," Applied Energy, Elsevier, vol. 310(C).
    6. Petar Soric & Enric Monte & Salvador Torra & Oscar Claveria, 2022. ""Density forecasts of inflation using Gaussian process regression models"," IREA Working Papers 202210, University of Barcelona, Research Institute of Applied Economics, revised Jul 2022.
    7. Robert Lehmann, 2021. "Forecasting exports across Europe: What are the superior survey indicators?," Empirical Economics, Springer, vol. 60(5), pages 2429-2453, May.
    8. Maghsoodi, Abtin Ijadi, 2023. "Cryptocurrency portfolio allocation using a novel hybrid and predictive big data decision support system," Omega, Elsevier, vol. 115(C).
    9. Vosen, Simeon & Schmidt, Torsten, 2012. "A monthly consumption indicator for Germany based on Internet search query data," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 19(7), pages 683-687.
    10. Chien-jung Ting & Yi-Long Hsiao, 2022. "Nowcasting the GDP in Taiwan and the Real-Time Tourism Data," Advances in Management and Applied Economics, SCIENPRESS Ltd, vol. 12(3), pages 1-2.
    11. Szafranek, Karol, 2019. "Bagged neural networks for forecasting Polish (low) inflation," International Journal of Forecasting, Elsevier, vol. 35(3), pages 1042-1059.
    12. Nikolaos Askitas & Klaus F. Zimmermann, 2015. "The internet as a data source for advancement in social sciences," International Journal of Manpower, Emerald Group Publishing Limited, vol. 36(1), pages 2-12, April.
    13. Huber, Jakob & Stuckenschmidt, Heiner, 2020. "Daily retail demand forecasting using machine learning with emphasis on calendric special days," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1420-1438.
    14. Andrei Dubovik & Adam Elbourne & Bram Hendriks & Mark Kattenberg, 2022. "Forecasting World Trade Using Big Data and Machine Learning Techniques," CPB Discussion Paper 441, CPB Netherlands Bureau for Economic Policy Analysis.
    15. Ivana Lolić & Marija Logarušić & Mirjana Čižmešija, 2022. "Recent Revision of the European Consumer Confidence Indicator: Is There any additional Space for Improvement?," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 159(3), pages 845-863, February.
    16. Kock, Anders Bredahl & Teräsvirta, Timo, 2014. "Forecasting performances of three automated modelling techniques during the economic crisis 2007–2009," International Journal of Forecasting, Elsevier, vol. 30(3), pages 616-631.
    17. Robert RUSU & Constantin AVRAM, 2022. "Deep Learning Systems Integrated into the Digital Strategy of a Company Involved in e-commerce," Economics and Applied Informatics, "Dunarea de Jos" University of Galati, Faculty of Economics and Business Administration, issue 1, pages 5-10.
    18. Götz, Thomas B. & Knetsch, Thomas A., 2019. "Google data in bridge equation models for German GDP," International Journal of Forecasting, Elsevier, vol. 35(1), pages 45-66.
    19. Bańbura, Marta & Giannone, Domenico & Modugno, Michele & Reichlin, Lucrezia, 2013. "Now-Casting and the Real-Time Data Flow," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 195-237, Elsevier.
    20. Milan Daus & Katharina Koberger & Kaan Koca & Felix Beckers & Jorge Encinas Fernández & Barbara Weisbrod & Daniel Dietrich & Sabine Ulrike Gerbersdorf & Rüdiger Glaser & Stefan Haun & Hilmar Hofmann &, 2021. "Interdisciplinary Reservoir Management—A Tool for Sustainable Water Resources Management," Sustainability, MDPI, vol. 13(8), pages 1-21, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:infotm:v:17:y:2016:i:2:d:10.1007_s10799-015-0238-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.