IDEAS home Printed from https://ideas.repec.org/a/spr/infotm/v17y2016i2d10.1007_s10799-015-0238-0.html
   My bibliography  Save this article

Chinese trending search terms popularity rank prediction

Author

Listed:
  • Soyeon Caren Han

    (University of Tasmania)

  • Yulu Liang

    (University of Tasmania)

  • Hyunsuk Chung

    (University of Tasmania)

  • Hyejin Kim

    (Sungshin Women’s University)

  • Byeong Ho Kang

    (University of Tasmania)

Abstract

Baidu, the most popular Chinese search engine, monitors what their users are currently searching and provides top 50 search terms, called trending search terms, in descending order of popularity ranking. The paper focused on predicting the popularity ranking trends of this top trending search terms in Baidu. Based on the data analysis, two issues were identified that could affect accuracy of using the ranking data for predicting the popularity of trending searched terms. Firstly, all trending terms are disappeared from the top 50 terms list when the popularity is getting lower. However, there are several trending terms that reappear to the top 50 terms list after they disappeared. New distinct search terms can be differentiated from reappearances of old terms so we proposed the term distinction model by using the related news articles of a trending search term provided by Baidu. Secondly, it is necessary to handle the missing value when the term is out of the trending term list. To achieve the goal of this paper, we collected top 50 trending search terms from Baidu engine and its related news articles hourly for 6 months (from 1st March 2013 to 31th August 2013). Based on the proposed model, we found that the optimal disappearing interval can be 9 h, and using rank 51 for the missing values was the most successful. We conducted evaluations by using 3 months data (from 1st September 2013 to 30th November 2013), and four machine learning techniques where compared to evaluate the most accurate for predicting the popularity rank of trending search terms. Feed Forward Neural Network was achieved 78.81 % the most highest prediction accuracy, and achieved 85.55 % accuracy in ±3 error range.

Suggested Citation

  • Soyeon Caren Han & Yulu Liang & Hyunsuk Chung & Hyejin Kim & Byeong Ho Kang, 2016. "Chinese trending search terms popularity rank prediction," Information Technology and Management, Springer, vol. 17(2), pages 133-139, June.
  • Handle: RePEc:spr:infotm:v:17:y:2016:i:2:d:10.1007_s10799-015-0238-0
    DOI: 10.1007/s10799-015-0238-0
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10799-015-0238-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10799-015-0238-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Simeon Vosen & Torsten Schmidt, 2011. "Forecasting private consumption: survey‐based indicators vs. Google trends," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 30(6), pages 565-578, September.
    2. Kesten Green & J. Scott Armstrong & Andreas Graefe, 2007. "Methods to Elicit Forecasts from Groups: Delphi and Prediction Markets Compared," Foresight: The International Journal of Applied Forecasting, International Institute of Forecasters, issue 8, pages 17-20, Fall.
    3. Nesreen Ahmed & Amir Atiya & Neamat El Gayar & Hisham El-Shishiny, 2010. "An Empirical Comparison of Machine Learning Models for Time Series Forecasting," Econometric Reviews, Taylor & Francis Journals, vol. 29(5-6), pages 594-621.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Martin Obschonka & Mingjie Zhou & Yixin Zhou & Jianxin Zhang & Rainer K. Silbereisen, 2019. "“Confucian” traits, entrepreneurial personality, and entrepreneurship in China: a regional analysis," Small Business Economics, Springer, vol. 53(4), pages 961-979, December.
    2. Zhongchen Song & Tom Coupé, 2023. "Predicting Chinese consumption series with Baidu," Journal of Chinese Economic and Business Studies, Taylor & Francis Journals, vol. 21(3), pages 429-463, July.
    3. Bartram, Söhnke & Branke, Jürgen & Motahari, Mehrshad, 2020. "Artificial Intelligence in Asset Management," CEPR Discussion Papers 14525, C.E.P.R. Discussion Papers.
    4. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    5. Yang, Xin & Pan, Bing & Evans, James A. & Lv, Benfu, 2015. "Forecasting Chinese tourist volume with search engine data," Tourism Management, Elsevier, vol. 46(C), pages 386-397.
    6. Schaer, Oliver & Kourentzes, Nikolaos & Fildes, Robert, 2019. "Demand forecasting with user-generated online information," International Journal of Forecasting, Elsevier, vol. 35(1), pages 197-212.
    7. Kock, Anders Bredahl & Teräsvirta, Timo, 2014. "Forecasting performances of three automated modelling techniques during the economic crisis 2007–2009," International Journal of Forecasting, Elsevier, vol. 30(3), pages 616-631.
    8. Chauvet, Marcelle & Gabriel, Stuart & Lutz, Chandler, 2016. "Mortgage default risk: New evidence from internet search queries," Journal of Urban Economics, Elsevier, vol. 96(C), pages 91-111.
    9. Torsten Schmidt & Simeon Vosen, 2012. "Using Internet Data to Account for Special Events in Economic Forecasting," Ruhr Economic Papers 0382, Rheinisch-Westfälisches Institut für Wirtschaftsforschung, Ruhr-Universität Bochum, Universität Dortmund, Universität Duisburg-Essen.
    10. Jacques Bughin, 2015. "Google searches and twitter mood: nowcasting telecom sales performance," Netnomics, Springer, vol. 16(1), pages 87-105, August.
    11. Fu, Chun & Miller, Clayton, 2022. "Using Google Trends as a proxy for occupant behavior to predict building energy consumption," Applied Energy, Elsevier, vol. 310(C).
    12. Petar Soric & Enric Monte & Salvador Torra & Oscar Claveria, 2022. ""Density forecasts of inflation using Gaussian process regression models"," IREA Working Papers 202210, University of Barcelona, Research Institute of Applied Economics, revised Jul 2022.
    13. Robert Lehmann, 2021. "Forecasting exports across Europe: What are the superior survey indicators?," Empirical Economics, Springer, vol. 60(5), pages 2429-2453, May.
    14. Daniel E. O'Leary, 2024. "Toward an extended framework of exhaust data for predictive analytics: An empirical approach," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 31(2), June.
    15. Maghsoodi, Abtin Ijadi, 2023. "Cryptocurrency portfolio allocation using a novel hybrid and predictive big data decision support system," Omega, Elsevier, vol. 115(C).
    16. Samya Tajmouati & Bouazza El Wahbi & Mohamed Dakkon, 2023. "Classical and fast parameters tuning in nearest neighbors with stop condition," OPSEARCH, Springer;Operational Research Society of India, vol. 60(3), pages 1063-1081, September.
    17. Georgios Bampinas & Theodore Panagiotidis & Christina Rouska, 2019. "Volatility persistence and asymmetry under the microscope: the role of information demand for gold and oil," Scottish Journal of Political Economy, Scottish Economic Society, vol. 66(1), pages 180-197, February.
    18. Vosen, Simeon & Schmidt, Torsten, 2012. "A monthly consumption indicator for Germany based on Internet search query data," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 19(7), pages 683-687.
    19. Onur Enginar & Kazim Baris Atici, 2022. "Optimal forecast error as an unbiased estimator of abnormal return: A proposition," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(1), pages 158-166, January.
    20. Akın, Melda, 2015. "A novel approach to model selection in tourism demand modeling," Tourism Management, Elsevier, vol. 48(C), pages 64-72.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:infotm:v:17:y:2016:i:2:d:10.1007_s10799-015-0238-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.