IDEAS home Printed from https://ideas.repec.org/p/arx/papers/1712.03152.html

Aggregating Google Trends: Multivariate Testing and Analysis

Author

Listed:
  • Stephen L. France
  • Yuying Shi

Abstract

Web search data are a valuable source of business and economic information. Previous studies have utilized Google Trends web search data for economic forecasting. We expand this work by providing algorithms to combine and aggregate search volume data, so that the resulting data are consistent both over time and between data series. We give a brand equity example, where Google Trends is used to analyze shopping data for 100 top-ranked brands and these data are used to nowcast economic variables. We describe the importance of out-of-sample prediction and show how principal component analysis (PCA) can be used to improve the signal-to-noise ratio and prevent overfitting in nowcasting models. We give a finance example, where exploratory data analysis and classification are used to analyze the relationship between Google Trends searches and stock prices.
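To illustrate what "consistent between data series" can mean in practice, the sketch below rescales one batch of search index values onto the scale of another. Google Trends reports each request on its own 0-100 scale, so values from separate requests are not directly comparable. This is a minimal sketch under stated assumptions, not the algorithm from the paper: it assumes both batches contain a shared term (here the hypothetical key "anchor") over the same dates, and it uses a simple ratio-of-sums scaling factor. The function name and all numbers are made up for illustration.

```python
def rescale_to_reference(batch_a, batch_b, anchor):
    """Put batch_b on the scale of batch_a using a term present in both.

    Each batch maps a search term to a list of index values covering the
    same dates. `anchor` is the shared term (an assumption of this sketch).
    Illustrative ratio-of-sums scaling only, not the paper's method.
    """
    total_a = sum(batch_a[anchor])
    total_b = sum(batch_b[anchor])
    if total_b == 0:
        raise ValueError("anchor series sums to zero in batch_b")
    factor = total_a / total_b

    # Keep batch_a as is, then add the rescaled non-anchor series of batch_b.
    merged = {term: list(values) for term, values in batch_a.items()}
    for term, values in batch_b.items():
        if term != anchor:
            merged[term] = [v * factor for v in values]
    return merged


# Made-up values: each batch was scaled to its own 0-100 range.
batch_a = {"anchor": [50, 100, 50], "brand_x": [10, 20, 30]}
batch_b = {"anchor": [25, 50, 25], "brand_y": [100, 80, 60]}

merged = rescale_to_reference(batch_a, batch_b, "anchor")
print(merged["brand_y"])
```

In this toy case the anchor term is twice as large in the first batch as in the second, so every series in the second batch is doubled before the two batches are merged. A real pipeline would also need to handle zero or very small anchor values, which make the ratio unstable.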

Suggested Citation

  • Stephen L. France & Yuying Shi, 2017. "Aggregating Google Trends: Multivariate Testing and Analysis," Papers 1712.03152, arXiv.org, revised Mar 2018.
  • Handle: RePEc:arx:papers:1712.03152

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1712.03152
    File Function: Latest version
    Download Restriction: no

    Citations


    Cited by:

    1. Rodrigo Mulero & Alfredo Garcia-Hiernaux, 2023. "Forecasting unemployment with Google Trends: age, gender and digital divide," Empirical Economics, Springer, vol. 65(2), pages 587-605, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. France, Stephen L. & Shi, Yuying & Kazandjian, Brett, 2021. "Web Trends: A valuable tool for business research," Journal of Business Research, Elsevier, vol. 132(C), pages 666-679.
    2. Monge, Manuel & Claudio-Quiroga, Gloria & Poza, Carlos, 2024. "Chinese economic behavior in times of covid-19. A new leading economic indicator based on Google trends," International Economics, Elsevier, vol. 177(C).
    3. Benedikt Maas, 2020. "Short‐term forecasting of the US unemployment rate," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(3), pages 394-411, April.
    4. Yang, Xin & Pan, Bing & Evans, James A. & Lv, Benfu, 2015. "Forecasting Chinese tourist volume with search engine data," Tourism Management, Elsevier, vol. 46(C), pages 386-397.
    5. Zhongchen Song & Tom Coupé, 2023. "Predicting Chinese consumption series with Baidu," Journal of Chinese Economic and Business Studies, Taylor & Francis Journals, vol. 21(3), pages 429-463, July.
    6. Coble, David & Pincheira, Pablo, 2017. "Nowcasting Building Permits with Google Trends," MPRA Paper 76514, University Library of Munich, Germany.
    7. Schaer, Oliver & Kourentzes, Nikolaos & Fildes, Robert, 2019. "Demand forecasting with user-generated online information," International Journal of Forecasting, Elsevier, vol. 35(1), pages 197-212.
    8. van der Wielen, Wouter & Barrios, Salvador, 2021. "Economic sentiment during the COVID pandemic: Evidence from search behaviour in the EU," Journal of Economics and Business, Elsevier, vol. 115(C).
    9. Tuhkuri, Joonas, 2016. "ETLAnow: A Model for Forecasting with Big Data – Forecasting Unemployment with Google Searches in Europe," ETLA Reports 54, The Research Institute of the Finnish Economy.
    10. Chien-jung Ting & Yi-Long Hsiao, 2022. "Nowcasting the GDP in Taiwan and the Real-Time Tourism Data," Advances in Management and Applied Economics, SCIENPRESS Ltd, vol. 12(3), pages 1-2.
    11. Nikolaos Askitas & Klaus F. Zimmermann, 2015. "The internet as a data source for advancement in social sciences," International Journal of Manpower, Emerald Group Publishing Limited, vol. 36(1), pages 2-12, April.
    12. Götz, Thomas B. & Knetsch, Thomas A., 2019. "Google data in bridge equation models for German GDP," International Journal of Forecasting, Elsevier, vol. 35(1), pages 45-66.
    13. Fondeur, Y. & Karamé, F., 2013. "Can Google data help predict French youth unemployment?," Economic Modelling, Elsevier, vol. 30(C), pages 117-125.
    14. Chien-jung Ting & Yi-Long Hsiao & Rui-jun Su, 2022. "Application of the Real-Time Tourism Data in Nowcasting the Service Consumption in Taiwan," Journal of Applied Finance & Banking, SCIENPRESS Ltd, vol. 12(4), pages 1-4.
    15. Oestmann Marco & Bennöhr Lars, 2015. "Determinants of house price dynamics. What can we learn from search engine data?," Review of Economics, De Gruyter, vol. 66(1), pages 99-127, April.
    16. Sarun Kamolthip, 2021. "Macroeconomic Forecasting with LSTM and Mixed Frequency Time Series Data," PIER Discussion Papers 165, Puey Ungphakorn Institute for Economic Research.
    17. Philip ME Garboden, 2019. "Sources and Types of Big Data for Macroeconomic Forecasting," Working Papers 2019-3, University of Hawaii Economic Research Organization, University of Hawaii at Manoa.
    18. Zhang, Chuan & Tian, Yu-Xin & Fan, Zhi-Ping, 2022. "Forecasting sales using online review and search engine data: A method based on PCA–DSFOA–BPNN," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1005-1024.
    19. Tuhkuri, Joonas, 2016. "Forecasting Unemployment with Google Searches," ETLA Working Papers 35, The Research Institute of the Finnish Economy.
    20. Olivier Gergaud & Victor Ginsburgh, 2016. "Evaluating the Economic Effects of Cultural Events," Working Papers ECARES ECARES 2016-24, ULB -- Universite Libre de Bruxelles.
