IDEAS home Printed from https://ideas.repec.org/p/arx/papers/1712.03152.html
   My bibliography  Save this paper

Aggregating Google Trends: Multivariate Testing and Analysis

Author

Listed:
  • Stephen L. France
  • Yuying Shi

Abstract

Web search data are a valuable source of business and economic information. Previous studies have utilized Google Trends web search data for economic forecasting. We expand this work by providing algorithms to combine and aggregate search volume data, so that the resulting data is both consistent over time and consistent between data series. We give a brand equity example, where Google Trends is used to analyze shopping data for 100 top ranked brands and these data are used to nowcast economic variables. We describe the importance of out of sample prediction and show how principal component analysis (PCA) can be used to improve the signal to noise ratio and prevent overfitting in nowcasting models. We give a finance example, where exploratory data analysis and classification is used to analyze the relationship between Google Trends searches and stock prices.

Suggested Citation

  • Stephen L. France & Yuying Shi, 2017. "Aggregating Google Trends: Multivariate Testing and Analysis," Papers 1712.03152, arXiv.org, revised Mar 2018.
  • Handle: RePEc:arx:papers:1712.03152
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1712.03152
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Giannone, Domenico & Reichlin, Lucrezia & Small, David, 2008. "Nowcasting: The real-time informational content of macroeconomic data," Journal of Monetary Economics, Elsevier, vol. 55(4), pages 665-676, May.
    2. Hamid, Alain & Heiden, Moritz, 2015. "Forecasting volatility with empirical similarity and Google Trends," Journal of Economic Behavior & Organization, Elsevier, vol. 117(C), pages 62-81.
    3. Francisca Beer & Fabrice Hervé & Mohamed Zouaoui, 2013. "Is Big Brother Watching Us? Google, Investor Sentiment and the Stock Market," Economics Bulletin, AccessEcon, vol. 33(1), pages 454-466.
    4. McLaren, Nick & Shanbhogue, Rachana, 2011. "Using internet search data as economic indicators," Bank of England Quarterly Bulletin, Bank of England, vol. 51(2), pages 134-140.
    5. Simeon Vosen & Torsten Schmidt, 2011. "Forecasting private consumption: survey‐based indicators vs. Google trends," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 30(6), pages 565-578, September.
    6. Hyunyoung Choi & Hal Varian, 2012. "Predicting the Present with Google Trends," The Economic Record, The Economic Society of Australia, vol. 88(s1), pages 2-9, June.
    7. Liwen Vaughan & Yue Chen, 2015. "Data mining from web search queries: A comparison of google trends and baidu index," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 66(1), pages 13-22, January.
    8. Chris Hand & Guy Judge, 2012. "Searching for the picture: forecasting UK cinema admissions using Google Trends data," Applied Economics Letters, Taylor & Francis Journals, vol. 19(11), pages 1051-1055, July.
    9. Montero, Pablo & Vilar, José A., 2014. "TSclust: An R Package for Time Series Clustering," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 62(i01).
    10. Sigrid A Troelstra & Jizzo R Bosdriesz & Michiel R de Boer & Anton E Kunst, 2016. "Effect of Tobacco Control Policies on Information Seeking for Smoking Cessation in the Netherlands: A Google Trends Study," PLOS ONE, Public Library of Science, vol. 11(2), pages 1-16, February.
    11. Reichlin, Lucrezia & Giannone, Domenico & Small, David, 2005. "Nowcasting GDP and Inflation: The Real Time Informational Content of Macroeconomic Data Releases," CEPR Discussion Papers 5178, C.E.P.R. Discussion Papers.
    12. Tao Chen & Erin Pik Ki So & Liang Wu & Isabel Kit Ming Yan, 2015. "The 2007–2008 U.S. Recession: What Did The Real-Time Google Trends Data Tell The United States?," Contemporary Economic Policy, Western Economic Association International, vol. 33(2), pages 395-403, April.
    13. Jeremy Ginsberg & Matthew H. Mohebbi & Rajan S. Patel & Lynnette Brammer & Mark S. Smolinski & Larry Brilliant, 2009. "Detecting influenza epidemics using search engine query data," Nature, Nature, vol. 457(7232), pages 1012-1014, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Rodrigo Mulero & Alfredo Garcia-Hiernaux, 2023. "Forecasting unemployment with Google Trends: age, gender and digital divide," Empirical Economics, Springer, vol. 65(2), pages 587-605, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. France, Stephen L. & Shi, Yuying & Kazandjian, Brett, 2021. "Web Trends: A valuable tool for business research," Journal of Business Research, Elsevier, vol. 132(C), pages 666-679.
    2. Benedikt Maas, 2020. "Short‐term forecasting of the US unemployment rate," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(3), pages 394-411, April.
    3. Zhongchen Song & Tom Coupé, 2023. "Predicting Chinese consumption series with Baidu," Journal of Chinese Economic and Business Studies, Taylor & Francis Journals, vol. 21(3), pages 429-463, July.
    4. Yang, Xin & Pan, Bing & Evans, James A. & Lv, Benfu, 2015. "Forecasting Chinese tourist volume with search engine data," Tourism Management, Elsevier, vol. 46(C), pages 386-397.
    5. Schaer, Oliver & Kourentzes, Nikolaos & Fildes, Robert, 2019. "Demand forecasting with user-generated online information," International Journal of Forecasting, Elsevier, vol. 35(1), pages 197-212.
    6. Coble, David & Pincheira, Pablo, 2017. "Nowcasting Building Permits with Google Trends," MPRA Paper 76514, University Library of Munich, Germany.
    7. van der Wielen, Wouter & Barrios, Salvador, 2021. "Economic sentiment during the COVID pandemic: Evidence from search behaviour in the EU," Journal of Economics and Business, Elsevier, vol. 115(C).
    8. Tuhkuri, Joonas, 2016. "ETLAnow: A Model for Forecasting with Big Data – Forecasting Unemployment with Google Searches in Europe," ETLA Reports 54, The Research Institute of the Finnish Economy.
    9. Pirschel, Inske, 2016. "Forecasting euro area recessions in real-time," Kiel Working Papers 2020, Kiel Institute for the World Economy (IfW Kiel).
    10. Chien-jung Ting & Yi-Long Hsiao, 2022. "Nowcasting the GDP in Taiwan and the Real-Time Tourism Data," Advances in Management and Applied Economics, SCIENPRESS Ltd, vol. 12(3), pages 1-2.
    11. Nikolaos Askitas & Klaus F. Zimmermann, 2015. "The internet as a data source for advancement in social sciences," International Journal of Manpower, Emerald Group Publishing Limited, vol. 36(1), pages 2-12, April.
    12. Götz, Thomas B. & Knetsch, Thomas A., 2019. "Google data in bridge equation models for German GDP," International Journal of Forecasting, Elsevier, vol. 35(1), pages 45-66.
    13. Imene Ben El Hadj Said & Skander Slim, 2022. "The Dynamic Relationship between Investor Attention and Stock Market Volatility: International Evidence," JRFM, MDPI, vol. 15(2), pages 1-25, February.
    14. Daniel Aaronson & Scott A. Brave & R. Andrew Butters & Daniel Sacks & Boyoung Seo, 2020. "Using the Eye of the Storm to Predict the Wave of Covid-19 UI Claims," Working Paper Series WP-2020-10, Federal Reserve Bank of Chicago, revised 16 Apr 2020.
    15. Fondeur, Y. & Karamé, F., 2013. "Can Google data help predict French youth unemployment?," Economic Modelling, Elsevier, vol. 30(C), pages 117-125.
    16. Philippe Wanner, 2021. "How well can we estimate immigration trends using Google data?," Quality & Quantity: International Journal of Methodology, Springer, vol. 55(4), pages 1181-1202, August.
    17. Chien-jung Ting & Yi-Long Hsiao & Rui-jun Su, 2022. "Application of the Real-Time Tourism Data in Nowcasting the Service Consumption in Taiwan," Journal of Applied Finance & Banking, SCIENPRESS Ltd, vol. 12(4), pages 1-4.
    18. Oestmann Marco & Bennöhr Lars, 2015. "Determinants of house price dynamics. What can we learn from search engine data?," Review of Economics, De Gruyter, vol. 66(1), pages 99-127, April.
    19. Sarun Kamolthip, 2021. "Macroeconomic Forecasting with LSTM and Mixed Frequency Time Series Data," PIER Discussion Papers 165, Puey Ungphakorn Institute for Economic Research.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:1712.03152. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.