IDEAS home Printed from https://ideas.repec.org/a/aea/jecper/v28y2014i2p3-28.html
   My bibliography  Save this article

Big Data: New Tricks for Econometrics

Author

Listed:
  • Hal R. Varian

Abstract

Computers are now involved in many economic transactions and can capture data associated with these transactions, which can then be manipulated and analyzed. Conventional statistical and econometric techniques such as regression often work well, but there are issues unique to big datasets that may require different tools. First, the sheer size of the data involved may require more powerful data manipulation tools. Second, we may have more potential predictors than appropriate for estimation, so we need to do some kind of variable selection. Third, large datasets may allow for more flexible relationships than simple linear models. Machine learning techniques such as decision trees, support vector machines, neural nets, deep learning, and so on may allow for more effective ways to model complex relationships. In this essay, I will describe a few of these tools for manipulating and analyzing big data. I believe that these methods have a lot to offer and should be more widely known and used by economists.

Suggested Citation

  • Hal R. Varian, 2014. "Big Data: New Tricks for Econometrics," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 3-28, Spring.
  • Handle: RePEc:aea:jecper:v:28:y:2014:i:2:p:3-28
    Note: DOI: 10.1257/jep.28.2.3
    as

    Download full text from publisher

    File URL: http://www.aeaweb.org/articles.php?doi=10.1257/jep.28.2.3
    Download Restriction: no

    File URL: http://www.aeaweb.org/jep/ds/2802/2802-0003_ds.zip
    Download Restriction: no

    File URL: http://www.aeaweb.org/jep/data/2802/2802-0003_data.zip
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Hyunyoung Choi & Hal Varian, 2012. "Predicting the Present with Google Trends," The Economic Record, The Economic Society of Australia, vol. 88(s1), pages 2-9, June.
    2. Joshua D. Angrist & Alan B. Krueger, 2001. "Instrumental Variables and the Search for Identification: From Supply and Demand to Natural Experiments," Journal of Economic Perspectives, American Economic Association, vol. 15(4), pages 69-85, Fall.
    3. Joshua Angrist & Alan Krueger, 2001. "Instrumental Variables and the Search for Identification: From Supply and Demand to Natural Experiments," Working Papers 834, Princeton University, Department of Economics, Industrial Relations Section..
    4. Concha Artola & Enrique Galán, 2012. "Tracking the future on the web: construction of leading indicators using internet searches," Occasional Papers 1203, Banco de España.
    5. Jennifer L. Castle & Xiaochuan Qin & W. Robert Reed, 2009. "How To Pick The Best Regression Equation: A Review And Comparison Of Model Selection Algorithms," Working Papers in Economics 09/13, University of Canterbury, Department of Economics and Finance.
    6. McLaren, Nick & Shanbhogue, Rachana, 2011. "Using internet search data as economic indicators," Bank of England Quarterly Bulletin, Bank of England, vol. 51(2), pages 134-140.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dean Fantazzini & Julia Pushchelenko & Alexey Mironenkov & Alexey Kurbatskii, 2021. "Forecasting Internal Migration in Russia Using Google Trends: Evidence from Moscow and Saint Petersburg," Forecasting, MDPI, vol. 3(4), pages 1-30, October.
    2. Javier Sebastian, 2016. "Blockchain in financial services: Regulatory landscape and future challenges," Working Papers 16/21, BBVA Bank, Economic Research Department.
    3. D’Amuri, Francesco & Marcucci, Juri, 2017. "The predictive power of Google searches in forecasting US unemployment," International Journal of Forecasting, Elsevier, vol. 33(4), pages 801-816.
    4. Matthew Gentzkow & Bryan T. Kelly & Matt Taddy, 2017. "Text as Data," NBER Working Papers 23276, National Bureau of Economic Research, Inc.
    5. Chiara L. Comolli & Daniele Vignoli, 2019. "Spread-ing uncertainty, shrinking birth rates," Econometrics Working Papers Archive 2019_08, Universita' degli Studi di Firenze, Dipartimento di Statistica, Informatica, Applicazioni "G. Parenti".
    6. Mark Carlson & Kris James Mitchener, 2009. "Branch Banking as a Device for Discipline: Competition and Bank Survivorship during the Great Depression," Journal of Political Economy, University of Chicago Press, vol. 117(2), pages 165-210, April.
    7. Mioara, POPESCU, 2015. "Construction Of Economic Indicators Using Internet Searches," Annals of Spiru Haret University, Economic Series, Universitatea Spiru Haret, vol. 6(1), pages 25-31.
    8. Ilona Babenko & Benjamin Bennett & John M Bizjak & Jeffrey L Coles & Jason J Sandvik, 2023. "Clawback Provisions and Firm Risk," The Review of Corporate Finance Studies, Society for Financial Studies, vol. 12(2), pages 191-239.
    9. Henrekson, Magnus & Johansson, Dan, 2010. "Firm Growth, Institutions and Structural Transformation," Ratio Working Papers 150, The Ratio Institute.
    10. Ma, Lingjie & Koenker, Roger, 2006. "Quantile regression methods for recursive structural equation models," Journal of Econometrics, Elsevier, vol. 134(2), pages 471-506, October.
    11. KAMKOUM, Arnaud Cedric, 2023. "The Federal Reserve’s Response to the Global Financial Crisis and its Effects: An Interrupted Time-Series Analysis of the Impact of its Quantitative Easing Programs," Thesis Commons d7pvg, Center for Open Science.
    12. Wang, Xu & Zhang, Xiaobo & Xie, Zhuan & Huang, Yiping, 2016. "Roads to innovation: Firm-level evidence from China:," IFPRI discussion papers 1542, International Food Policy Research Institute (IFPRI).
    13. Olivier Bargain & Victor Stephane & Jérôme Valette, 2022. "Another brick in the wall. Immigration and electoral preferences: Direct evidence from state ballots," Review of International Economics, Wiley Blackwell, vol. 30(5), pages 1452-1477, November.
    14. Messer, Dolores & Wolter, Stefan C., 2005. "Are Student Exchange Programs Worth It?," IZA Discussion Papers 1656, Institute of Labor Economics (IZA).
    15. Aaron Jackson & William Miles, 2008. "Fixed Exchange Rates and Disinflation in Emerging Markets: How Large Is the Effect?," Review of World Economics (Weltwirtschaftliches Archiv), Springer;Institut für Weltwirtschaft (Kiel Institute for the World Economy), vol. 144(3), pages 538-557, October.
    16. Berthélemy Michel & Bonev Petyo & Dussaux Damien & Söderberg Magnus, 2019. "Methods for strengthening a weak instrument in the case of a persistent treatment," Studies in Nonlinear Dynamics & Econometrics, De Gruyter, vol. 23(1), pages 1-30, February.
    17. Fali Huang & Myoung-Jae Lee, 2010. "Dynamic treatment effect analysis of TV effects on child cognitive development," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 25(3), pages 392-419.
    18. Asadul Islam & Dietrich K. Fausten, 2008. "Skilled Immigration and Wages in Australia," The Economic Record, The Economic Society of Australia, vol. 84(s1), pages 66-82, September.
    19. Richard Harris & John Moffat, 2011. "R&D, Innovation and Exporting," SERC Discussion Papers 0073, Centre for Economic Performance, LSE.
    20. Tesfaye, Wondimagegn & Tirivayi, Nyasha, 2020. "Crop diversity, household welfare and consumption smoothing under risk: Evidence from rural Uganda," World Development, Elsevier, vol. 125(C).

    More about this item

    JEL classification:

    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:aea:jecper:v:28:y:2014:i:2:p:3-28. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Michael P. Albert (email available below). General contact details of provider: https://edirc.repec.org/data/aeaaaea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.