IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v12y2021i1d10.1038_s41467-020-20206-z.html
   My bibliography  Save this article

Survey data and human computation for improved flu tracking

Author

Listed:
  • Stefan Wojcik

    (Twitter)

  • Avleen S. Bijral

    (Microsoft, One Microsoft Way)

  • Richard Johnston

    (Microsoft, One Microsoft Way)

  • Juan M. Lavista Ferres

    (Microsoft, One Microsoft Way)

  • Gary King

    (Harvard University)

  • Ryan Kennedy

    (University of Houston)

  • Alessandro Vespignani

    (Northeastern University)

  • David Lazer

    (Harvard University
    Northeastern University)

Abstract

While digital trace data from sources like search engines hold enormous potential for tracking and understanding human behavior, these streams of data lack information about the actual experiences of those individuals generating the data. Moreover, most current methods ignore or under-utilize human processing capabilities that allow humans to solve problems not yet solvable by computers (human computation). We demonstrate how behavioral research, linking digital and real-world behavior, along with human computation, can be utilized to improve the performance of studies using digital data streams. This study looks at the use of search data to track prevalence of Influenza-Like Illness (ILI). We build a behavioral model of flu search based on survey data linked to users’ online browsing data. We then utilize human computation for classifying search strings. Leveraging these resources, we construct a tracking model of ILI prevalence that outperforms strong historical benchmarks using only a limited stream of search data and lends itself to tracking ILI in smaller geographic units. While this paper only addresses searches related to ILI, the method we describe has potential for tracking a broad set of phenomena in near real-time.

Suggested Citation

  • Stefan Wojcik & Avleen S. Bijral & Richard Johnston & Juan M. Lavista Ferres & Gary King & Ryan Kennedy & Alessandro Vespignani & David Lazer, 2021. "Survey data and human computation for improved flu tracking," Nature Communications, Nature, vol. 12(1), pages 1-8, December.
  • Handle: RePEc:nat:natcom:v:12:y:2021:i:1:d:10.1038_s41467-020-20206-z
    DOI: 10.1038/s41467-020-20206-z
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-020-20206-z
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-020-20206-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:12:y:2021:i:1:d:10.1038_s41467-020-20206-z. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.