IDEAS home Printed from https://ideas.repec.org/b/rsw/rswout/7-2de.html

Erhebung und Nutzung unstrukturierter Daten in den Sozial-, Verhaltens- und Wirtschaftswissenschaften [Collecting and Using Unstructured Data in the Social, Behavioral, and Economic Sciences]

Editor

Listed:
  • Rat für Sozial- und Wirtschaftsdaten RatSWD

Abstract

The increasing digitalization of everyday life over recent decades has given rise to a range of new data sources for the social, behavioral, and economic sciences. Prominent among these are unstructured data, which are characterized by not being available in a fixed data format and therefore cannot readily be processed for data analysis (e.g., Facebook texts, Instagram images, YouTube videos, Twitter messages). Using unstructured data involves specific challenges that arise precisely because the data are typically not collected in a controlled scientific study but instead often accrue in people's natural environment. Building on the results of an expert workshop, this report describes the specific challenges of collecting and using unstructured data and formulates recommendations. The recommendations follow the Total Error Framework and address data generation (definition of units of observation, coverage and sampling error, nonresponse and missing data error), data preparation (specification error, validity, measurement error, and content error), and data analysis (record linkage and processing error, modeling error, analytic error). The report closes with a discussion of open questions and challenges in research with unstructured data.

Suggested Citation

  • Rat für Sozial- und Wirtschaftsdaten RatSWD (ed.), 2023. "Erhebung und Nutzung unstrukturierter Daten in den Sozial-, Verhaltens- und Wirtschaftswissenschaften," RatSWD Output Series, German Data Forum (RatSWD), volume 7, number 7-2de.
  • Handle: RePEc:rsw:rswout:7-2de
    DOI: https://doi.org/10.17620/02671.73

    Download full text from publisher

    File URL: https://www.konsortswd.de/wp-content/uploads/RatSWD_Output2.7_Unstrukturierte-Daten_2023.pdf
    Download Restriction: no

    File URL: https://libkey.io/https://doi.org/10.17620/02671.73?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    References listed on IDEAS

    1. Victoria Stodden & Jennifer Seiler & Zhaokun Ma, 2018. "An empirical analysis of journal policy effectiveness for computational reproducibility," Proceedings of the National Academy of Sciences, Proceedings of the National Academy of Sciences, vol. 115(11), pages 2584-2589, March.
    2. Brady T West & Joseph W Sakshaug & Guy Alain S Aurelien, 2016. "How Big of a Problem is Analytic Error in Secondary Analyses of Survey Data?," PLOS ONE, Public Library of Science, vol. 11(6), pages 1-29, June.
    3. Uri Simonsohn & Joseph P. Simmons & Leif D. Nelson, 2020. "Specification curve analysis," Nature Human Behaviour, Nature, vol. 4(11), pages 1208-1214, November.
    4. Uri Simonsohn & Joseph P. Simmons & Leif D. Nelson, 2020. "Publisher Correction: Specification curve analysis," Nature Human Behaviour, Nature, vol. 4(11), pages 1215-1215, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Felix Holzmeister & Magnus Johannesson & Robert Böhm & Anna Dreber & Jürgen Huber & Michael Kirchler, 2023. "Heterogeneity in effect size estimates: Empirical evidence and practical implications," Working Papers 2023-17, Faculty of Economics and Statistics, Universität Innsbruck.
    2. Eibich, Peter & Goldzahl, Léontine, 2021. "Does retirement affect secondary preventive care use? Evidence from breast cancer screening," Economics & Human Biology, Elsevier, vol. 43(C).
    3. Rubin, Mark, 2023. "Type I error rates are not usually inflated," MetaArXiv 3kv2b, Center for Open Science.
    4. Dreber, Anna & Johannesson, Magnus, 2023. "A framework for evaluating reproducibility and replicability in economics," Ruhr Economic Papers 1055, RWI - Leibniz-Institut für Wirtschaftsforschung, Ruhr-University Bochum, TU Dortmund University, University of Duisburg-Essen.
    5. Dorison, Charles A & Lerner, Jennifer S & Heller, Blake H & Rothman, Alexander J & Kawachi, Ichiro I & Wang, Ke & Rees, Vaughan W & Gill, Brian P & Gibbs, Nancy & Ebersole, Charles R & Vally, Zahir & , 2022. "In COVID-19 health messaging, loss framing increases anxiety with little-to-no concomitant benefits : Experimental evidence from 84 countries," Other publications TiSEM 235f67b6-6be5-4061-8693-3, Tilburg University, School of Economics and Management.
    6. Gretton, Jeremy & Roemer, Tobias & Schlüter, Elmar, 2024. "Replication of Hamel & Wilcox-Archuleta (2022): "Black Workers in White Places: Daytime Racial Diversity and White Public Opinion"," I4R Discussion Paper Series 61, The Institute for Replication (I4R), revised 2024.
    7. Fieberg, Christian & Günther, Steffen & Poddig, Thorsten & Zaremba, Adam, 2024. "Non-standard errors in the cryptocurrency world," International Review of Financial Analysis, Elsevier, vol. 92(C).
    8. Helmers, Viola & van der Werf, Edwin, 2022. "Did the German Aviation Tax Affect Passenger Numbers? New Evidence Employing Difference-in-differences," VfS Annual Conference 2022 (Basel): Big Data in Economics 264118, Verein für Socialpolitik / German Economic Association.
    9. Huber, Christoph & Kirchler, Michael, 2023. "Experiments in finance: A survey of historical trends," Journal of Behavioral and Experimental Finance, Elsevier, vol. 37(C).
    10. Slichter, David & Tran, Nhan, 2023. "Do better journals publish better estimates?," MPRA Paper 118433, University Library of Munich, Germany.
    11. Bachler, Sebastian & Erhart, Andrea & Holzknecht, Armando, 2023. "Replication Report on Altmann et al. (2022)," I4R Discussion Paper Series 43, The Institute for Replication (I4R).
    12. Nikolova, Milena & Cnossen, Femke & Nikolaev, Boris, 2022. "Robots, Meaning, and Self-Determination," GLO Discussion Paper Series 1191, Global Labor Organization (GLO).
    13. Cohn, Jonathan B. & Liu, Zack & Wardlaw, Malcolm I., 2022. "Count (and count-like) data in finance," Journal of Financial Economics, Elsevier, vol. 146(2), pages 529-551.
    14. Zhe-Fei Mao & Qi-Wei Li & Yi-Ming Wang & Jie Zhou, 2024. "Pro-religion attitude predicts lower vaccination coverage at country level," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-9, December.
    15. Schweinsberg, Martin & Feldman, Michael & Staub, Nicola & van den Akker, Olmo R. & van Aert, Robbie C.M. & van Assen, Marcel A.L.M. & Liu, Yang & Althoff, Tim & Heer, Jeffrey & Kale, Alex & Mohamed, Z, 2021. "Same data, different conclusions: Radical dispersion in empirical results when independent analysts operationalize and test the same hypothesis," Organizational Behavior and Human Decision Processes, Elsevier, vol. 165(C), pages 228-249.
    16. Guillaume Coqueret, 2023. "Forking paths in financial economics," Papers 2401.08606, arXiv.org.
    17. Alan D. Crane & Andrew Koch & Leming Lin, 2024. "Real Effects of Markets on Politics: Evidence from US Presidential Elections," American Economic Review: Insights, American Economic Association, vol. 6(1), pages 73-88, March.
    18. Christoph Semken & David Rossell, 2022. "Specification analysis for technology use and teenager well‐being: Statistical validity and a Bayesian proposal," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), pages 1330-1355, November.
    19. Brian Gill & others, "undated". "In COVID-19 Health Messaging, Loss Framing Increases Anxiety with Little-to-No Concomitant Benefits: Experimental Evidence from 84 Countries," Mathematica Policy Research Reports ac30d0619fd64793b2e1b108d, Mathematica Policy Research.
    20. Bensch, Gunther & Ankel-Peters, Jörg & Vance, Colin, 2023. "Spotlight on Researcher Decisions – Infrastructure Evaluation, Instrumental Variables, and Specification Screening," VfS Annual Conference 2023 (Regensburg): Growth and the "sociale Frage" 277703, Verein für Socialpolitik / German Economic Association.

