IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0138935.html
   My bibliography  Save this article

Human Rights Texts: Converting Human Rights Primary Source Documents into Data

Author

Listed:
  • Christopher J Fariss
  • Fridolin J Linder
  • Zachary M Jones
  • Charles D Crabtree
  • Megan A Biek
  • Ana-Sophia M Ross
  • Taranamol Kaur
  • Michael Tsai

Abstract

We introduce and make publicly available a large corpus of digitized primary source human rights documents which are published annually by monitoring agencies that include Amnesty International, Human Rights Watch, the Lawyers Committee for Human Rights, and the United States Department of State. In addition to the digitized text, we also make available and describe document-term matrices, which are datasets that systematically organize the word counts from each unique document by each unique term within the corpus of human rights documents. To contextualize the importance of this corpus, we describe the development of coding procedures in the human rights community and several existing categorical indicators that have been created by human coding of the human rights documents contained in the corpus. We then discuss how the new human rights corpus and the existing human rights datasets can be used with a variety of statistical analyses and machine learning algorithms to help scholars understand how human rights practices and reporting have evolved over time. We close with a discussion of our plans for dataset maintenance, updating, and availability.

Suggested Citation

  • Christopher J Fariss & Fridolin J Linder & Zachary M Jones & Charles D Crabtree & Megan A Biek & Ana-Sophia M Ross & Taranamol Kaur & Michael Tsai, 2015. "Human Rights Texts: Converting Human Rights Primary Source Documents into Data," PLOS ONE, Public Library of Science, vol. 10(9), pages 1-19, September.
  • Handle: RePEc:plo:pone00:0138935
    DOI: 10.1371/journal.pone.0138935
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0138935
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0138935&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0138935?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Fariss, Christopher J., 2014. "Respect for Human Rights has Improved Over Time: Modeling the Changing Standard of Accountability," American Political Science Review, Cambridge University Press, vol. 108(2), pages 297-318, May.
    2. Grimmer, Justin & Stewart, Brandon M., 2013. "Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts," Political Analysis, Cambridge University Press, vol. 21(3), pages 267-297, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Sara Kahn-Nisser, 2019. "When the targets are members and donors: Analyzing inter-governmental organizations’ human rights shaming," The Review of International Organizations, Springer, vol. 14(3), pages 431-451, September.
    2. Ping-Yu Hsu & Hong-Tsuen Lei & Shih-Hsiang Huang & Teng Hao Liao & Yao-Chung Lo & Chin-Chun Lo, 2019. "Effects of sentiment on recommendations in social network," Electronic Markets, Springer;IIM University of St. Gallen, vol. 29(2), pages 253-262, June.
    3. Justin Key Canfil, 2024. "Until consensus: Introducing the International Cyber Expression dataset," Journal of Peace Research, Peace Research Institute Oslo, vol. 61(1), pages 150-159, January.
    4. Yanto Chandra & Li Crystal Jiang & Cheng-Jun Wang, 2016. "Mining Social Entrepreneurship Strategies Using Topic Modeling," PLOS ONE, Public Library of Science, vol. 11(3), pages 1-28, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bernhardt, Lea & Dewenter, Ralf & Thomas, Tobias, 2023. "Measuring partisan media bias in US newscasts from 2001 to 2012," European Journal of Political Economy, Elsevier, vol. 78(C).
    2. Dreher, Axel & Fuchs, Andreas & Langlotz, Sarah, 2019. "The effects of foreign aid on refugee flows," European Economic Review, Elsevier, vol. 112(C), pages 127-147.
    3. Rauh, Christian, 2015. "Communicating supranational governance? The salience of EU affairs in the German Bundestag, 1991–2013," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 16(1), pages 116-138.
    4. Julia Seiermann, 2018. "Only Words? How Power in Trade Agreement Texts Affects International Trade Flows," UNCTAD Blue Series Papers 80, United Nations Conference on Trade and Development.
    5. Arthur Dyevre & Nicolas Lampach, 2021. "Issue attention on international courts: Evidence from the European Court of Justice," The Review of International Organizations, Springer, vol. 16(4), pages 793-815, October.
    6. Dewenter, Ralf & Dulleck, Uwe & Thomas, Tobias, 2018. "The political coverage index and its application to government capture," Research Papers 6, EcoAustria – Institute for Economic Research.
    7. Pastwa, Anna M. & Shrestha, Prabal & Thewissen, James & Torsin, Wouter, 2021. "Unpacking the black box of ICO white papers: a topic modeling approach," LIDAM Discussion Papers LFIN 2021018, Université catholique de Louvain, Louvain Finance (LFIN).
    8. Maksym Polyakov & Morteza Chalak & Md. Sayed Iftekhar & Ram Pandit & Sorada Tapsuwan & Fan Zhang & Chunbo Ma, 2018. "Authorship, Collaboration, Topics, and Research Gaps in Environmental and Resource Economics 1991–2015," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 71(1), pages 217-239, September.
    9. Milena Djourelova & Ruben Durante, 2019. "Media attention and strategic timing in politics: Evidence from U.S. presidential executive orders," Economics Working Papers 1675, Department of Economics and Business, Universitat Pompeu Fabra.
    10. Jule Krüger & Ragnhild Nordås, 2020. "A latent variable approach to measuring wartime sexual violence," Journal of Peace Research, Peace Research Institute Oslo, vol. 57(6), pages 728-739, November.
    11. Mohamed M. Mostafa, 2023. "A one-hundred-year structural topic modeling analysis of the knowledge structure of international management research," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(4), pages 3905-3935, August.
    12. Escriba-Folch, Abel & Meseguer, Covadonga & Wright, Joseph, 2018. "Remittances and protest in dictatorships," LSE Research Online Documents on Economics 89058, London School of Economics and Political Science, LSE Library.
    13. Erkan Işığıçok & Sadullah Çelik & Dilek Özdemir Yılmaz, 2023. "Analysis of Skills and Qualifications Required in Data Scientist Job Postings Based on the Pareto Analysis Perspective Using Text Mining," EKOIST Journal of Econometrics and Statistics, Istanbul University, Faculty of Economics, vol. 0(39), pages 10-25, December.
    14. Kimberly R Frugé, 2019. "Repressive agent defections: How power, costs, and uncertainty influence military behavior and state repression," Conflict Management and Peace Science, Peace Science Society (International), vol. 36(6), pages 591-607, November.
    15. Yuting Chen & Don Bredin & Valerio Potì & Roman Matkovskyy, 2022. "COVID risk narratives: a computational linguistic approach to the econometric identification of narrative risk during a pandemic," Digital Finance, Springer, vol. 4(1), pages 17-61, March.
    16. Purwoko Haryadi Santoso & Edi Istiyono & Haryanto & Wahyu Hidayatulloh, 2022. "Thematic Analysis of Indonesian Physics Education Research Literature Using Machine Learning," Data, MDPI, vol. 7(11), pages 1-41, October.
    17. Bjørnskov, Christian & Pfaff, Katharina, 2021. "Differences matter: The effect of coup types on physical integrity rights," European Journal of Political Economy, Elsevier, vol. 69(C).
    18. Markus Eberhardt & Giovanni Facchini & Valeria Rueda, 2023. "Gender Differences in Reference Letters: Evidence from the Economics Job Market," The Economic Journal, Royal Economic Society, vol. 133(655), pages 2676-2708.
    19. Thorin M. Wright, 2020. "Revisionist Conflict and State Repression," International Area Studies Review, Center for International Area Studies, Hankuk University of Foreign Studies, vol. 23(1), pages 49-72, March.
    20. Chen, Naiwei & Yu, Min-Teh, 2024. "Human rights and value of cash: Evidence from Islamic and non-Islamic countries," Pacific-Basin Finance Journal, Elsevier, vol. 86(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0138935. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.