IDEAS home Printed from
   My bibliography  Save this paper

Improving transparency of the Colombian Peace Treaty with NLP


  • Francisco Barreras


  • Mónica Ribero


  • Felipe Suárez



Factorization methods and probabilistic models provide useful ways to represent text that can capture properties like sentence relevance, topics in text and even semantic similarity. In general, methods that yield a low-dimensional representation of large volumes of text have become more important and have gained attention in a diversity of fields since they have the potential of assisting in the understanding of vast ammounts of information. The Colombian Peace treaty documented an exhaustive list of agreements between the FARC guerilla and Colombian Armed Forces across six main sections spanning 297 pages. Surprisingly, the final version of the treaty was made public only for 40 days before Colombians had to vote for its approval in a plebiscite. Given that most of the general population probably would not read the document and the political environment (including the media) was highly polarized, there was a growing need for an unbiased and practical way to review the document before the vote. This paper describes the technical details behind the implementation of a tool that analyses the 2016 Colombian peace treaty. By combining Natural Language Processing techniques we were able to provide a web-service that helped increase transparency and unbiased reviewing of each section of the peace treaty.

Suggested Citation

  • Francisco Barreras & Mónica Ribero & Felipe Suárez, 2017. "Improving transparency of the Colombian Peace Treaty with NLP," DOCUMENTOS DE TRABAJO QUANTIL 015402, QUANTIL.
  • Handle: RePEc:col:000508:015402

    Download full text from publisher

    File URL:
    Download Restriction: no

    References listed on IDEAS

    1. Li, Baibing & Martin, Elaine B. & Morris, A. Julian, 2002. "On principal component analysis in L1," Computational Statistics & Data Analysis, Elsevier, vol. 40(3), pages 471-474, September.
    Full references (including those not matched with items on IDEAS)

    More about this item


    GloVe; Latent Dirichlet Allocation; Natural Language Processing; NMF; Peace Treaty.;

    NEP fields

    This paper has been announced in the following NEP Reports:


    Access and download statistics


    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:col:000508:015402. See general information about how to correct material in RePEc.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Natalia Iregui). General contact details of provider: .

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service hosted by the Research Division of the Federal Reserve Bank of St. Louis . RePEc uses bibliographic data supplied by the respective publishers.