IDEAS home Printed from https://ideas.repec.org/p/col/000547/022180.html

Sesgo de datos en aplicaciones de aprendizaje automático: un estudio de caso de un modelo no supervisado para identificar el riesgo de corrupción en la contratación pública colombiana

Author

Listed:
  • Kevin Mojica

    (School of Government, Universidad de los Andes)

Abstract

This study analyzes data bias in an unsupervised learning algorithm designed to identify the risk of corruption in public procurement of Colombia. The employed algorithm is a two-stage clustering model used to segment electronic contracts based on variables indicating corruption risk. The objective was to develop an early warning tool for corruption in the Programa de Alimentación Escolar (PAE) procurement, utilizing data from the Sistema Electrónico para la Contratación Pública (SECOP). Although the results demonstrate the potential of artificial intelligence algorithms for detecting corruption risks, they also reveal significant limitations in their practical implementation, attributable to data availability and quality deficiencies. Specifically, biases of representation, measurement, and omitted variables were identified, affecting the algorithm's reliability. The study provides a detailed analysis of these biases, assessing their impact on the algorithm's performance, and emphasizes the importance of recognizing and addressing biases during the development of such initiatives. Finally, recommendations are presented to improve the quality of data in SECOP, aiming to enhance the reliability and accuracy of these algorithms in future developments.

Suggested Citation

  • Kevin Mojica, 2025. "Sesgo de datos en aplicaciones de aprendizaje automático: un estudio de caso de un modelo no supervisado para identificar el riesgo de corrupción en la contratación pública colombiana," Documentos de trabajo 022180, Escuela de Gobierno - Universidad de los Andes.
  • Handle: RePEc:col:000547:022180
    as

    Download full text from publisher

    File URL: https://gobierno.uniandes.edu.co/documento-de-trabajo-no-120/
    Download Restriction: None

    File URL: https://gobierno.uniandes.edu.co/wp-content/uploads/DT_120.pdf
    Download Restriction: None
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:col:000547:022180. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Alejandra Rojas Forero (email available below). General contact details of provider: https://edirc.repec.org/data/egandco.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.