IDEAS home Printed from https://ideas.repec.org/a/taf/applec/v53y2021i52p5989-6001.html
   My bibliography  Save this article

Evaluation of technology clubs by clustering: a cautionary note

Author

Listed:
  • Antonio Rodríguez Andrés
  • Voxi Heinrich S. Amavilah
  • Abraham Otero

Abstract

Applications of machine learning techniques to economic problems are increasing. These are powerful techniques with great potential to extract insights from economic data. However, care must be taken to apply them correctly, or the wrong conclusions may be drawn. In the technology clubs literature, after applying a clustering algorithm, some authors train a supervised machine learning technique, such as a decision tree or a neural network, to predict the label of the clusters. Then, they use some performance metric (typically, accuracy) of that prediction as a measure of the quality of the clustering configuration they have found. This is an error with potential negative implications for policy, because obtaining a high accuracy in such a prediction does not mean that the clustering configuration found is correct. This paper explains in detail why this modus operandi is not sound from theoretical point of view and uses computer simulations to demonstrate it. We caution policy and indicate the direction for future investigations.

Suggested Citation

  • Antonio Rodríguez Andrés & Voxi Heinrich S. Amavilah & Abraham Otero, 2021. "Evaluation of technology clubs by clustering: a cautionary note," Applied Economics, Taylor & Francis Journals, vol. 53(52), pages 5989-6001, November.
  • Handle: RePEc:taf:applec:v:53:y:2021:i:52:p:5989-6001
    DOI: 10.1080/00036846.2021.1934393
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/00036846.2021.1934393
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/00036846.2021.1934393?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

    Other versions of this item:

    References listed on IDEAS

    as
    1. Durlauf, Steven N & Johnson, Paul A, 1995. "Multiple Regimes and Cross-Country Growth Behaviour," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 10(4), pages 365-384, Oct.-Dec..
    2. Ellis Scharfenaker, Markus P.A. Schneider, 2019. "Labor Market Segmentation and the Distribution of Income: New Evidence from Internal Census Bureau Data," Working Paper Series, Department of Economics, University of Utah 2019_08, University of Utah, Department of Economics.
    3. Patrick Bajari & Christina Dalton & Han Hong & Ahmed Khwaja, 2014. "Moral hazard, adverse selection, and health expenditures: A semiparametric analysis," RAND Journal of Economics, RAND Corporation, vol. 45(4), pages 747-763, December.
    4. Saba, Charles Shaaba & David, Oladipo Olalekan, 2020. "Convergence patterns in global ICT: Fresh insights from a club clustering algorithm," Telecommunications Policy, Elsevier, vol. 44(10).
    5. Castellacci, Fulvio, 2008. "Technology clubs, technology gaps and growth trajectories," Structural Change and Economic Dynamics, Elsevier, vol. 19(4), pages 301-314, December.
    6. Ashesh Rambachan & Jon Kleinberg & Jens Ludwig & Sendhil Mullainathan, 2020. "An Economic Perspective on Algorithmic Fairness," AEA Papers and Proceedings, American Economic Association, vol. 110, pages 91-95, May.
    7. Susan C. Athey & Kevin A. Bryan & Joshua S. Gans, 2020. "The Allocation of Decision Authority to Human and Artificial Intelligence," AEA Papers and Proceedings, American Economic Association, vol. 110, pages 80-84, May.
    8. Castellacci, Fulvio & Archibugi, Daniele, 2008. "The technology clubs: The distribution of knowledge across nations," Research Policy, Elsevier, vol. 37(10), pages 1659-1673, December.
    9. Steffen Q. Mueller, 2020. "Pre- and within-season attendance forecasting in Major League Baseball: a random forest approach," Applied Economics, Taylor & Francis Journals, vol. 52(41), pages 4512-4528, September.
    10. Janet Currie & Henrik Kleven & Esmée Zwiers, 2020. "Technology and Big Data Are Changing Economics: Mining Text to Track Methods," AEA Papers and Proceedings, American Economic Association, vol. 110, pages 42-48, May.
    11. Athey, Susan & Imbens, Guido W., 2019. "Machine Learning Methods Economists Should Know About," Research Papers 3776, Stanford University, Graduate School of Business.
    12. Jessica Clement, 2020. "Social protection clusters in sub‐Saharan Africa," International Journal of Social Welfare, John Wiley & Sons, vol. 29(1), pages 20-28, January.
    13. Bo Cowgill & Megan T. Stevenson, 2020. "Algorithmic Social Engineering," AEA Papers and Proceedings, American Economic Association, vol. 110, pages 96-100, May.
    14. Fagerberg, Jan & Srholec, Martin & Knell, Mark, 2007. "The Competitiveness of Nations: Why Some Countries Prosper While Others Fall Behind," World Development, Elsevier, vol. 35(10), pages 1595-1620, October.
    15. Susan Athey & Guido W. Imbens, 2019. "Machine Learning Methods That Economists Should Know About," Annual Review of Economics, Annual Reviews, vol. 11(1), pages 685-725, August.
    16. Ricardo Fraiman & Badih Ghattas & Marcela Svarc, 2013. "Interpretable clustering using unsupervised binary trees," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 7(2), pages 125-145, June.
    17. Yan Liu & Tian Xie, 2019. "Machine learning versus econometrics: prediction of box office," Applied Economics Letters, Taylor & Francis Journals, vol. 26(2), pages 124-130, January.
    18. Fulvio Castellacci, 2011. "Closing the Technology Gap?," Review of Development Economics, Wiley Blackwell, vol. 15(1), pages 180-197, February.
    19. Nalan Baştürk & Richard Paap & Dick van Dijk, 2012. "Structural differences in economic growth: an endogenous clustering approach," Applied Economics, Taylor & Francis Journals, vol. 44(1), pages 119-134, January.
    20. Murray Wolfson & Zagros Madjd-Sadjadi & Patrick James, 2004. "Identifying National Types: A Cluster Analysis of Politics, Economics, and Conflict," Journal of Peace Research, Peace Research Institute Oslo, vol. 41(5), pages 607-623, September.
    21. Ahlquist, John S. & Breunig, Christian, 2012. "Model-based Clustering and Typologies in the Social Sciences," Political Analysis, Cambridge University Press, vol. 20(1), pages 92-112, January.
    22. Aaron Kreiner & John Duca, 2020. "Can machine learning on economic data better forecast the unemployment rate?," Applied Economics Letters, Taylor & Francis Journals, vol. 27(17), pages 1434-1437, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Walheer, Barnabé, 2023. "Meta-frontier and technology switchers: A nonparametric approach," European Journal of Operational Research, Elsevier, vol. 305(1), pages 463-474.
    2. Stöllinger, Roman, 2013. "International spillovers in a world of technology clubs," Structural Change and Economic Dynamics, Elsevier, vol. 27(C), pages 19-35.
    3. Fulvio Castellacci & José Miguel Natera, 2011. "A new panel dataset for cross-country analyses of national systems, growth and development (CANA)," Working Papers del Instituto Complutense de Estudios Internacionales 1105, Universidad Complutense de Madrid, Instituto Complutense de Estudios Internacionales.
    4. Walheer, Barnabé, 2021. "Labor productivity and technology heterogeneity," Journal of Macroeconomics, Elsevier, vol. 68(C).
    5. Fulvio Castellacci & Bart Los & Gaaitzen Vries, 2014. "Sectoral productivity trends: convergence islands in oceans of non-convergence," Journal of Evolutionary Economics, Springer, vol. 24(5), pages 983-1007, November.
    6. Castellacci, Fulvio & Natera, Jose Miguel, 2013. "The dynamics of national innovation systems: A panel cointegration analysis of the coevolution between innovative capability and absorptive capacity," Research Policy, Elsevier, vol. 42(3), pages 579-594.
    7. Charles Shaaba Saba & Oladipo Olalekan David, 2023. "Identifying Convergence in Telecommunication Infrastructures and the Dynamics of Their Influencing Factors Across Countries," Journal of the Knowledge Economy, Springer;Portland International Center for Management of Engineering and Technology (PICMET), vol. 14(2), pages 1413-1466, June.
    8. Antonio Rodríguez Andrés & Abraham Otero & Voxi Heinrich Amavilah, 2022. "Knowledge economy classification in African countries: A model-based clustering approach," Information Technology for Development, Taylor & Francis Journals, vol. 28(2), pages 372-396, April.
    9. repec:gdk:wpaper:18 is not listed on IDEAS
    10. José Afonso Mendes & Sandra T. Silva & Ester G. Silva, 2014. "Portuguese economic growth revisited: a technology-gap explanation," FEP Working Papers 545, Universidade do Porto, Faculdade de Economia do Porto.
    11. Areti Gkypali & Kostas Kounetas & Kostas Tsekouras, 2019. "European countries’ competitiveness and productive performance evolution: unraveling the complexity in a heterogeneity context," Journal of Evolutionary Economics, Springer, vol. 29(2), pages 665-695, April.
    12. CATTARUZZO Sebastiano, 2020. "On R&D sectoral intensities and convergence clubs," JRC Working Papers on Corporate R&D and Innovation 2020-01, Joint Research Centre.
    13. Rath, Badri Narayan & Panda, Bibhudutta & Akram, Vaseem, 2023. "Convergence and determinants of ICT development in case of emerging market economies," Telecommunications Policy, Elsevier, vol. 47(2).
    14. Ballestar, María Teresa & García-Lazaro, Aida & Sainz, Jorge & Sanz, Ismael, 2022. "Why is your company not robotic? The technology and human capital needed by firms to become robotic," Journal of Business Research, Elsevier, vol. 142(C), pages 328-343.
    15. Voxi Heinrich Amavilah & Antonio Rodriguez Andres, 2022. "Knowledge Economy and the Economic Performance of African Countries: A Seemingly Unrelated and Recursive Approach," Working Papers 57, The German University in Cairo, Faculty of Management Technology.
    16. Filippetti, Andrea & Peyrache, Antonio, 2010. "The Dynamic of Technological Capabilities of Countries: A Dual Approach Using Composite Indicators & Data Envelopment Analysis," MPRA Paper 21629, University Library of Munich, Germany.
    17. Natera, Jose Miguel & Pansera, Mario, 2013. "How Innovation Systems and Development Theories complement each other," MPRA Paper 53633, University Library of Munich, Germany.
    18. Rath, Badri Narayan, 2016. "Does the digital divide across countries lead to convergence? New international evidence," Economic Modelling, Elsevier, vol. 58(C), pages 75-82.
    19. Lee, Keun & Lee, Jongho & Lee, Juneyoung, 2021. "Variety of national innovation systems (NIS) and alternative pathways to growth beyond the middle-income stage: Balanced, imbalanced, catching-up, and trapped NIS," World Development, Elsevier, vol. 144(C).
    20. Sophie-Charlotte Klose & Johannes Lederer, 2020. "A Pipeline for Variable Selection and False Discovery Rate Control With an Application in Labor Economics," Papers 2006.12296, arXiv.org, revised Jun 2020.
    21. Kyle Colangelo & Ying-Ying Lee, 2019. "Double debiased machine learning nonparametric inference with continuous treatments," CeMMAP working papers CWP72/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.

    More about this item

    JEL classification:

    • C45 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Neural Networks and Related Topics
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods
    • O38 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - Government Policy
    • O57 - Economic Development, Innovation, Technological Change, and Growth - - Economywide Country Studies - - - Comparative Studies of Countries
    • P41 - Political Economy and Comparative Economic Systems - - Other Economic Systems - - - Planning, Coordination, and Reform

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:applec:v:53:y:2021:i:52:p:5989-6001. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/RAEC20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.