IDEAS home Printed from https://ideas.repec.org/p/dis/wpaper/dis2601.html

VEUCTOR : Training and Selecting Best Vector Space Models from Online Job Ads for European Countries

Author

Listed:
  • Emilio Colombo

  • Simone D'Amico

  • Fabio Mercorio

  • Mario Mezzanzanica

Abstract

Over the last decade, word embeddings have enabled machines to represent words and sentences as vectors, enabling researchers to reason on text for tasks like semantic similarity, contextual understanding, machine translation, etc. However, the synthesis of embeddings involves domain-specific parameters that affect semantic accuracy and contextual relevance, often leading to unpredictable biases and inconsistent comparisons. This issue is particularly relevant in labor market analysis, where different embeddings yield varying results, making the selection of the most appropriate model a key element. This paper addresses these challenges by (i) proposing a methodology to train, select, and align vector space models for a target taxonomy, ensuring comparability across dimensions and languages; (ii) applying this approach to 4.5 million job ads in 28 languages, aligning country-specific embeddings using the ESCO taxonomy; (iii) generating over 3,000 models over 142 machine days, making the best-performing ones publicly available via VEUCTOR; and (iv) showing how model choice significantly impacts labor market analysis, revealing substantial variations in occupational skill bundles across embeddings.

Suggested Citation

  • Emilio Colombo & Simone D'Amico & Fabio Mercorio & Mario Mezzanzanica, 2026. "VEUCTOR : Training and Selecting Best Vector Space Models from Online Job Ads for European Countries," DISEIS - Quaderni del Dipartimento di Economia internazionale, delle istituzioni e dello sviluppo dis2601, Università Cattolica del Sacro Cuore, Dipartimento di Economia internazionale, delle istituzioni e dello sviluppo (DISEIS).
  • Handle: RePEc:dis:wpaper:dis2601
    as

    Download full text from publisher

    File URL: http://dipartimenti.unicatt.it/diseis-wp_2601.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Gu, Ran & Zhong, Ling, 2023. "Effects of stay-at-home orders on skill requirements in vacancy postings," Labour Economics, Elsevier, vol. 82(C).
    2. Goldman, Matt & Kaplan, David M., 2018. "Comparing distributions by multiple testing across quantiles or CDF values," Journal of Econometrics, Elsevier, vol. 206(1), pages 143-166.
    3. Emilio Colombo & Alberto Marcato, 2023. "Skill demand and labour market concentration: evidence from Italian vacancies," International Journal of Manpower, Emerald Group Publishing Limited, vol. 44(9), pages 156-198, October.
    4. Arthur Turrell & Bradley Speigner & Jyldyz Djumalieva & David Copple & James Thurgood, 2018. "Using job vacancies to understand the effects of labour market mismatch on UK output and productivity," Bank of England working papers 737, Bank of England.
    5. Azar, José & Marinescu, Ioana & Steinbaum, Marshall & Taska, Bledi, 2020. "Concentration in US labor markets: Evidence from online vacancy data," Labour Economics, Elsevier, vol. 66(C).
    6. Alicia Sasser Modestino & Daniel Shoag & Joshua Ballance, 2020. "Upskilling: Do Employers Demand Greater Skill When Workers Are Plentiful?," The Review of Economics and Statistics, MIT Press, vol. 102(4), pages 793-805, October.
    7. Colombo, Emilio & Mercorio, Fabio & Mezzanzanica, Mario, 2019. "AI meets labor market: Exploring the link between automation and skills," Information Economics and Policy, Elsevier, vol. 47(C), pages 27-37.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Arendt, Lukasz & Gałecka-Burdziak, Ewa & Núñez, Fernando & Pater, Robert & Usabiaga, Carlos, 2023. "Skills requirements across task-content groups in Poland: What online job offers tell us," Technological Forecasting and Social Change, Elsevier, vol. 187(C).
    2. Maciej Berk{e}sewicz & Herman Cherniaiev & Robert Pater, 2021. "Estimating the number of entities with vacancies using administrative and online data," Papers 2106.03263, arXiv.org.
    3. Pham, Tho & Talavera, Oleksandr & Wu, Zhuangchen, 2023. "Labor markets during war time: Evidence from online job advertisements," Journal of Comparative Economics, Elsevier, vol. 51(4), pages 1316-1333.
    4. David Evans & Claire Mason & Haohui Chen & Andrew Reeson, 2024. "Accelerated demand for interpersonal skills in the Australian post-pandemic labour market," Nature Human Behaviour, Nature, vol. 8(1), pages 32-42, January.
    5. Hemelt, Steven W. & Hershbein, Brad & Martin, Shawn & Stange, Kevin M., 2023. "College majors and skills: Evidence from the universe of online job ads," Labour Economics, Elsevier, vol. 85(C).
    6. Angelica Bertucci & Emilio Colombo & Patrizio Tirelli, 2025. "Broadband Internet and Labour Market Dynamism," DISEIS - Quaderni del Dipartimento di Economia internazionale, delle istituzioni e dello sviluppo dis2506, Università Cattolica del Sacro Cuore, Dipartimento di Economia internazionale, delle istituzioni e dello sviluppo (DISEIS).
    7. Daly, Moira & Groes, Fane & Jensen, Mathias Fjællegaard, 2025. "Skill demand versus skill use: Comparing job posts with individual skill use on the job," Labour Economics, Elsevier, vol. 92(C).
    8. Tho Pham & Oleksandr Talavera & Zhuangchen Wu, 2023. "Labor Markets during War Time: Evidence from Online Job Ads," Discussion Papers 23-03, Department of Economics, University of Birmingham.
    9. Gregor Jarosch & Jan Sebastian Nimczik & Isaac Sorkin, 2024. "Granular Search, Market Structure, and Wages," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 91(6), pages 3569-3607.
    10. José Azar & Emiliano Huet & Ioana Marinescu & Bledi Taska & Till von, 2024. "Minimum Wage Employment Effects and Labour Market Concentration," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 91(4), pages 1843-1883.
    11. Grinis, Inna, 2017. "The STEM requirements of "non-STEM" jobs: evidence from UK online vacancy postings and implications for skills & knowledge shortages," LSE Research Online Documents on Economics 85123, London School of Economics and Political Science, LSE Library.
    12. Austan Goolsbee & Chad Syverson, 2023. "Monopsony Power in Higher Education: A Tale of Two Tracks," Journal of Labor Economics, University of Chicago Press, vol. 41(S1), pages 257-290.
    13. Andrew Glover & Jacob Short, 2020. "Demographic Origins of the Decline in Labor's Share," BIS Working Papers 874, Bank for International Settlements.
    14. David Deming & Lisa B. Kahn, 2018. "Skill Requirements across Firms and Labor Markets: Evidence from Job Postings for Professionals," Journal of Labor Economics, University of Chicago Press, vol. 36(S1), pages 337-369.
    15. Morgan Raux, 2019. "Looking for the "Best and Brightest": Hiring difficulties and high-skilled foreign workers," Working Papers halshs-02364921, HAL.
    16. Pérez, Jorge & Vial, Felipe & Zárate, Román, 2022. "Urban Transit Infrastructure: Spatial Mismatch and Labor Market Power," Research Department working papers 1992, CAF Development Bank Of Latinamerica.
    17. Karelin, Iliya & Kapelyuk, Sergey, 2023. "Digital Skills of Russian Citizens: Regional Differences," MPRA Paper 119494, University Library of Munich, Germany.
    18. Daniel Monte & Roberto Pinheiro, 2021. "Labor market competition over the business cycle," Economic Inquiry, Western Economic Association International, vol. 59(4), pages 1593-1615, October.
    19. Orley Ashenfelter & David Card & Henry Farber & Michael R. Ransom, 2022. "Monopsony in the Labor Market: New Empirical Results and New Public Policies," Journal of Human Resources, University of Wisconsin Press, vol. 57(S), pages 1-10.
    20. Chaklader, Barnali & Gupta, Brij B. & Panigrahi, Prabin Kumar, 2023. "Analyzing the progress of FINTECH-companies and their integration with new technologies for innovation and entrepreneurship," Journal of Business Research, Elsevier, vol. 161(C).

    More about this item

    JEL classification:

    • J63 - Labor and Demographic Economics - - Mobility, Unemployment, Vacancies, and Immigrant Workers - - - Turnover; Vacancies; Layoffs
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:dis:wpaper:dis2601. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Emilio Colombo (email available below). General contact details of provider: https://edirc.repec.org/data/dicatit.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.