IDEAS home Printed from https://ideas.repec.org/a/bla/labour/v40y2026i2p131-161.html

Analyzing Non‐Random Selectivity in Online Job Advertisements Using Eurostat Benchmark Data and Generalized Sample Selection Models: An Application to EU Regional Labor Markets

Author

Listed:
  • Pietro Giorgio Lovaglio
  • Mario Mezzanzanica

Abstract

The present paper provides an overall framework to afford the problem of non‐representativeness and non‐random selectivity arising from online job ads data, using Generalized sample selection models and Eurostat benchmark data. We jointly model the outcome intensity (number of online job ads in observed profiles, whose levels are defined by auxiliary variables) and the probability of endogenous selection (likelihood that online job ads are not missing in a given profile), allowing us to model the missing data mechanism without the need of a priori justification of missingness at random, as generally supposed by multilevel regression and post‐stratification, a popular benchmark technique in this field. Moreover, we offer new post‐stratification strategies to calibrate the unconditional predictions on benchmark/reference samples. We use data from the Cedefop's Skill Ovate platform collecting online job advertisements for all EU regions in 2022 and an Italian web‐platform during 2013Q2‐2018Q2, whereas as reference samples, aggregated LFS recent job starters and LFS new hires from microdata that represent reasonable lower bounds for job advertisements. Online job ads present a strong overrepresentation with respect to benchmark data (+40% with respect to LFS recent job starters and +400% over new hires from LFS microdata), whereas generalized sample selection models reduced this bias by half, unlike Multilevel post‐stratification and other univariate approaches, which furthermore resulted in bias.

Suggested Citation

  • Pietro Giorgio Lovaglio & Mario Mezzanzanica, 2026. "Analyzing Non‐Random Selectivity in Online Job Advertisements Using Eurostat Benchmark Data and Generalized Sample Selection Models: An Application to EU Regional Labor Markets," LABOUR, CEIS, vol. 40(2), pages 131-161, June.
  • Handle: RePEc:bla:labour:v:40:y:2026:i:2:p:131-161
    DOI: 10.1111/labr.70008
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/labr.70008
    Download Restriction: no

    File URL: https://libkey.io/10.1111/labr.70008?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:labour:v:40:y:2026:i:2:p:131-161. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/csrotit.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.