Authors
- IANCU CRISTINA
(DEPARTMENT OF ECONOMIC INFORMATICS AND CYBERNETICS, BUCHAREST UNIVERSITY OF ECONOMIC STUDIES, ROMANIA; DOCTORAL SCHOOL OF ECONOMIC INFORMATICS, BUCHAREST UNIVERSITY OF ECONOMIC STUDIES, ROMANIA)
- CIUVERCA ALEXANDRA-CRISTINA-DANIELA
(DEPARTMENT OF ECONOMIC INFORMATICS AND CYBERNETICS, BUCHAREST UNIVERSITY OF ECONOMIC STUDIES, ROMANIA; DOCTORAL SCHOOL OF ECONOMIC INFORMATICS, BUCHAREST UNIVERSITY OF ECONOMIC STUDIES, ROMANIA)
- OPREA SIMONA-VASILICA
(DEPARTMENT OF ECONOMIC INFORMATICS AND CYBERNETICS, BUCHAREST UNIVERSITY OF ECONOMIC STUDIES, ROMANIA; DOCTORAL SCHOOL OF ECONOMIC INFORMATICS, BUCHAREST UNIVERSITY OF ECONOMIC STUDIES, ROMANIA)
Abstract
This study explores the intersection between human resources (HR) discourse and macroeconomic indicators using web scraping, natural language processing (NLP), and machine learning techniques. By extracting job postings from Romanian platforms and correlating them with national economic indicators, we identify hidden signals embedded in hiring trends. We aim to demonstrate how HR data can anticipate shifts in prices, salaries, and economic performance in the age of AI, and to uncover predictive signals in this discourse. Using large-scale web scraping of the major Romanian job platforms eJobs and BestJobs, we compiled a dataset of IT and non-IT job advertisements across multiple industries. Textual and structural features of these listings were analyzed using NLP methods including Named Entity Recognition (NER), Latent Dirichlet Allocation (LDA) and BERTopic for topic modeling, and BERT-based embeddings for semantic classification. To place these insights in the context of broader economic trends, macroeconomic indicators such as the Harmonised Index of Consumer Prices (HICP) and industry-specific productivity data were integrated. Visualization techniques such as t-SNE, PCA, and heatmaps were applied to capture correlations and trends across industries and timeframes. The results indicate strong associations between the linguistic features of job postings and key economic indicators. Notably, industries with higher semantic complexity in job descriptions tended to exhibit elevated labor costs and stronger consumer price growth. These findings suggest that HR discourse can serve as a reliable early indicator of economic developments, offering valuable insights for both workforce planning and price prediction when combined with relevant macroeconomic data.
Suggested Citation
Iancu Cristina & Ciuverca Alexandra-Cristina-Daniela & Oprea Simona-Vasilica, 2025.
"Web Scraping And NLP For Price Prediction: The Hidden Signals In HR Discourse,"
Annals - Economy Series, Constantin Brancusi University, Faculty of Economics, vol. 4, pages 51-74, August.
Handle:
RePEc:cbu:jrnlec:y:2025:v:4:p:51-74