IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0255419.html

Analysis of big data job requirements based on K-means text clustering in China

Author

Listed:
  • Dai Debao
  • Ma Yinxia
  • Zhao Min

Abstract

This paper uses K-means text clustering to characterize the requirements of big data jobs in China, with the aim of helping enterprises and employees identify big data talent and promoting further big data-related research. First, crawler software is used to collect recruitment postings about "big data" from the zhaopin.com recruitment website. Then, Jieba word segmentation and K-means text clustering are used to cluster the big data positions, with the number of clusters determined by the average within-group sum of squares. Finally, big data jobs are divided into 10 categories, and their city distribution, salary level, education requirements, and experience requirements are analyzed for both the overall data set and the individual clusters, to clarify the characteristics of big data job demand. The results show that demand for big data jobs is concentrated in first-tier and new first-tier cities. Enterprises prefer job seekers with a college degree or bachelor’s degree and more than one year of relevant experience. Wages differ across job types. The higher the position, the higher the education and experience requirements.
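
The clustering step described in the abstract can be illustrated with a minimal sketch. It is not the authors' code, and several things in it are assumptions: whitespace splitting stands in for Jieba word segmentation, plain term counts stand in for whatever text features the paper used, the four job snippets are invented toy data, and the helper names (tokenize, vectorize, kmeans, avg_within_ss) are hypothetical. It only shows how an average within-group sum of squares can be computed for several candidate numbers of clusters.

```python
import random
from collections import Counter


def tokenize(text):
    # Assumption: whitespace splitting stands in for Jieba segmentation.
    return text.lower().split()


def vectorize(docs):
    # Bag-of-words term counts over a shared, sorted vocabulary.
    tokens = [tokenize(d) for d in docs]
    vocab = sorted({t for doc in tokens for t in doc})
    vectors = []
    for doc in tokens:
        counts = Counter(doc)
        vectors.append([float(counts[t]) for t in vocab])
    return vectors, vocab


def sq_dist(a, b):
    # Squared Euclidean distance between two equal-length vectors.
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, k, iters=50, seed=0):
    # Plain Lloyd's algorithm; initial centers are k distinct input rows.
    rng = random.Random(seed)
    centers = [list(v) for v in rng.sample(vectors, k)]
    labels = None
    for _ in range(iters):
        new_labels = [
            min(range(k), key=lambda c: sq_dist(v, centers[c]))
            for v in vectors
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:  # keep the old center if a cluster is empty
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centers


def avg_within_ss(vectors, labels, centers):
    # Mean squared distance from each document to its cluster center.
    total = sum(sq_dist(v, centers[lab]) for v, lab in zip(vectors, labels))
    return total / len(vectors)


# Invented toy postings, for illustration only.
docs = [
    "big data engineer hadoop spark",
    "big data engineer spark kafka",
    "data analyst sql excel report",
    "data analyst sql report dashboard",
]
vectors, vocab = vectorize(docs)

# Average within-group sum of squares for each candidate k.
curve = {}
for k in range(1, len(docs) + 1):
    labels, centers = kmeans(vectors, k)
    curve[k] = avg_within_ss(vectors, labels, centers)

for k, value in curve.items():
    print(k, round(value, 3))
```

One common way to read such a curve is to pick the number of clusters at which the value stops dropping sharply. The abstract states only that the average within-group sum of squares was used, so the exact selection rule here is an assumption.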

Suggested Citation

  • Dai Debao & Ma Yinxia & Zhao Min, 2021. "Analysis of big data job requirements based on K-means text clustering in China," PLOS ONE, Public Library of Science, vol. 16(8), pages 1-14, August.
  • Handle: RePEc:plo:pone00:0255419
    DOI: 10.1371/journal.pone.0255419

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0255419
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0255419&type=printable
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Krisztina Szegedi & Tamás Németh & Dorina Körtvési, 2023. "Employer Branding in the Fashion Industry: CSR Actions by Fashion SMEs," Sustainability, MDPI, vol. 15(3), pages 1-18, January.
    2. Kaenat Malik & Prof.Dr.Tariq Jalees, 2019. "The Mediating Role Of Employer Branding Between Employee Satisfaction And Talent Management," IBT Journal of Business Studies (JBS), Ilma University, Faculty of Management Science, vol. 15(2), pages 75-94.
    3. Agnieszka Izabela Baruk & Grzegorz Wesołowski, 2021. "The Effect of Using Social Media in the Modern Marketing Communication on the Shaping an External Employer’s Image," Energies, MDPI, vol. 14(14), pages 1-23, July.
    4. Vörös, Máté & Fűrész, Diána Ivett, 2021. "A részmunkaidős foglalkoztatás hatékonyságának empirikus vizsgálata [The efficiency of part-time employment]," Közgazdasági Szemle (Economic Review - monthly of the Hungarian Academy of Sciences), Közgazdasági Szemle Alapítvány (Economic Review Foundation), vol. 0(2), pages 178-204.
    5. Paolo Antonetti & Benedetta Crisafulli & Aybars Tuncdogan, 2021. "“Just Look the Other Way”: Job Seekers’ Reactions to the Irresponsibility of Market-Dominant Employers," Journal of Business Ethics, Springer, vol. 174(2), pages 403-422, November.
    6. Rohit Aggarwal & Michael J. Lee & Vishal Midha, 2023. "Differential Impact of Content in Online Communication on Heterogeneous Candidates: A Field Study in Technical Recruitment," Information Systems Research, INFORMS, vol. 34(2), pages 609-628, June.
    7. Kaenat Malik & Prof. Dr. Tariq Jalees, 2019. "The Mediating Role Of Employer Branding Between Employee Satisfaction And Talent Management," IBT Journal of Business Studies (JBS), Ilma University, Faculty of Management Science, vol. 15(2), pages 15-16.
    8. Faheem Gul Gilal & Nisar Ahmed Channa & Naeem Gul Gilal & Rukhsana Gul Gilal & Zhenxing Gong & Na Zhang, 2020. "Corporate social responsibility and brand passion among consumers: Theory and evidence," Corporate Social Responsibility and Environmental Management, John Wiley & Sons, vol. 27(5), pages 2275-2285, September.
    9. Romuald Grouille, 2021. "Segmenter Les Perceptions De La Marque Employeur Chez Des Recrutés : Quel(S) Apport(S) Rh ?," Working Papers hal-04128830, HAL.
    10. Sheehan, Norman T. & Vaidyanathan, Ganesh & Fox, Kenneth A. & Klassen, Mark, 2023. "Making the invisible, visible: Overcoming barriers to ESG performance with an ESG mindset," Business Horizons, Elsevier, vol. 66(2), pages 265-276.
    11. Luyu Liu & Harvey J Miller, 2021. "Measuring risk of missing transfers in public transit systems using high-resolution schedule and real-time bus location data," Urban Studies, Urban Studies Journal Limited, vol. 58(15), pages 3140-3156, November.
    12. Martin Hilbert, 2017. "Complementary Variety: When Can Cooperation in Uncertain Environments Outperform Competitive Selection?," Complexity, Hindawi, vol. 2017, pages 1-15, September.
    13. Anke Joubert & Matthias Murawski & Markus Bick, 2023. "Measuring the Big Data Readiness of Developing Countries – Index Development and its Application to Africa," Information Systems Frontiers, Springer, vol. 25(1), pages 327-350, February.
    14. Raymond Lang & Marguerite Schneider & Maria Kett & Ellie Cole & Nora Groce, 2019. "Policy development: An analysis of disability inclusion in a selection of African Union policies," Development Policy Review, Overseas Development Institute, vol. 37(2), pages 155-175, March.
    15. Makoza, Frank, 2023. "Analyzing policy change of Malawi ICT and Digitalization policy: Policy Assemblage Perspective," EconStor Preprints 273309, ZBW - Leibniz Information Centre for Economics.
    16. Yihan Chi & Yongheng Fang & Jiamin Liu, 2022. "Spatial–Temporal Evolution Characteristics and Economic Effects of China’s Cultural and Tourism Industries’ Collaborative Agglomeration," Sustainability, MDPI, vol. 14(22), pages 1-23, November.
    17. Richard Heeks & Vanya Rakesh & Ritam Sengupta & Sumandro Chattapadhyay & Christopher Foster, 2021. "Datafication, value and power in developing countries: Big data in two Indian public service organizations," Development Policy Review, Overseas Development Institute, vol. 39(1), pages 82-102, January.
    18. Zheng, Yanqiao & Zhang, Xiaoqi & Zhu, Yu, 2021. "Overeducation, major mismatch, and return to higher education tiers: Evidence from novel data source of a major online recruitment platform in China," China Economic Review, Elsevier, vol. 66(C).
    19. Ana Fernandes & Martin Huber & Giannina Vaccaro, 2021. "Gender differences in wage expectations," PLOS ONE, Public Library of Science, vol. 16(6), pages 1-24, June.
    20. Prof. Dr.Sejdi Rexhepi & Mjellma Kadriu, 2018. "The Importance of Resource Assessment for Entrepreneurship and Local Economic Development in Kosovo," European Journal of Economics and Business Studies Articles, Revistia Research and Publishing, vol. 4, January -.
