IDEAS home Printed from https://ideas.repec.org/p/iza/izadps/dp15941.html

On the Validity of Using Webpage Texts to Identify the Target Population of a Survey: An Application to Detect Online Platforms

Author

Listed:
  • Daas, Piet

    (Eindhoven University of Technology)

  • Hassink, Wolter

    (Utrecht University)

  • Klijs, Bart

    (Statistics Netherlands)

Abstract

A statistical classification model was developed to identify online platform organizations based on the texts on their website. The model was subsequently used to identify all (potential) platform organizations with a website included in the Dutch Business Register. The empirical outcomes of the statistical model were plausible in terms of the words and the bimodal distribution of fitted probabilities, but the results indicated an overestimation of the number of platform organizations. Next, the external validity of the outcomes was investigated through a survey held under the organizations that were identified as a platform organization by the statistical classification model. The response by the organizations to the survey confirmed a substantial number of type-I errors. Furthermore, it revealed a positive association between the fitted probability of the text-based classification model and the organization's response to the survey question on being an online platform organization. The survey results indicated that the text-based classification model can be used to obtain a subpopulation of potential platform organizations from the entire population of businesses with a website.

Suggested Citation

  • Daas, Piet & Hassink, Wolter & Klijs, Bart, 2023. "On the Validity of Using Webpage Texts to Identify the Target Population of a Survey: An Application to Detect Online Platforms," IZA Discussion Papers 15941, IZA Network @ LISER.
  • Handle: RePEc:iza:izadps:dp15941
    as

    Download full text from publisher

    File URL: https://docs.iza.org/dp15941.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Ducci,Francesco, 2020. "Natural Monopolies in Digital Platform Markets," Cambridge Books, Cambridge University Press, number 9781108491143, August.
    2. Sutherland, Will & Jarrahi, Mohammad Hossein, 2018. "The sharing economy and digital platforms: A review and research agenda," International Journal of Information Management, Elsevier, vol. 43(C), pages 328-341.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. V. I. Blanutsa, 2022. "Geographic Research of the Platform Economy: Existing and Potential Approaches," Regional Research of Russia, Springer, vol. 12(2), pages 133-142, June.
    2. Decai Tang & Ziqian Zhao & Jiannan Li & Valentina Boamah, 2024. "Research on coupling coordination degree of digital finance and economic resilience in the Yangtze River Economic Belt," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 120(15), pages 14279-14309, December.
    3. Wan Kei Tham & Weng Marc Lim & Julian Vieceli, 2023. "Foundations of consumption and production in the sharing economy," Electronic Commerce Research, Springer, vol. 23(4), pages 2979-3002, December.
    4. Pei Zhang & Peiran Chen & Fan Xiao & Yong Sun & Shuyan Ma & Ziwei Zhao, 2022. "The Impact of Information Infrastructure on Air Pollution: Empirical Evidence from China," IJERPH, MDPI, vol. 19(21), pages 1-17, November.
    5. Zhan, Yuanzhu & Kumar, Ajay & Hosany, Sameer & Xia, Yusen & Schoenherr, Tobias & Xia, Lan, 2025. "Market competition in the platform economy: new insights, integrative framework and research agenda," Journal of Business Research, Elsevier, vol. 200(C).
    6. Ren, Xiaohang & Zeng, Gudian & Zhao, Yang, 2023. "Digital finance and corporate ESG performance: Empirical evidence from listed companies in China," Pacific-Basin Finance Journal, Elsevier, vol. 79(C).
    7. Yao, Qiongrui (Missy) & Baker, LaKami T. & Lohrke, Franz T., 2022. "Building and sustaining trust in remote work by platform-dependent entrepreneurs on digital labor platforms: Toward an integrative framework," Journal of Business Research, Elsevier, vol. 149(C), pages 327-339.
    8. Thomas Derave & Tiago Princes Sales & Frederik Gailly & Geert Poels, 2022. "Sharing Platform Ontology Development: Proof-of-Concept," Sustainability, MDPI, vol. 14(4), pages 1-24, February.
    9. Aiqin Zhang & Wenjie Zhang & Xiaoqiang Guo, 2024. "The Digital Economy, Integration of Productive Services and Manufacturing, and High-Quality Development of the Manufacturing Sector: Evidence from China," Sustainability, MDPI, vol. 16(23), pages 1-22, November.
    10. Mohamed Wael Ben Khaled & Nadia Ouertani Abaoub, 2024. "Energy Sector Evolution: Perspectives on Energy Platforms and Energy Transition," Post-Print hal-04612503, HAL.
    11. Panniello, Umberto & Natalicchio, Angelo & Ardito, Lorenzo & Messeni Petruzzelli, Antonio, 2024. "A cross-sector exploration of the barriers to customer satisfaction in the sharing economy," Journal of Business Research, Elsevier, vol. 183(C).
    12. Weizhi Yao & Lianshui Li, 2023. "Information technology capability, open technological innovation and firm growth," PLOS ONE, Public Library of Science, vol. 18(10), pages 1-19, October.
    13. Hannan Amoozad Mahdiraji & Fatemeh Yaftiyan & Aliasghar Abbasi-Kamardi & Vahid Jafari-Sadeghi & Jean-Michel Sahut & Leo-Paul Dana, 2023. "A synthesis of boundary conditions with adopting digital platforms in SMEs: an intuitionistic multi-layer decision-making framework," The Journal of Technology Transfer, Springer, vol. 48(5), pages 1723-1751, October.
    14. Izabela Jonek-Kowalska & Radosław Wolniak, 2022. "Sharing Economies’ Initiatives in Municipal Authorities’ Perspective: Research Evidence from Poland in the Context of Smart Cities’ Development," Sustainability, MDPI, vol. 14(4), pages 1-22, February.
    15. Lei Zhang & Jingfeng Yuan & Bing Zhu & Bingsheng Liu & Qiqi Ai, 2025. "The Mediating Role of Virtual Agglomeration in How ICT Infrastructure Drives Urban–Rural Integration: Evidence from China," Land, MDPI, vol. 14(10), pages 1-21, October.
    16. Qiutong Xue & Sixian Feng & Kairan Chen & Muchen Li, 2022. "Impact of Digital Finance on Regional Carbon Emissions: An Empirical Study of Sustainable Development in China," Sustainability, MDPI, vol. 14(14), pages 1-26, July.
    17. Ji, Peinan & Guo, Linke & Yu, Lianchao & Yan, Xiangbin, 2025. "Pathway to prosperity or disparity?: The impact of regional IT penetration on common wealth," China Economic Review, Elsevier, vol. 94(PB).
    18. Benoit, Sabine & Merfeld, Katrin & Tunn, Vivian S.C. & Schaefers, Tobias & Andreassen, Tor Wallin, 2025. "The B2B sharing economy: Framework, implications, and future research," Journal of Business Research, Elsevier, vol. 191(C).
    19. Wei Chen & Jian Chen & Guopeng Yin, 2022. "Exploring side effects of ridesharing services in urban China: role of pollution–averting behavior," Electronic Commerce Research, Springer, vol. 22(4), pages 1007-1034, December.
    20. Yuan, Ruizhi & Luo, Jun & Liu, Martin J. & Yu, Jiang, 2022. "Understanding organizational resilience in a platform-based sharing business: The role of absorptive capacity," Journal of Business Research, Elsevier, vol. 141(C), pages 85-99.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    JEL classification:

    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • C83 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Survey Methods; Sampling Methods
    • D20 - Microeconomics - - Production and Organizations - - - General
    • D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness
    • L20 - Industrial Organization - - Firm Objectives, Organization, and Behavior - - - General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:iza:izadps:dp15941. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Mark Fallak (email available below). General contact details of provider: https://edirc.repec.org/data/izaaalu.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.