IDEAS home Printed from https://ideas.repec.org/a/spr/intemj/v19y2023i4d10.1007_s11365-023-00882-1.html
   My bibliography  Save this article

How can entrepreneurs improve digital market segmentation? A comparative analysis of supervised and unsupervised learning algorithms

Author

Listed:
  • Laura Sáez-Ortuño

    (Universitat de Barcelona)

  • Ruben Huertas-Garcia

    (Universitat de Barcelona)

  • Santiago Forgas-Coll

    (Universitat de Barcelona)

  • Eloi Puertas-Prats

    (Universitat de Barcelona)

Abstract

The identification of digital market segments to make value-creating propositions is a major challenge for entrepreneurs and marketing managers. New technologies and the Internet have made it possible to collect huge volumes of data that are difficult to analyse using traditional techniques. The purpose of this research is to address this challenge by proposing the use of AI algorithms to cluster customers. Specifically, the proposal is to compare the suitability of supervised algorithms, XGBoost, versus unsupervised algorithms, K-means, for segmenting the digital market. To do so, both algorithms have been applied to a sample of 5 million Spanish users captured between 2010 and 2022 by a lead generation start-up. The results show that supervised learning with this type of data is more useful for segmenting markets than unsupervised learning, as it provides solutions that are better suited to entrepreneurs’ commercial objectives.

Suggested Citation

  • Laura Sáez-Ortuño & Ruben Huertas-Garcia & Santiago Forgas-Coll & Eloi Puertas-Prats, 2023. "How can entrepreneurs improve digital market segmentation? A comparative analysis of supervised and unsupervised learning algorithms," International Entrepreneurship and Management Journal, Springer, vol. 19(4), pages 1893-1920, December.
  • Handle: RePEc:spr:intemj:v:19:y:2023:i:4:d:10.1007_s11365-023-00882-1
    DOI: 10.1007/s11365-023-00882-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11365-023-00882-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11365-023-00882-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Artem Timoshenko & John R. Hauser, 2019. "Identifying Customer Needs from User-Generated Content," Marketing Science, INFORMS, vol. 38(1), pages 1-20, January.
    2. Glenn Milligan & Martha Cooper, 1988. "A study of standardization of variables in cluster analysis," Journal of Classification, Springer;The Classification Society, vol. 5(2), pages 181-204, September.
    3. Alexander Buhmann & Johannes Paßmann & Christian Fieseler, 2020. "Managing Algorithmic Accountability: Balancing Reputational Concerns, Engagement Strategies, and the Potential of Rational Discourse," Journal of Business Ethics, Springer, vol. 163(2), pages 265-280, May.
    4. Oded Netzer & Ronen Feldman & Jacob Goldenberg & Moshe Fresko, 2012. "Mine Your Own Business: Market-Structure Surveillance Through Text Mining," Marketing Science, INFORMS, vol. 31(3), pages 521-543, May.
    5. Ying Liu & Sudha Ram & Robert F. Lusch & Michael Brusco, 2010. "Multicriterion Market Segmentation: A New Model, Implementation, and Evaluation," Marketing Science, INFORMS, vol. 29(5), pages 880-894, 09-10.
    6. Sascha Kraus & Matthias Breier & Sonia Dasí-Rodríguez, 2020. "The art of crafting a systematic literature review in entrepreneurship research," International Entrepreneurship and Management Journal, Springer, vol. 16(3), pages 1023-1042, September.
    7. Hartmann, Jochen & Heitmann, Mark & Siebert, Christian & Schamp, Christina, 2023. "More than a Feeling: Accuracy and Application of Sentiment Analysis," International Journal of Research in Marketing, Elsevier, vol. 40(1), pages 75-87.
    8. Alexander Strehl & Joydeep Ghosh, 2003. "Relationship-Based Clustering and Visualization for High-Dimensional Data Mining," INFORMS Journal on Computing, INFORMS, vol. 15(2), pages 208-230, May.
    9. Hartmann, Jochen & Huppertz, Juliana & Schamp, Christina & Heitmann, Mark, 2019. "Comparing automated text classification methods," International Journal of Research in Marketing, Elsevier, vol. 36(1), pages 20-38.
    10. Panos Vassiliadis, 2009. "A Survey of Extract–Transform–Load Technology," International Journal of Data Warehousing and Mining (IJDWM), IGI Global, vol. 5(3), pages 1-27, July.
    11. Wayne S. DeSarbo & C. Anthony Di Benedetto & Michael Song & Indrajit Sinha, 2005. "Revisiting the Miles and Snow strategic framework: uncovering interrelationships between strategic types, capabilities, environmental uncertainty, and firm performance," Strategic Management Journal, Wiley Blackwell, vol. 26(1), pages 47-74, January.
    12. David B. Audretsch & Maksim Belitski & Rosa Caiazza & Erik E. Lehmann, 2020. "Knowledge management and entrepreneurship," International Entrepreneurship and Management Journal, Springer, vol. 16(2), pages 373-385, June.
    13. Ying-Kai Liao & Vu Huu Anh Nguyen & Andrea Caputo, 2022. "Unveiling the role of entrepreneurial knowledge and cognition as antecedents of entrepreneurial intention: a meta-analytic study," International Entrepreneurship and Management Journal, Springer, vol. 18(4), pages 1623-1652, December.
    14. Bremer, Lucas & Heitmann, Mark & Schreiner, Thomas F., 2017. "When and how to infer heuristic consideration set rules of consumers," International Journal of Research in Marketing, Elsevier, vol. 34(2), pages 516-535.
    15. Steven Tadelis, 2016. "Reputation and Feedback Systems in Online Platform Markets," Annual Review of Economics, Annual Reviews, vol. 8(1), pages 321-340, October.
    16. M Poongodi & Mohit Malviya & Chahat Kumar & Mounir Hamdi & V Vijayakumar & Jamel Nebhen & Hasan Alyamani, 2022. "New York City taxi trip duration prediction using MLP and XGBoost," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 13(1), pages 16-27, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ricardo Costa-Climent & Samuel Ribeiro Navarrete & Darek M. Haftor & Marcin W. Staniewski, 2024. "Value creation and appropriation from the use of machine learning: a study of start-ups using fuzzy-set qualitative comparative analysis," International Entrepreneurship and Management Journal, Springer, vol. 20(2), pages 935-967, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jan Ole Krugmann & Jochen Hartmann, 2024. "Sentiment Analysis in the Age of Generative AI," Customer Needs and Solutions, Springer;Institute for Sustainable Innovation and Growth (iSIG), vol. 11(1), pages 1-19, December.
    2. Uttara Ananthakrishnan & Davide Proserpio & Siddhartha Sharma, 2023. "I Hear You: Does Quality Improve with Customer Voice?," Marketing Science, INFORMS, vol. 42(6), pages 1143-1161, November.
    3. Andreas Kraft & Raghunath Singh Rao, 2025. "Signaling quality via demand lockout," Quantitative Marketing and Economics (QME), Springer, vol. 23(1), pages 1-44, March.
    4. de Haan, Evert & Padigar, Manjunath & El Kihal, Siham & Kübler, Raoul & Wieringa, Jaap E., 2024. "Unstructured data research in business: Toward a structured approach," Journal of Business Research, Elsevier, vol. 177(C).
    5. Balázs Kovács, 2024. "The Turing test of online reviews: Can we tell the difference between human-written and GPT-4-written online reviews?," Marketing Letters, Springer, vol. 35(4), pages 651-666, December.
    6. Jiyeon Hong & Paul R. Hoban, 2022. "Writing More Compelling Creative Appeals: A Deep Learning-Based Approach," Marketing Science, INFORMS, vol. 41(5), pages 941-965, September.
    7. Carlson, Keith & Kopalle, Praveen K. & Riddell, Allen & Rockmore, Daniel & Vana, Prasad, 2023. "Complementing human effort in online reviews: A deep learning approach to automatic content generation and review synthesis," International Journal of Research in Marketing, Elsevier, vol. 40(1), pages 54-74.
    8. Gensler, Sonja & Oehring, Karlo & Wiesel, Thorsten, 2024. "Reported and communicated shifts in strategic emphasis and firm performance," International Journal of Research in Marketing, Elsevier, vol. 41(2), pages 220-240.
    9. Roelen-Blasberg, Tobias & Habel, Johannes & Klarmann, Martin, 2023. "Automated inference of product attributes and their importance from user-generated content: Can we replace traditional market research?," International Journal of Research in Marketing, Elsevier, vol. 40(1), pages 164-188.
    10. Morimura, Fumikazu & Sakagawa, Yuji, 2023. "The intermediating role of big data analytics capability between responsive and proactive market orientations and firm performance in the retail industry," Journal of Retailing and Consumer Services, Elsevier, vol. 71(C).
    11. Boegershausen, Johannes & Datta, Hannes & Borah, Abhishek & Stephen, Andrew, 2022. "Fields of Gold: Web Scraping and APIs for Impactful Marketing Insights," Other publications TiSEM 5f1ed70a-48c3-422c-bc10-0, Tilburg University, School of Economics and Management.
    12. Hartmann, Jochen & Heitmann, Mark & Siebert, Christian & Schamp, Christina, 2023. "More than a Feeling: Accuracy and Application of Sentiment Analysis," International Journal of Research in Marketing, Elsevier, vol. 40(1), pages 75-87.
    13. Jonah Berger & Grant Packard & Reihane Boghrati & Ming Hsu & Ashlee Humphreys & Andrea Luangrath & Sarah Moore & Gideon Nave & Christopher Olivola & Matthew Rocklage, 2022. "Marketing insights from text analysis," Marketing Letters, Springer, vol. 33(3), pages 365-377, September.
    14. Jifeng Mu & Jonathan Z. Zhang, 2021. "Seller marketing capability, brand reputation, and consumer journeys on e-commerce platforms," Journal of the Academy of Marketing Science, Springer, vol. 49(5), pages 994-1020, September.
    15. Ma, Liye & Sun, Baohong, 2020. "Machine learning and AI in marketing – Connecting computing power to human insights," International Journal of Research in Marketing, Elsevier, vol. 37(3), pages 481-504.
    16. Venkatesh Shankar & Sohil Parsana, 2022. "An overview and empirical comparison of natural language processing (NLP) models and an introduction to and empirical application of autoencoder models in marketing," Journal of the Academy of Marketing Science, Springer, vol. 50(6), pages 1324-1350, November.
    17. Shimi Naurin Ahmad & Michel Laroche, 2023. "Extracting marketing information from product reviews: a comparative study of latent semantic analysis and probabilistic latent semantic analysis," Journal of Marketing Analytics, Palgrave Macmillan, vol. 11(4), pages 662-676, December.
    18. Ming-Hui Huang & Roland T. Rust, 2021. "A strategic framework for artificial intelligence in marketing," Journal of the Academy of Marketing Science, Springer, vol. 49(1), pages 30-50, January.
    19. Schwenzow, Jasper & Hartmann, Jochen & Schikowsky, Amos & Heitmann, Mark, 2021. "Understanding videos at scale: How to extract insights for business research," Journal of Business Research, Elsevier, vol. 123(C), pages 367-379.
    20. Qian, Yang & Ling, Haifeng & Meng, Xiangrui & Jiang, Yuanchun & Chai, Yidong & Liu, Yezheng, 2024. "Voice of the Professional: Acquiring competitive intelligence from large-scale professional generated contents," Journal of Business Research, Elsevier, vol. 180(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:intemj:v:19:y:2023:i:4:d:10.1007_s11365-023-00882-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.