IDEAS home Printed from https://ideas.repec.org/a/inm/ormsom/v25y2023i2p648-667.html

Market Segmentation Trees

Author

Listed:
  • Ali Aouad

    (London Business School, London NW1 4SA, United Kingdom)

  • Adam N. Elmachtoub

    (Department of Industrial Engineering and Operations Research and Data Science Institute, Columbia University, New York, New York 10027)

  • Kris J. Ferreira

    (Harvard Business School, Harvard University, Boston, Massachusetts 02163)

  • Ryan McNellis

    (Department of Industrial Engineering and Operations Research and Data Science Institute, Columbia University, New York, New York 10027)

Abstract

Problem definition : We seek to provide an interpretable framework for segmenting users in a population for personalized decision making. Methodology/results : We propose a general methodology, market segmentation trees (MSTs), for learning market segmentations explicitly driven by identifying differences in user response patterns. To demonstrate the versatility of our methodology, we design two new specialized MST algorithms: (i) choice model trees (CMTs), which can be used to predict a user’s choice amongst multiple options, and (ii) isotonic regression trees (IRTs), which can be used to solve the bid landscape forecasting problem. We provide a theoretical analysis of the asymptotic running times of our algorithmic methods, which validates their computational tractability on large data sets. We also provide a customizable, open-source code base for training MSTs in Python that uses several strategies for scalability, including parallel processing and warm starts. Finally, we assess the practical performance of MSTs on several synthetic and real-world data sets, showing that our method reliably finds market segmentations that accurately model response behavior. Managerial implications : The standard approach to conduct market segmentation for personalized decision making is to first perform market segmentation by clustering users according to similarities in their contextual features and then fit a “response model” to each segment to model how users respond to decisions. However, this approach may not be ideal if the contextual features prominent in distinguishing clusters are not key drivers of response behavior. Our approach addresses this issue by integrating market segmentation and response modeling, which consistently leads to improvements in response prediction accuracy, thereby aiding personalization. We find that such an integrated approach can be computationally tractable and effective even on large-scale data sets. Moreover, MSTs are interpretable because the market segments can easily be described by a decision tree and often require only a fraction of the number of market segments generated by traditional approaches. Disclaimer: This work was done prior to Ryan McNellis joining Amazon.

Suggested Citation

  • Ali Aouad & Adam N. Elmachtoub & Kris J. Ferreira & Ryan McNellis, 2023. "Market Segmentation Trees," Manufacturing & Service Operations Management, INFORMS, vol. 25(2), pages 648-667, March.
  • Handle: RePEc:inm:ormsom:v:25:y:2023:i:2:p:648-667
    DOI: 10.1287/msom.2023.1195
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/msom.2023.1195
    Download Restriction: no

    File URL: https://libkey.io/10.1287/msom.2023.1195?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521766555, Enero-Abr.
    2. Fernando Bernstein & Sajad Modaresi & Denis Sauré, 2019. "A Dynamic Clustering Approach to Data-Driven Assortment Personalization," Management Science, INFORMS, vol. 67(5), pages 2095-2115, May.
    3. Ruomeng Cui & Santiago Gallino & Antonio Moreno & Dennis J. Zhang, 2018. "The Operational Value of Social Media Information," Production and Operations Management, Production and Operations Management Society, vol. 27(10), pages 1749-1769, October.
    4. Dimitris Bertsimas & Nathan Kallus, 2020. "From Predictive to Prescriptive Analytics," Management Science, INFORMS, vol. 66(3), pages 1025-1044, March.
    5. Hema Yoganarasimhan, 2020. "Search Personalization Using Machine Learning," Management Science, INFORMS, vol. 66(3), pages 1045-1070, March.
    6. Lemmens, A. & Croux, C., 2006. "Bagging and boosting classification trees to predict churn," Other publications TiSEM d5cb664d-5859-44db-a621-e, Tilburg University, School of Economics and Management.
    7. Nathan Kallus & Madeleine Udell, 2020. "Dynamic Assortment Personalization in High Dimensions," Operations Research, INFORMS, vol. 68(4), pages 1020-1037, July.
    8. Ricardo Fraiman & Badih Ghattas & Marcela Svarc, 2013. "Interpretable clustering using unsupervised binary trees," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 7(2), pages 125-145, June.
    9. Chloe Kim Glaeser & Marshall Fisher & Xuanming Su, 2019. "Optimal Retail Location: Empirical Methodology and Application to Practice," Service Science, INFORMS, vol. 21(1), pages 86-102, January.
    10. Emilio Carrizosa & Cristina Molero-Río & Dolores Romero Morales, 2021. "Mathematical optimization in classification and regression trees," TOP: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(1), pages 5-33, April.
    11. Mila Nambiar & David Simchi-Levi & He Wang, 2019. "Dynamic Learning and Pricing with Model Misspecification," Management Science, INFORMS, vol. 65(11), pages 4980-5000, November.
    12. Kris Johnson Ferreira & Bin Hong Alex Lee & David Simchi-Levi, 2016. "Analytics for an Online Retailer: Demand Forecasting and Price Optimization," Manufacturing & Service Operations Management, INFORMS, vol. 18(1), pages 69-88, February.
    13. Adam N. Elmachtoub & Vishal Gupta & Michael L. Hamilton, 2021. "The Value of Personalized Pricing," Management Science, INFORMS, vol. 67(10), pages 6055-6070, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jiameng Lyu & Jinxing Xie & Shilin Yuan & Yuan Zhou, 2025. "A Minibatch Stochastic Gradient Descent-Based Learning Metapolicy for Inventory Systems with Myopic Optimal Policy," Management Science, INFORMS, vol. 71(7), pages 5572-5588, July.
    2. Tu, Jiancheng & Wu, Zhibin, 2025. "Inherently interpretable machine learning for credit scoring: Optimal classification tree with hyperplane splits," European Journal of Operational Research, Elsevier, vol. 322(2), pages 647-664.
    3. Consolo, Antonio & Amaldi, Edoardo & Manno, Andrea, 2026. "Soft regression trees: A model variant and a decomposition training algorithm," European Journal of Operational Research, Elsevier, vol. 328(2), pages 607-619.
    4. N. Bora Keskin & Yuexing Li & Nur Sunar, 2025. "Data-Driven Clustering and Feature-Based Retail Electricity Pricing with Smart Meters," Operations Research, INFORMS, vol. 73(5), pages 2636-2660, September.
    5. Xue, Gang & Gong, Daqing & Ren, Long & Cui, Ziruo, 2026. "Modeling expert risk assessments in utility tunnels with deep learning," Reliability Engineering and System Safety, Elsevier, vol. 265(PA).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ningyuan Chen & Ming Hu, 2023. "Frontiers in Service Science: Data-Driven Revenue Management: The Interplay of Data, Model, and Decisions," Service Science, INFORMS, vol. 15(2), pages 79-91, June.
    2. Ningyuan Chen & Andre A. Cire & Ming Hu & Saman Lagzi, 2023. "Model-Free Assortment Pricing with Transaction Data," Management Science, INFORMS, vol. 69(10), pages 5830-5847, October.
    3. Victor Martínez‐de‐Albéniz & Arnau Planas & Stefano Nasini, 2020. "Using Clickstream Data to Improve Flash Sales Effectiveness," Production and Operations Management, Production and Operations Management Society, vol. 29(11), pages 2508-2531, November.
    4. Long He & Sheng Liu & Zuo‐Jun Max Shen, 2022. "Smart urban transport and logistics: A business analytics perspective," Production and Operations Management, Production and Operations Management Society, vol. 31(10), pages 3771-3787, October.
    5. Julian Senoner & Bernhard Kratzwald & Milan Kuzmanovic & Torbjørn H. Netland & Stefan Feuerriegel, 2023. "Addressing distributional shifts in operations management: The case of order fulfillment in customized production," Production and Operations Management, Production and Operations Management Society, vol. 32(10), pages 3022-3042, October.
    6. Yining Wang & Quanquan Liu, 2025. "Estimation of High-Dimensional Contextual Pricing Models with Nonparametric Price Confounders," Operations Research, INFORMS, vol. 73(6), pages 3065-3084, November.
    7. Sanghoon Cho & Mark Ferguson & Pelin Pekgün & Andrew Vakhutinsky, 2023. "Estimating Personalized Demand with Unobserved No-Purchases Using a Mixture Model: An Application in the Hotel Industry," Manufacturing & Service Operations Management, INFORMS, vol. 25(4), pages 1245-1262, July.
    8. Julia Heger & Robert Klein, 2024. "Assortment optimization: a systematic literature review," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 46(4), pages 1099-1161, December.
    9. Muzaffer Buyruk & Ertan Güner, 2022. "Personalization in airline revenue management: an overview and future outlook," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 21(2), pages 129-139, April.
    10. Melvyn Sim & Qinshen Tang & Minglong Zhou & Taozeng Zhu, 2025. "The Analytics of Robust Satisficing: Predict, Optimize, Satisfice, Then Fortify," Operations Research, INFORMS, vol. 73(5), pages 2708-2728, September.
    11. Qi Feng & J. George Shanthikumar, 2022. "Developing operations management data analytics," Production and Operations Management, Production and Operations Management Society, vol. 31(12), pages 4544-4557, December.
    12. Jingwen Tang & Zhengling Qi & Ethan Fang & Cong Shi, 2025. "Offline Feature-Based Pricing Under Censored Demand: A Causal Inference Approach," Manufacturing & Service Operations Management, INFORMS, vol. 27(2), pages 535-553, March.
    13. Adam Behrendt & Martin Savelsbergh & He Wang, 2023. "A Prescriptive Machine Learning Method for Courier Scheduling on Crowdsourced Delivery Platforms," Transportation Science, INFORMS, vol. 57(4), pages 889-907, July.
    14. N. Bora Keskin & Yuexing Li & Nur Sunar, 2025. "Data-Driven Clustering and Feature-Based Retail Electricity Pricing with Smart Meters," Operations Research, INFORMS, vol. 73(5), pages 2636-2660, September.
    15. Dazhou Lei & Hao Hu & Dongyang Geng & Jianshen Zhang & Yongzhi Qi & Sheng Liu & Zuo‐Jun Max Shen, 2023. "New product life cycle curve modeling and forecasting with product attributes and promotion: A Bayesian functional approach," Production and Operations Management, Production and Operations Management Society, vol. 32(2), pages 655-673, February.
    16. Kameng Nip & Zhenbo Wang & Zizhuo Wang, 2021. "Assortment Optimization under a Single Transition Choice Model," Production and Operations Management, Production and Operations Management Society, vol. 30(7), pages 2122-2142, July.
    17. Tsao, Yu-Chung & Chen, Yu-Kai & Chiu, Shih-Hao & Lu, Jye-Chyi & Vu, Thuy-Linh, 2022. "An innovative demand forecasting approach for the server industry," Technovation, Elsevier, vol. 110(C).
    18. Zhaohui (Zoey) Jiang & Jun Li, 2025. "Instrumenting While Experimenting: An Empirical Method for Competitive Pricing at Scale," Operations Research, INFORMS, vol. 73(5), pages 2477-2495, September.
    19. Lalit Jain & Zhaoqi Li & Erfan Loghmani & Blake Mason & Hema Yoganarasimhan, 2024. "Effective Adaptive Exploration of Prices and Promotions in Choice-Based Demand Models," Marketing Science, INFORMS, vol. 43(5), pages 1002-1030, September.
    20. Ausseil, Rosemonde & Ulmer, Marlin W. & Pazour, Jennifer A., 2024. "Online acceptance probability approximation in peer-to-peer transportation," Omega, Elsevier, vol. 123(C).

    More about this item

    Keywords

    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormsom:v:25:y:2023:i:2:p:648-667. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.