Active Learning for Contextual Search with Binary Feedback

Author

Listed:
  • Xi Chen

    (Leonard N. Stern School of Business, New York University, New York, New York 10012)

  • Quanquan Liu

    (Naveen Jindal School of Management, University of Texas at Dallas, Richardson, Texas 75080)

  • Yining Wang

    (Naveen Jindal School of Management, University of Texas at Dallas, Richardson, Texas 75080)

Abstract

In this paper, we study the learning problem in contextual search, which is motivated by applications such as crowdsourcing and personalized medicine experiments. In particular, for a sequence of arriving context vectors, with each context associated with an underlying value, the decision maker either makes a query at a certain point or skips the context. The decision maker will only observe the binary feedback on the relationship between the query point and the value associated with the context. We study a probably approximately correct learning setting, where the goal is to learn the underlying mean value function in context with a minimum number of queries. To address this challenge, we propose a trisection search approach combined with a margin-based active learning method. We show that the algorithm only needs to make $\tilde{O}(1/\varepsilon^2)$ queries to achieve an $\varepsilon$-estimation accuracy. This sample complexity significantly reduces the required sample complexity in the passive setting where neither sample skipping nor query selection is allowed, which is at least $\Omega(1/\varepsilon^3)$.
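
The trisection idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it ignores contexts, skipping, and the margin-based component, and only shows how a single unknown scalar value can be localized from binary feedback. The names `trisection_search`, `make_noisy_oracle`, and `queries_per_point`, the Gaussian noise model, and the fixed number of queries per point are all assumptions made for illustration.

```python
import random
from typing import Callable, Tuple


def trisection_search(
    feedback: Callable[[float], int],
    lo: float = 0.0,
    hi: float = 1.0,
    eps: float = 1e-2,
    queries_per_point: int = 200,
) -> Tuple[float, int]:
    """Estimate an unknown value v in [lo, hi] from binary feedback.

    feedback(q) returns 1 when the (possibly noisy) value is at least q,
    and 0 otherwise. Returns (estimate, total number of queries made).
    """
    total = 0
    while hi - lo > eps:
        third = (hi - lo) / 3.0
        m1, m2 = lo + third, hi - third
        # Empirical frequency of the answer "value >= query" at each point.
        p1 = sum(feedback(m1) for _ in range(queries_per_point)) / queries_per_point
        p2 = sum(feedback(m2) for _ in range(queries_per_point)) / queries_per_point
        total += 2 * queries_per_point
        # v cannot be close to both m1 and m2, so act on the more decisive test:
        # either "v >= m1" (drop the lower third) or "v < m2" (drop the upper third).
        if p1 - 0.5 >= 0.5 - p2:
            lo = m1
        else:
            hi = m2
    return (lo + hi) / 2.0, total


def make_noisy_oracle(v: float, sigma: float, seed: int = 0) -> Callable[[float], int]:
    """Hypothetical feedback model: compare q against v plus Gaussian noise."""
    rng = random.Random(seed)
    return lambda q: 1 if v + rng.gauss(0.0, sigma) >= q else 0


if __name__ == "__main__":
    estimate, n_queries = trisection_search(make_noisy_oracle(0.37, sigma=0.1), eps=0.05)
    print(f"estimate={estimate:.3f} after {n_queries} queries")
```

Each round keeps two thirds of the interval, so a noiseless oracle needs on the order of log(1/eps) rounds. With noisy feedback, a fixed `queries_per_point` becomes unreliable once the interval is narrow relative to the noise level, so the number of queries per point would have to grow as the interval shrinks.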

Suggested Citation

  • Xi Chen & Quanquan Liu & Yining Wang, 2023. "Active Learning for Contextual Search with Binary Feedback," Management Science, INFORMS, vol. 69(4), pages 2165-2181, April.
  • Handle: RePEc:inm:ormnsc:v:69:y:2023:i:4:p:2165-2181
    DOI: 10.1287/mnsc.2022.4473

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/mnsc.2022.4473
    Download Restriction: no

    File URL: https://libkey.io/10.1287/mnsc.2022.4473?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    References listed on IDEAS

    1. Maxime C. Cohen & Ilan Lobel & Renato Paes Leme, 2020. "Feature-Based Dynamic Pricing," Management Science, INFORMS, vol. 66(11), pages 4921-4943, November.
    2. Hamsa Bastani & Mohsen Bayati, 2020. "Online Decision Making with High-Dimensional Covariates," Operations Research, INFORMS, vol. 68(1), pages 276-294, January.
    3. Li, Xiaoou & Chen, Yunxiao & Chen, Xi & Liu, Jingchen & Ying, Zhiliang, 2021. "Optimal stopping and worker selection in crowdsourcing: an adaptive sequential probability ratio test framework," LSE Research Online Documents on Economics 100873, London School of Economics and Political Science, LSE Library.
    4. Hamsa Bastani & Mohsen Bayati & Khashayar Khosravi, 2021. "Mostly Exploration-Free Algorithms for Contextual Bandits," Management Science, INFORMS, vol. 67(3), pages 1329-1349, March.

    Citations

    Cited by:

    1. Xueping Gong & Wei You & Jiheng Zhang, 2026. "Minimax Optimality in Contextual Dynamic Pricing with General Valuation Models," Operations Research, INFORMS, vol. 74(2), pages 879-897, March.
    2. Jinglong Zhao, 2024. "Experimental Design For Causal Inference Through An Optimization Lens," Papers 2408.09607, arXiv.org, revised Aug 2024.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hamsa Bastani & David Simchi-Levi & Ruihao Zhu, 2022. "Meta Dynamic Pricing: Transfer Learning Across Experiments," Management Science, INFORMS, vol. 68(3), pages 1865-1881, March.
    2. Zhimei Ren & Zhengyuan Zhou, 2024. "Dynamic Batch Learning in High-Dimensional Sparse Linear Contextual Bandits," Management Science, INFORMS, vol. 70(2), pages 1315-1342, February.
    3. Divya Singhvi & Somya Singhvi, 2025. "Online Learning with Sample Selection Bias," Operations Research, INFORMS, vol. 73(5), pages 2458-2476, September.
    4. Yinchu Zhu & Ilya O. Ryzhov, 2022. "Optimal data-driven hiring with equity for underrepresented groups," Papers 2206.09300, arXiv.org.
    5. Ningyuan Chen & Guillermo Gallego, 2021. "Nonparametric Pricing Analytics with Customer Covariates," Operations Research, INFORMS, vol. 69(3), pages 974-984, May.
    6. Yining Wang & Quanquan Liu, 2025. "Estimation of High-Dimensional Contextual Pricing Models with Nonparametric Price Confounders," Operations Research, INFORMS, vol. 73(6), pages 3065-3084, November.
    7. Mohammad Zhalechian & Esmaeil Keyvanshokooh & Cong Shi & Mark P. Van Oyen, 2023. "Data-Driven Hospital Admission Control: A Learning Approach," Operations Research, INFORMS, vol. 71(6), pages 2111-2129, November.
    8. Esmaeil Keyvanshokooh & Mohammad Zhalechian & Cong Shi & Mark P. Van Oyen & Pooyan Kazemian, 2025. "Contextual Learning with Online Convex Optimization: Theory and Application to Medical Decision-Making," Management Science, INFORMS, vol. 71(12), pages 10442-10464, December.
    9. Akshay Krishnamurthy & Thodoris Lykouris & Chara Podimata & Robert Schapire, 2023. "Contextual Search in the Presence of Adversarial Corruptions," Operations Research, INFORMS, vol. 71(4), pages 1120-1135, July.
    10. Jackie Baek & Vivek F. Farias, 2024. "Fair Exploration via Axiomatic Bargaining," Management Science, INFORMS, vol. 70(12), pages 8922-8939, December.
    11. Saeid Delshad & Amin Khademi, 2022. "Adaptive Design of Personalized Dose-Finding Clinical Trials," Service Science, INFORMS, vol. 14(4), pages 273-291, December.
    12. Jingwen Zhang & Yifang Chen & Amandeep Singh, 2022. "Causal Bandits: Online Decision-Making in Endogenous Settings," Papers 2211.08649, arXiv.org, revised Feb 2023.
    13. Jianyu Xu & Yining Wang & Xi Chen & Yu-Xiang Wang, 2025. "Dynamic Pricing with Adversarially-Censored Demands," Papers 2502.06168, arXiv.org, revised Jan 2026.
    14. Max Simchowitz & Aleksandrs Slivkins, 2024. "Exploration and Incentives in Reinforcement Learning," Operations Research, INFORMS, vol. 72(3), pages 983-998, May.
    15. Leon Yang Chu & Qi Feng & J. George Shanthikumar & Zuo-Jun Max Shen & Jian Wu, 2025. "Solving the Price-Setting Newsvendor Problem with Parametric Operational Data Analytics (ODA)," Management Science, INFORMS, vol. 71(8), pages 6627-6646, August.
    16. Ruohan Zhan & Zhimei Ren & Susan Athey & Zhengyuan Zhou, 2024. "Policy Learning with Adaptively Collected Data," Management Science, INFORMS, vol. 70(8), pages 5270-5297, August.
    17. Rong Jin & David Simchi-Levi & Li Wang & Xinshang Wang & Sen Yang, 2021. "Shrinking the Upper Confidence Bound: A Dynamic Product Selection Problem for Urban Warehouses," Management Science, INFORMS, vol. 67(8), pages 4756-4771, August.
    18. Kan Xu & Hamsa Bastani, 2025. "Multitask Learning and Bandits via Robust Statistics," Management Science, INFORMS, vol. 71(9), pages 7752-7773, September.
    19. Adel Javanmard & Jingwei Ji & Renyuan Xu, 2024. "Multi-Task Dynamic Pricing in Credit Market with Contextual Information," Papers 2410.14839, arXiv.org, revised Dec 2025.
    20. Amirnequiee, Shobeir & Naoum-Sawaya, Joe & Pun, Hubert, 2026. "Robust framework for the joint learning of consumer preferences and market segmentation," Omega, Elsevier, vol. 138(C).
