IDEAS home Printed from https://ideas.repec.org/a/wly/apsmbi/v32y2016i3p358-367.html
   My bibliography  Save this article

Reinforcement learning behaviors in sponsored search

Author

Listed:
  • Wei Chen
  • Tie‐Yan Liu
  • Xinxin Yang

Abstract

This paper is concerned with the modeling of advertiser behaviors in sponsored search. Modeling advertiser behaviors can help search engines better serve advertisers, improve auction mechanism, and forecast future revenue. Previous works on this topic either unrealistically assume advertisers to be able to perceive the states of the sponsored search system and the private information of other advertisers or ignore the differences in advertisers' abilities to optimize their bid strategies. To tackle the problems, we propose viewing sponsored search auctions as partially observable multi‐agent system with private information. Then, we employ a reinforcement learning behavior model to describe how each advertiser responds to this multi‐agent system. The proposed model no longer assumes advertisers to have perfect information access, but instead assumes them to optimize their strategies only based on the partially observed states in the auctions. Furthermore, the model does not specify how the optimization is conducted, but instead uses parameters learned from data to describe different advertisers' abilities in obtaining the optimal strategies. Our experiments on real sponsored search data demonstrate that the proposed model outperforms previous models in predicting the bids and rank positions of the advertisers in the near future. In addition to the accurate prediction of these short‐term behaviors, our study shows another nice property of the proposed model. That is, if all the advertisers behave according to the model, the multi‐agent system of sponsored search will converge to a locally envy‐free equilibrium, under certain conditions. This result establishes a connection between machine‐learned behavior models and game‐theoretic properties of the system. Copyright © 2016 John Wiley & Sons, Ltd.

Suggested Citation

  • Wei Chen & Tie‐Yan Liu & Xinxin Yang, 2016. "Reinforcement learning behaviors in sponsored search," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 32(3), pages 358-367, May.
  • Handle: RePEc:wly:apsmbi:v:32:y:2016:i:3:p:358-367
    DOI: 10.1002/asmb.2157
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/asmb.2157
    Download Restriction: no

    File URL: https://libkey.io/10.1002/asmb.2157?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:apsmbi:v:32:y:2016:i:3:p:358-367. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://doi.org/10.1002/(ISSN)1526-4025 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.