Reinforcement learning behaviors in sponsored search

My bibliography Save this article

Reinforcement learning behaviors in sponsored search

Author

Listed:

Wei Chen
Tie‐Yan Liu
Xinxin Yang

Registered:

Abstract

This paper is concerned with the modeling of advertiser behaviors in sponsored search. Modeling advertiser behaviors can help search engines better serve advertisers, improve auction mechanism, and forecast future revenue. Previous works on this topic either unrealistically assume advertisers to be able to perceive the states of the sponsored search system and the private information of other advertisers or ignore the differences in advertisers' abilities to optimize their bid strategies. To tackle the problems, we propose viewing sponsored search auctions as partially observable multi‐agent system with private information. Then, we employ a reinforcement learning behavior model to describe how each advertiser responds to this multi‐agent system. The proposed model no longer assumes advertisers to have perfect information access, but instead assumes them to optimize their strategies only based on the partially observed states in the auctions. Furthermore, the model does not specify how the optimization is conducted, but instead uses parameters learned from data to describe different advertisers' abilities in obtaining the optimal strategies. Our experiments on real sponsored search data demonstrate that the proposed model outperforms previous models in predicting the bids and rank positions of the advertisers in the near future. In addition to the accurate prediction of these short‐term behaviors, our study shows another nice property of the proposed model. That is, if all the advertisers behave according to the model, the multi‐agent system of sponsored search will converge to a locally envy‐free equilibrium, under certain conditions. This result establishes a connection between machine‐learned behavior models and game‐theoretic properties of the system. Copyright © 2016 John Wiley & Sons, Ltd.

Suggested Citation

Wei Chen & Tie‐Yan Liu & Xinxin Yang, 2016. "Reinforcement learning behaviors in sponsored search," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 32(3), pages 358-367, May.

Handle: RePEc:wly:apsmbi:v:32:y:2016:i:3:p:358-367
DOI: 10.1002/asmb.2157

Download full text from publisher

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:apsmbi:v:32:y:2016:i:3:p:358-367. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

We have no bibliographic references for this item. You can help adding them by using this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://doi.org/10.1002/(ISSN)1526-4025 .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Reinforcement learning behaviors in sponsored search

Author

Abstract

Suggested Citation

Download full text from publisher

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data