Voting cycles when a dominant point exists
We conduct experiments in which humans repeatedly play one of two games against a computer decision maker that follows either a reinforcement learning or an Experience Weighted Attraction algorithm. Our experiments show these learning algorithms more sensitively detect exploitable opportunities than humans. Also, learning algorithms respond to detected payoff increasing opportunities systematically; however, the responses are too weak to improve the algorithms payoffs. Human play against various decision maker types doesn't significantly vary. These factors lead to a strong linear relationship between the humans and algorithms action choice proportions that is suggestive of the algorithm's best response correspondence.
|Date of creation:||Feb 2005|
|Date of revision:|
|Contact details of provider:|| Postal: |
Phone: (404) 651-3990
Fax: (404) 651-3996
Web page: http://excen.gsu.edu/
More information through EDIRC
When requesting a correction, please mention this item's handle: RePEc:exc:wpaper:2006-16. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (J. Todd Swarthout)
If references are entirely missing, you can add them using this form.