Learn Without Counterfactuals
In this paper we study learning procedures when counterfactuals (payoffs of not-chosen actions) are not observed. The decision maker reasons in two steps: first, she updates her propensity for each action after every payoff experience, where a propensity is defined as how much she prefers that action. Then, she transforms these propensities into choice probabilities. We introduce natural axioms on the way propensities are updated and on the way they are translated into choice, and study the decision maker's behavior when such axioms are in place.
|Date of creation:||Feb 2010|
|Contact details of provider:|| Postal: Department of Economics University of Leicester, University Road. Leicester. LE1 7RH. UK|
Phone: +44 (0)116 252 2887
Fax: +44 (0)116 252 2908
Web page: http://www2.le.ac.uk/departments/economics
|Order Information:|| Web: http://www2.le.ac.uk/departments/economics/research/discussion-papers|
When requesting a correction, please mention this item's handle: RePEc:lec:leecon:10/15. See general information about how to correct material in RePEc.