Evaluating generalizability and parameter consistency in learning models
A new evaluation method is proposed for comparing learning models used for predicting decisions based on experience. The method is based on the generalization of models' predictions at the individual level. First, it evaluates the ability to make a priori predictions for decisions in new tasks using parameters from different tasks performed by an individual decision-maker. Second, it evaluates the consistency of parameters estimated in different tasks performed by the same person. We use this method to examine two rules for updating past experience with payoff feedback: The Delta rule, where only the chosen option is updated; and a Decay-Reinforcement rule, where additionally, non-chosen options are discounted. The results reveal that although the Decay-Reinforcement rule fits the data better, it has poor generality and parameter consistency at the individual level. The current method thus improves the ability to select models based on their correspondence to consistent characteristics within individual decision-makers.
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Kahneman, Daniel & Tversky, Amos, 1979.
"Prospect Theory: An Analysis of Decision under Risk,"
Econometric Society, vol. 47(2), pages 263-91, March.
- Amos Tversky & Daniel Kahneman, 1979. "Prospect Theory: An Analysis of Decision under Risk," Levine's Working Paper Archive 7656, David K. Levine.
- Sarin, Rajiv & Vahid, Farshid, 1999. "Payoff Assessments without Probabilities: A Simple Dynamic Model of Choice," Games and Economic Behavior, Elsevier, vol. 28(2), pages 294-309, August.
- Stahl, Dale O., 1996. "Boundedly Rational Rule Learning in a Guessing Game," Games and Economic Behavior, Elsevier, vol. 16(2), pages 303-330, October.
- Sarin, Rajiv & Vahid, Farshid, 2001.
"Predicting How People Play Games: A Simple Dynamic Model of Choice,"
Games and Economic Behavior,
Elsevier, vol. 34(1), pages 104-122, January.
- Sarin, R. & Vahid, F., 1999. "Predicting how People Play Games: a Simple Dynamic Model of Choice," Monash Econometrics and Business Statistics Working Papers 12/99, Monash University, Department of Econometrics and Business Statistics.
- Richard H. Thaler & Amos Tversky & Daniel Kahneman & Alan Schwartz, 1997. "The Effect of Myopia and Loss Aversion on Risk Taking: An Experimental Test," The Quarterly Journal of Economics, Oxford University Press, vol. 112(2), pages 647-661.
- Colin Camerer & Teck-Hua Ho, 1999. "Experience-weighted Attraction Learning in Normal Form Games," Econometrica, Econometric Society, vol. 67(4), pages 827-874, July.
- Drew Fudenberg & David K. Levine, 1996.
"Consistency and Cautious Fictitious Play,"
Levine's Working Paper Archive
470, David K. Levine.
- Erev, Ido & Bereby-Meyer, Yoella & Roth, Alvin E., 1999. "The effect of adding a constant to all payoffs: experimental investigation, and implications for reinforcement learning models," Journal of Economic Behavior & Organization, Elsevier, vol. 39(1), pages 111-128, May.
- Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-81, September.
- Cheung, Yin-Wong & Friedman, Daniel, 1997. "Individual Learning in Normal Form Games: Some Laboratory Results," Games and Economic Behavior, Elsevier, vol. 19(1), pages 46-76, April.
When requesting a correction, please mention this item's handle: RePEc:eee:gamebe:v:63:y:2008:i:1:p:370-394. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Shamier, Wendy)
If references are entirely missing, you can add them using this form.