Evaluating generalizability and parameter consistency in learning models
A new evaluation method is proposed for comparing learning models used for predicting decisions based on experience. The method is based on the generalization of models' predictions at the individual level. First, it evaluates the ability to make a priori predictions for decisions in new tasks using parameters from different tasks performed by an individual decision-maker. Second, it evaluates the consistency of parameters estimated in different tasks performed by the same person. We use this method to examine two rules for updating past experience with payoff feedback: The Delta rule, where only the chosen option is updated; and a Decay-Reinforcement rule, where additionally, non-chosen options are discounted. The results reveal that although the Decay-Reinforcement rule fits the data better, it has poor generality and parameter consistency at the individual level. The current method thus improves the ability to select models based on their correspondence to consistent characteristics within individual decision-makers.
If you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
As the access to this document is restricted, you may want to look for a different version under "Related research" (further below) or search for a different version of it.
References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Sarin, R. & Vahid, F., 1999.
"Predicting how People Play Games: a Simple Dynamic Model of Choice,"
Monash Econometrics and Business Statistics Working Papers
12/99, Monash University, Department of Econometrics and Business Statistics.
- Sarin, Rajiv & Vahid, Farshid, 2001. "Predicting How People Play Games: A Simple Dynamic Model of Choice," Games and Economic Behavior, Elsevier, vol. 34(1), pages 104-122, January.
- Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
- Richard H. Thaler & Amos Tversky & Daniel Kahneman & Alan Schwartz, 1997. "The Effect of Myopia and Loss Aversion on Risk Taking: An Experimental Test," The Quarterly Journal of Economics, Oxford University Press, vol. 112(2), pages 647-661.
- Fudenberg, Drew & Levine, David K., 1995.
"Consistency and cautious fictitious play,"
Journal of Economic Dynamics and Control,
Elsevier, vol. 19(5-7), pages 1065-1089.
- Colin Camerer & Teck-Hua Ho, 1999. "Experience-weighted Attraction Learning in Normal Form Games," Econometrica, Econometric Society, vol. 67(4), pages 827-874, July.
- Stahl, Dale O., 1996. "Boundedly Rational Rule Learning in a Guessing Game," Games and Economic Behavior, Elsevier, vol. 16(2), pages 303-330, October.
- Amos Tversky & Daniel Kahneman, 1979.
"Prospect Theory: An Analysis of Decision under Risk,"
Levine's Working Paper Archive
7656, David K. Levine.
- Kahneman, Daniel & Tversky, Amos, 1979. "Prospect Theory: An Analysis of Decision under Risk," Econometrica, Econometric Society, vol. 47(2), pages 263-291, March.
- Cheung, Yin-Wong & Friedman, Daniel, 1997. "Individual Learning in Normal Form Games: Some Laboratory Results," Games and Economic Behavior, Elsevier, vol. 19(1), pages 46-76, April.
- Erev, Ido & Bereby-Meyer, Yoella & Roth, Alvin E., 1999. "The effect of adding a constant to all payoffs: experimental investigation, and implications for reinforcement learning models," Journal of Economic Behavior & Organization, Elsevier, vol. 39(1), pages 111-128, May.
- Sarin, Rajiv & Vahid, Farshid, 1999. "Payoff Assessments without Probabilities: A Simple Dynamic Model of Choice," Games and Economic Behavior, Elsevier, vol. 28(2), pages 294-309, August.
When requesting a correction, please mention this item's handle: RePEc:eee:gamebe:v:63:y:2008:i:1:p:370-394. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Dana Niculescu)
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If references are entirely missing, you can add them using this form.
If the full references list an item that is present in RePEc, but the system did not link to it, you can help with this form.
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your profile, as there may be some citations waiting for confirmation.
Please note that corrections may take a couple of weeks to filter through the various RePEc services.