This file is part of IDEAS, which uses RePEc data


[ Papers | Articles | Software | Books | Chapters | Authors | Institutions | JEL Classification | NEP reports | Search | New papers by email | Author registration | Rankings | Volunteers | FAQ | Blog | Help! ]

Catching a Baseball: A Reinforcement Learning Perspective Using a Neural Network

Author info | Abstract | Publisher info | Download info | Related research | Statistics
Author Info
Rajarshi Das
Abstract

Moments after a baseball batter has hit a fly ball, an outfielder has to decide whether to run forward or backward to catch the ball. Judging a fly ball is a difficult task, especially when the fielder is in the plane of the ball's trajectory. There exists several alternative hypotheses in the literature which identify different perceptual features available to the fielder that may provide useful cues as to the location of the ball's landing point. A recent study in experimental psychology suggests that to intercept the ball, the fielder has to run such that the double derivative of $\tan\phi$ with respect to time is close to zero $d^2(\tan\phi)/dt^2\approx o$. Where $\phi$ is the elevation angle of the ball from the fielder's perspective (MCLeod \& Dlenes 1993). We investigate whether $d^2(\tan\phi)/dt^2$ information is a useful cue to learn this task in the Adaptive Heuristic Critic (${\cal AHC}$) reinforcement learning framework. Our results provide supporting evidence that $d^2(\tan\phi)/dt^2$ information furnishes strong initial cue in determinimg the landing point of the ball and plays a key role in the learning process. However, our simulations show that during later stages of the ball's flight, yet another perceptual feature, the perpendicular velocity of the ball $(v_p)$ with respect to the fielder, provides stronger cues as to the location of the landing point. The trained network generalized to novel circumstances and also exhibited some of the characteristic behavior that has been recorded by experimental psychologists among experienced fielders. We believe learning approaches to learn common physical tasks, and similarly motivated work could stimulate useful interdisciplinary research on the subject.

Download Info
To our knowledge, this item is not available for download. To find whether it is available, there are three options:
1. Check below under "Related research" whether another version of this item is available online.
2. Check on the provider's web page whether it is in fact available.
3. Perform a search for a similarly titled item that would be available.

Publisher Info
Paper provided by Santa Fe Institute in its series Working Papers with number 94-04-022.

Download reference. The following formats are available: HTML (with abstract), plain text (with abstract), BibTeX, RIS (EndNote, RefMan, ProCite), ReDIF
Length:
Date of creation: Apr 1994
Date of revision:
Handle: RePEc:wop:safiwp:94-04-022

Contact details of provider:
Postal: 1399 Hyde Park Road, Santa Fe, New Mexico 87501
Web page: http://www.santafe.edu/sfi/publications/working-papers.html
More information through EDIRC

For technical questions regarding this item, or to correct its listing, contact: (Thomas Krichel).

Related research
Keywords:

Statistics
Access and download statistics

Did you know? You too can volunteer for RePEc, for example by encouraging others to register as authors.

This page was last updated on 2009-12-16.


This information is provided to you by IDEAS at the Department of Economics, College of Liberal Arts and Sciences, University of Connecticut using RePEc data on a server sponsored by the Society for Economic Dynamics.