Reinforcement learning for long-run average cost
No abstract is available for this item.
If you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
As the access to this document is restricted, you may want to look for a different version under "Related research" (further below) or search for a different version of it.
References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Cassandras, Christos G. & Han, Youngnam, 1992. "Optimal inspection policies for a manufacturing station," European Journal of Operational Research, Elsevier, vol. 63(1), pages 35-53, November.
- Tapas K. Das & Abhijit Gosavi & Sridhar Mahadevan & Nicholas Marchalleck, 1999. "Solving Semi-Markov Decision Problems Using Average Reward Reinforcement Learning," Management Science, INFORMS, vol. 45(4), pages 560-574, April.
- Shioyama, Tadayoshi, 1991. "Optimal control of a queuing network system with two types of customers," European Journal of Operational Research, Elsevier, vol. 52(3), pages 367-372, June.
When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:155:y:2004:i:3:p:654-674. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Dana Niculescu)
If references are entirely missing, you can add them using this form.