Reinforcement learning for long-run average cost
No abstract is available for this item.
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Cassandras, Christos G. & Han, Youngnam, 1992. "Optimal inspection policies for a manufacturing station," European Journal of Operational Research, Elsevier, vol. 63(1), pages 35-53, November.
- Shioyama, Tadayoshi, 1991. "Optimal control of a queuing network system with two types of customers," European Journal of Operational Research, Elsevier, vol. 52(3), pages 367-372, June.
- Tapas K. Das & Abhijit Gosavi & Sridhar Mahadevan & Nicholas Marchalleck, 1999. "Solving Semi-Markov Decision Problems Using Average Reward Reinforcement Learning," Management Science, INFORMS, vol. 45(4), pages 560-574, April.
When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:155:y:2004:i:3:p:654-674. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Dana Niculescu)
If references are entirely missing, you can add them using this form.