|
[1] Leslie Pack Kaelbling, Michael L. Littman, and Andrew W. Moore. Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4:237-285, May 1996.
[2] Richard S. Sutton and Andrew G. Barto. Reinforcement Learning: An Introduction. MIT Press/Bradford Books, March 1998.
[3] C. J. C. H. Watkins. Learning from Delayed Rewards. PhD thesis, Psychology Department, Cambridge University, 1989.
[4] C. J. C. H. Watkins and P. Dayan. Q-learning. Machine Learning, 8:279-292, 1992.
[5] Y. Takahashi, M. Takeda, and M. Asada, “Continuous Valued Q-learning for Vision-Guided Behavior Acquisition,” Proceedings of 1999 IEEE/SICE/RSJ International Conference on Multisensor Fusion and Integration for Intelligent Systems, pp. 255-260, 1999.
[6] V. Gullapalli, “A stochastic reinforcement learning algorithm for learning real-valued functions,” Neural Networks, Vol. 3, pp. 671-692, 1990.
[7] V. Gullapalli, “Associative reinforcement learning of real-valued functions,” Proc. IEEE, Syst., Man, Cybern., Charlottesville, VA, Oct. 1991.
[8] A. G. Barto, R. S. Sutton, and C. W. Anderson, “Neuronlike adaptive elements that can solve difficult learning control problems,” IEEE Trans. Syst., Man, Cybern., Vol. SMC-13, pp. 834-846, 1983.
[9] Athanasios V. Vasilakos, Nikolaos H. Loukas, and Konstantinos C. Zikidis, “A.N.A.S.A. II: A Novel, Real-Valued, Reinforcement Algorithm for Neural Unit / Network,” Proceedings of 1993 International Joint Conference on Neural Networks.
[10] Aly El-Osery and Mo Jamshidi, “A Stochastic Learning Automaton Based Autonomous Control Robotic Agents,” Autonomous Control Engineering Center (ACE), University of New Mexico.
[11] Krzysztof Patan and Thomas Parisini, “Stochastic learning methods for dynamic neural networks: simulated and real-data comparisons,” Proceedings of the American Control Conference, Anchorage, AK, May 8-10, 2002.
[12] Masayuki Yamamura and Takashi Onozuka, “Reinforcement Learning with Knowledge by using a Stochastic Gradient Method on a Bayesian Network,” 0-7803-4859-1/98, 1998 IEEE.
[13] V. Paraskevopoulos, M. I. Heywood, and C. R. Chatwin, “Modular SRV Reinforcement Learning: An architecture for non-linear control,” 0-7803-4859-1/98, 1998 IEEE.
[14] Sadayoshi Mikami and Yukinori Kakazu, “Extended Stochastic Reinforcement Learning for the Acquisition of Cooperative Motion Plans for Dynamically Constrained Agents,” manuscript received July 15, 1993.
[15] J.-S. R. Jang, C.-T. Sun, and E. Mizutani, “Neuro-Fuzzy and Soft Computing,” Prentice Hall, Upper Saddle River, NJ 07458, 1997.
[16] Georgios I. Papadimitriou, “A New Approach to the Design of Reinforcement Schemes for Learning Automata: Stochastic Estimator Learning Algorithms,” IEEE Transactions on Knowledge and Data Engineering, Vol. 6, No. 4, August 1994.
[17] R. A. Leaver and P. Mars, “Stochastic Computing and Reinforcement Neural Networks,” British Aerospace plc, U.K., University of Durham, U.K.
[18] Tapas K. Das, Abhijit Gosavi, Sridhar Mahadevan, and Nicholas Marchalleck, “Solving Semi-Markov Decision Problems Using Average Reward Reinforcement Learning,” Management Science, Vol. 45, No. 4, April 1999.
[19] Nils J. Nilsson, “Introduction to Machine Learning,” Artificial Intelligence Laboratory, Department of Computer Science, Stanford University, Stanford, CA 94305.
|