|
[1] Barto, A.G., R.S. Sutton, C.W. Anderson (1983). “Neuronlike adaptive elements that can solve difficult learning control problems.” IEEE Trans. Syst., Man, Cybern., 13, 834-846.
[2] Barto, A.G., P. Anandan (1985). “Pattern-recognizing stochastic learning automata.” IEEE Trans. Syst., Man, Cybern., 15, 360-375.
[3] Haykin, S. (1994). Neural networks: A comprehensive foundation. NJ: Prentice-Hall.
[4] Miller, W.T., R.S. Sutton, P.J. Werbos, eds. (1990). Neural networks for control. The MIT Press.
[5] Narendra, K.S., M.A.L. Thathachar (1989). Learning automata: An introduction. Englewood Cliffs, NJ: Prentice-Hall.
[6] Narendra, K.S., M.A.L. Thathachar (1974). “Learning automata - A survey.” IEEE Trans. Syst., Man, Cybern., vol. SMC-4, no. 4.
[7] Narendra, K.S., S. Lakshmivarahan (1977). “Learning automata - A critique.” Journal of Cybernetics and Information Science, 1, 53-56.
[8] Patterson, D.W. (1995). Artificial neural networks: Theory and applications. Singapore: Prentice-Hall.
[9] Pavlov, I.P. (1927). Conditioned reflexes. London: Oxford Univ. Press.
[10] Rumelhart, D.E., G.E. Hinton, R.J. Williams (1986). “Learning representations by back-propagating errors.” Nature, 323, 533-536.
[11] Robinson, A.J., F. Fallside (1988). “Static and dynamic error propagation networks with application to speech coding.” Neural Information Processing Systems (Denver 1987), ed. D.Z. Anderson, 632-641. New York: American Institute of Physics.
[12] Ross, S. (1983). Introduction to stochastic dynamic programming. San Diego: Academic Press.
[13] Samuel, A.L. (1959). “Some studies in machine learning using the game of checkers.” IBM Journal of Research and Development, 3, 210-229.
[14] Sutton, R.S. (1988). “Learning to predict by the methods of temporal differences.” Machine Learning, 3, 9-44.
[15] Sutton, R.S. (1992). “Introduction: The challenge of reinforcement learning.” Machine Learning, 8, 225-227.
[16] Watkins, C.J.C.H., P. Dayan (1992). “Q-learning.” Machine Learning, 8, 279-292.
[17] Werbos, P.J. (1988). “Generalization of backpropagation with application to a recurrent gas market model.” Neural Networks, 1, 339-356.
[18] Chande, T.S. (1996). 最新技術分析指標 [The latest technical analysis indicators]. Taipei: 寰宇出版公司.
|