[1] 李柏儀. “使用三維全景重建法進行電腦視覺導航.” 淡江大學航空太空工程學系碩士班學位論文 (2016): 1-86.[2] Swati Mishra and Pankaj Bande. Maze solving algorithms for micro mouse. Signal Image Technology and Internet Based Systems, 2008. SITIS’08. IEEE International Conference on. IEEE, 2008.
[3] Manoj Sharma. Algorithms for Micro-mouse. Future Computer and Com- munication, 2009. ICFCC 2009. International Conference on. IEEE, 2009.
[4] Cai, Jianping et al. A micromouse maze sovling simulator. Future Com- puter and Communication (ICFCC), 2010 2nd International Conference on. Vol. 3. IEEE, 2010.
[5] Jitin Kumar Goyal and Kuldeep Singh Nagla. A new approach of path planning for mobile robots. Advances in Computing, Communications and Informatics (ICACCI), 2014 International Conference on. IEEE, 2014.
[6] Jianping Cai et al. An algorithm of micromouse maze solving. Computer and Information Technology (CIT), 2010 IEEE 10th International Confer- ence on. IEEE, 2010.
[7] Jie Zhan, Xianchun Li, and Jiawei He. The simulation research of search algorithm for computer mouse maze. Wireless Communications, Network-ing and Mobile Computing (WiCOM 2014), 10th International Conference on. IET, 2014.
[8] Richard Bellman. A Markovian decision process. Journal of Mathematics and Mechanics. 6. 1957.
[9] Richard Bellman. Dynamic Programming. Princeton university press, 1957.
[10] Chris Watkins and Peter Dayan. Q-learning. Machine learning 8.3-4 (1992): 279-292.
[11] Walter Pullen. Maze Classification. from http://www.astrolog.org/ labyrnth/algrithm.htm
[12] José Vidal. Fundamentals of multiagent systems: using netLogo models. system (2006).
[13] Stuart Russell and Peter Norvig. 人工智慧-現代方法(歐崇明、時文中、 陳龍譯)(台北:培生教育,2011)。
[14] Chris Watkins. Learning from delayed rewards. Diss. University of Cam- bridge, 1989.
[15] Tim Eden, Anthony Knittel and Raphael van Uffelen. Reinforcement Learning. from http://www.cse.unsw.edu.au/~cs9417ml/RL1/index.html
[16] David Poole and Mackworth Alan. Artificial Intelligence: foundations of computational agents Cambridge University Press, 2010.
[17] Mehmet Hacibeyoglu and Ahmet Arslan. Reinforcement learning accel- erated with artificial neural network for maze and search problems. 3rd International Conference on Human System Interaction. IEEE, 2010.
[18] Tom Schaul et al. PyBrain. Journal of Machine Learning Research 11.Feb (2010): 743-746.
[19] David Silver. Markov Decision Processes. ULC, 2015
[20] Python Software Foundation. Python Documentation. from https://docs. python.org/3/