|
[1]J. Schmidhuber, “Deep Learning in Neural Networks: An Neural Networks,” Neu-ral Networks, vol 61, pp. 85-117, 2015. [2]Kaelbling. et al., “Reinforcement learning: A survey,” Journal of AI Research, 4 pp. 237-285, 1996. [3]Mnih,V. et al., “Playing Atari with deep reinforcement learning,” NIPS Deep Learn-ing Workshop, 2013. [4]Timothy P. Lillicrap et al., “Continuous Control with Deep Reinforcement Learning,” International Conference on Learning Representations (ICLR), 2016. [5]Sangduck Lee and Woonchul Ham, "Self stabilizing strategy in tracking control of unmanned electric bicycle with mass balance," IEEE/RSJ International Conference on Intelligent Robots and Systems, 2002, pp. 2200-2205 vol.3. [6]M. Yamakita, A. Utano and K. Sekiguchi, "Experimental Study of Automatic Con-trol of Bicycle with Balancer," 2006 IEEE/RSJ International Conference on Intelli-gent Robots and Systems, 2006, pp. 5606-5611. [7]A. Suebsomran, "Balancing control of bicycle robot," 2012 IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems (CYBER), 2012, pp. 69-73. [8]L. P. Tuyen and T. Chung, "Controlling bicycle using deep deterministic policy gra-dient algorithm," 2017 14th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI), 2017, pp. 413-417. [9]G. Belascuen and N. Aguilar, "Design, Modeling and Control of a Reaction Wheel Balanced Inverted Pendulum," 2018 IEEE Biennial Congress of Argentina (AR-GENCON), 2018, pp. 1-9. [10]K Jain, "Artificial neural networks: a tutorial," Computer, vol. 29, No. 3, pp. 31-44, 1996. [11]Watkins, C.J.C.H. (1989).Learning from delayed rewards. PhD Thesis, University of Cambridge, England. [12]Alzubaidi, L., Zhang, J., Humaidi, A.J. et al. “Review of deep learning: concepts, CNN architectures, challenges, applications, future directions.” J Big Data 8, 53 2021. [13]S. Albawi, T. A. Mohammed and S. Al-Zawi, "Understanding of a convolutional neural network," 2017 International Conference on Engineering and Technology (ICET), 2017, pp. 1-6. [14]Mnih,V. et al., “Human-level control through deep reinforcement learning,” Nature, vol 518, pp. 529–533, 2015. [15]Vijay R. Konda and John N. Tsitsiklis, “Actor-Critic Algorithms,” Advances in Neural Information Processing Systems 12, pp. 1008-1014, 1999. [16]Richard S. Sutton et al., “Policy Gradient Methods for Reinforcement Learning with Function Approximation,” Advances in Neural Information Processing Systems 12, pp. 1008-1014, 1999. [17]Unity Technologies, “Unity,” https://unity.com/
|