|
[1] Game complexity, https://en.wikipedia.org/wiki/Game_complexity。 [2] Mark H.M. Winands, The Surakarta Bot Revealed, 2015, Games and AI Group, Department of Data Science and Knowledge Engineering, Maastricht University, Maastricht, The Netherlands. [3] David Silver*, Julian Schrittwieser*, Karen Simonyan*, Ioannis Antonoglou, Aja Huang, Arthur, Guez, Thomas Hubert, Lucas Baker, Matthew Lai, Adrian Bolton, Yutian Chen, Timothy, Lillicrap, Fan Hui, Laurent Sifre, George van den Driessche, Thore Graepel, Demis Hassabis, Mastering the Game of Go without Human Knowledge, Nature, 2017, DeepMind. [4] Surag Nair, https://github.com/suragnair/alpha-zero-general, Stanford University. [5] Surag Nair, Shantanu Thakoor, Megha Jhunjhunwala, Learning to Play Othello Without Human Knowledge, 2017, Stanford University. [6] 詹傑淳,電腦圍棋打劫的最佳策略之研究 A Study of Optimal Strategies for Ko Fight of Computer Go, 2010。 [7] Yen-Chi Chen, Jia-Fong Yeh and Shun-Shii Lin, “Design and Implementation Aspects of a Surakarta Program,” ICGA Journal, vol.40, no.4, pp.438-449, March 25, 2019. [8] Chih-Yu Kao, Kuei-Ting Kuo, “應用N-Tuple網路之Surakarta棋人工智慧”, TCGA 2019. [9] Lessons from Implementing AlphaZero, Aditya Prasad, https://medium.com/oracledevs/lessons-from-implementing-alphazero-7e36e9054191. [10] David Silver, Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Matthew Lai, Arthur Guez, Marc Lanctot, Laurent Sifre, Dharshan Kumaran, Thore Graepel, Timothy Lillicrap, Karen Simonyan, Demis Hassabis, Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm, Nature, 2017, DeepMind. [11] David Silver, UCL Course on RL, http://www0.cs.ucl.ac.uk/staff/D.Silver/web/Teaching.html. [12] D.P. Kingma and J. Ba. Adam: A method for stochastic optimization. The International Conference on Learning Representations (ICLR), 2015.
|