|
[1] D. Silver et al. (2017). Mastering the game of go without human knowledge. Nature, 550:354– 359. [2] Silver, D., Hubert, T., Schrittwieser, J., Antonoglou, I., Lai, M., Guez, A., ... & Lillicrap, T. (2017). Mastering chess and shogi by self-play with a general reinforcement learning algorithm. arXiv preprint arXiv:1712.01815. [3] Julian Schrittwieser, Ioannis Antonoglou, Thomas Hubert, Karen Simonyan, Laurent Sifre, Simon Schmitt, Arthur Guez, Edward Lockhart, et al. (2019). Mastering atari, go, chess and shogi by planning with a learned model. arXiv preprint arXiv:1911.08265. [4] Y. LeCun, L. Bottou, Y. Bengio, P. Haffner et al. (1998). Gradient-based learning. applied to document recognition. Proceedings of the IEEE, vol. 86, no. 11, pp. 2278–2324. [5] D. E. Knuth and R. W. Moore (1975). An analysis of alpha-beta pruning. Artificial. Intelligence, vol. 6, no. 4, pp. 293-326. [6] L. Kocsis, and C. Szepesvari (2006). Bandit based monte-carlo planning. In 15th. European Conference on Machine Learning, pages 282-293. [7] Silver, D., Huang, A., Maddison, C. J., Guez, A., Sifre, L., Van Den Driessche, G., ... & Dieleman, S. (2016). Mastering the game of Go with deep neural networks and tree search. Nature, 529(7587), 484. [8] Coulom, R (2006). Efficient selectivity and backup operators in Monte-Carlo tree search. In 5th International Conference on Computers and Games, 72–83. [9] Y. Tian, J. Ma, Q. Gong, S. Sengupta, Z. Chen, J. Pinkerton and C. L. Zitnick (2019). ELF OpenGo: An Analysis and Open Reimplementation of AlphaZero. CoRR, vol. abs/1902.04522. [10] A. Zobrist (1970). A new hashing method with application for game playing. Technical Report 88, Univ. of Wisconsin. [11] Cracraft, S.M (1984). Bitmap Move Generation in Chess. ICCA Journal, Vol. 7, No. 3, pp. 146- 153. ISSN 0920-234X. [12] Reversi, WIKIPEDIA, https://en.wikipedia.org/wiki/Reversi [13] Campbell, M., Hoane, A., & Hsu, F. (2002). Deep Blue. Artificial Intelligence, 134(1-2), 57-83. doi:10.1016/s0004-3702(01)00129-1 [14] Guillaume, M. J. B. C., Mark, H. M. W., Herik, H. J. v. d., Jos, W. H. M. U., & Bruno,B. (2008). Progressive Strategies for Monte-Carlo Tree Search. New Mathematics and Natural Computation, 4(3). [15] Gelly, S., Wang, Y., Munos, R., Teytaud, O. (2006). Modification of UCT with Patterns in Monte-Carlo Go. Technical Report 6062, INRIA. [16] https://github.com/pytorch [17] M. Buro (1997). The Othello Match of the Year: Takeshi Murakami vs. Logistello. ICCA. Journal, vol. 20, no. 3, pp. 189-193. |