Author (English): Xain-Dong Chiu
Thesis title (English): Use Deep Reinforcement learning to enhance the power of a Minishogi program
Advisor (English): Shi-Jim Yen
Committee members (English): Wen-Cheng Lin, Min-Xiou Chen
Keywords (English): AlphaZero, Artificial Intelligence, Computer Games, MiniShogi, Monte-Carlo Tree Search, Machine Learning
Advances in hardware have made it practical to implement more and more algorithms, many with fruitful results. For example, AlphaGo, developed by Google DeepMind, was the first computer Go program to dominate the world of Go. However, AlphaGo relied on a very large amount of human knowledge to assist its learning, such as game records of professional players and human-annotated board information. Google DeepMind later proposed AlphaZero, which requires no human knowledge and is trained relying only on the MCTS method. Unfortunately, it has not been open-sourced. This thesis therefore attempts to implement the AlphaZero method for the game of Minishogi, based on the information disclosed in the papers published by Google DeepMind.
