dctian - Overview
SimpleAlphaZero SimpleAlphaZero Public
A simplified implementation of DeepMind's AlphaZero (with Monte Carlo Tree Search, etc), which can be used to train agents to play simple board games, such as Connect4, Tic-Tac-Toe, and Five in a R…