mcts
AlphaGo, AlphaGo Zero, AlphaZero
Model-based RL methods that use Monte Carlo Tree Search for planning and utilize a self-play mechanism for training. ...
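As a rough illustration of the planning step, the sketch below shows a PUCT-style selection rule of the kind AlphaGo Zero-style search uses to decide which action to explore next: each action is scored by its mean value Q plus an exploration bonus that grows with the policy prior and shrinks with the visit count. The `Edge` class, the function names, and the default `c_puct = 1.0` are assumptions made for this example and are not taken from any particular implementation.

```python
import math
from dataclasses import dataclass


@dataclass
class Edge:
    """Search statistics for one action at a tree node (illustrative names)."""
    prior: float            # P(s, a): prior probability from a policy network
    visits: int = 0         # N(s, a): number of times search took this action
    value_sum: float = 0.0  # W(s, a): sum of values backed up through the edge

    @property
    def q(self) -> float:
        # Mean value Q(s, a); defined as 0 for an unvisited edge.
        return self.value_sum / self.visits if self.visits else 0.0


def puct_score(edge: Edge, parent_visits: int, c_puct: float = 1.0) -> float:
    """Q(s, a) + c_puct * P(s, a) * sqrt(sum_b N(s, b)) / (1 + N(s, a))."""
    bonus = c_puct * edge.prior * math.sqrt(parent_visits) / (1 + edge.visits)
    return edge.q + bonus


def select_action(edges: dict, c_puct: float = 1.0):
    """Return the action with the highest PUCT score at this node."""
    parent_visits = sum(e.visits for e in edges.values())
    return max(edges, key=lambda a: puct_score(edges[a], parent_visits, c_puct))


def backup(edge: Edge, value: float) -> None:
    """Record one simulation result on the edge that was traversed."""
    edge.visits += 1
    edge.value_sum += value


# Hypothetical statistics for a node with two legal actions.
edges = {
    "a": Edge(prior=0.5, visits=3, value_sum=1.5),
    "b": Edge(prior=0.5, visits=1, value_sum=0.0),
}
best = select_action(edges)
```

In a full search loop, selection is repeated from the root down to a leaf, the leaf is evaluated, and `backup` is applied to every edge on the path. Self-play training then uses the root visit counts as the policy target.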