MuZero

January 2, 2024 · 6 min · Trung H. Nguyen

AlphaGo, AlphaGo Zero, AlphaZero

Model-based RL methods that use Monte Carlo Tree Search for planning and ultilize self-play mechanism for training. ...

October 17, 2023 · 11 min · Trung H. Nguyen