Markov Decision Processes, Bellman equations

You may have known or heard vaguely about a computer program called AlphaGo - the AI has beaten Lee Sedol - the winner of 18 world Go titles. One of the techniques it used is called self-play against its other instances, with Reinforcement Learning. ...

June 27, 2021 · 5 min · Trung H. Nguyen

Markov Chain

If we have to describe the definition of Markov chain in one statement, it will be: “It only matters where you are, not where you’ve been”. ...

June 19, 2021 · 4 min · Trung H. Nguyen

My very first post

Enjoy my index-zero-ed note while staying tuned for next ones! ...

June 5, 2021 · 1 min · Trung H. Nguyen