Gaussian Distribution & Gaussian Network Models

Notes on Gaussian distribution & Gaussian network models. ...

November 22, 2021 · 15 min · Trung H. Nguyen

Power Series

Recall that the previous note, Infinite Series of Constants, frequently mentioned a type of series called power series. In this note, we dive deeper into its details. ...

September 21, 2021 · 15 min · Trung H. Nguyen

Infinite Series of Constants

Notes on infinite series of constants. ...

September 6, 2021 · 20 min · Trung H. Nguyen

Monte Carlo Methods in Reinforcement Learning

Recall that when using Dynamic Programming algorithms to solve RL problems, we assumed complete knowledge of the environment. Monte Carlo methods require only experience: sample sequences of states, actions, and rewards from simulated or real interaction with an environment. ...

August 21, 2021 · 20 min · Trung H. Nguyen

Solving MDPs with Dynamic Programming

In the two previous notes, MDPs and Bellman equations and Optimal Policy Existence, we saw how MDPs and Bellman equations are defined and how they work. In this note, we use Dynamic Programming to solve MDPs. ...

July 25, 2021 · 9 min · Trung H. Nguyen

Optimal Policy Existence

In the previous note on Markov Decision Processes and Bellman equations, we mentioned that there exists a policy $\pi_*$ that is better than or equal to all other policies. In this note, we prove that claim. ...

July 10, 2021 · 7 min · Trung H. Nguyen

Measures

When talking about measure, you might associate it with the idea of length, the measurement of something in one dimension. You would then probably extend that idea to two dimensions with area, or even to three dimensions with volume. ...

July 3, 2021 · 9 min · Trung H. Nguyen