Trust Region Policy Optimization

A model-free RL algorithm that ensures stable and efficient policy updates by optimizing within a trust region, limiting the step size to prevent drastic policy changes and improve convergence. ...

November 23, 2022 · 12 min · Trung H. Nguyen

Deep Q-learning

Notes on DQN and its variants. ...

November 18, 2022 · 8 min · Trung H. Nguyen

Natural Evolution Strategies

Natural Evolution Strategies, or NES, are referred to a family of evolution strategies that throughout its generations update a search distribution repeatedly using an estimated gradient of its distribution parameters. ...

October 7, 2022 · 10 min · Trung H. Nguyen

Policy Gradient

Notes on Policy gradient methods. ...

October 6, 2022 · 4 min · Trung H. Nguyen

CMA Evolution Strategy

Covariance Matrix Adaptation Evolution Strategy (CMA-ES) is an evolutionary algorithm for complex non-linear non-convex blackbox optimization problems in continuous domain. ...

September 14, 2022 · 8 min · Trung H. Nguyen

Read-through: Measure theory - the Lebesgue integral

(WIP) Note III of the measure theory series. Materials are mostly taken from Tao’s book, except for some needed notations extracted from Stein’s book. ...

August 21, 2022 · 10 min · Trung H. Nguyen

Linear Models

Notes on using linear models in regression and classification. ...

August 13, 2022 · 32 min · Trung H. Nguyen