posts

Trust Region Policy Optimization

A model-free RL algorithm that ensures stable and efficient policy updates by optimizing within a trust region, limiting the step size to prevent drastic policy changes and improve convergence. ...

Deep Q-learning

Notes on DQN and its variants. ...

Natural Evolution Strategies

Natural Evolution Strategies, or NES, are referred to a family of evolution strategies that throughout its generations update a search distribution repeatedly using an estimated gradient of its distribution parameters. ...

Policy Gradient

Notes on Policy gradient methods. ...

CMA Evolution Strategy

Covariance Matrix Adaptation Evolution Strategy (CMA-ES) is an evolutionary algorithm for complex non-linear non-convex blackbox optimization problems in continuous domain. ...

Read-through: Measure theory - the Lebesgue integral

(WIP) Note III of the measure theory series. Materials are mostly taken from Tao’s book, except for some needed notations extracted from Stein’s book. ...

Linear Models

Notes on using linear models in regression and classification. ...