Multi-agent Deep Deterministic Policy Gradient

May 25, 2023 · 5 min · Trung H. Nguyen

Maximum Entropy Reinforcement Learning via Soft Q-learning & Soft Actor-Critic

Notes on Entropy-Regularized Reinforcement Learning via SQL & SAC ...

December 27, 2022 · 11 min · Trung H. Nguyen

Deterministic Policy Gradients

The generalization of policy gradient theorems to the deterministic case, and the corresponding policy gradient algorithms. ...

December 2, 2022 · 12 min · Trung H. Nguyen

Trust Region Policy Optimization

A model-free RL algorithm that ensures stable and efficient policy updates by optimizing within a trust region, limiting the step size to prevent drastic policy changes and improve convergence. ...

November 23, 2022 · 12 min · Trung H. Nguyen

Deep Q-learning

Notes on DQN and its variants. ...

November 18, 2022 · 8 min · Trung H. Nguyen