Read-through: Probabilistic Graphical Models - Inference

February 2, 2023 · 27 min · Trung H. Nguyen

Categorical Reparameterization with Gumbel-Softmax & Concrete Distribution

Notes on using Gumbel-Softmax & Concrete Distribution in Categorical sampling ...

January 2, 2023 · 9 min · Trung H. Nguyen

Maximum Entropy Reinforcement Learning via Soft Q-learning & Soft Actor-Critic

Notes on Entropy-Regularized Reinforcement Learning via SQL & SAC ...

December 27, 2022 · 11 min · Trung H. Nguyen

Read-through: Probabilistic Graphical Models - Representation

December 10, 2022 · 44 min · Trung H. Nguyen

Deterministic Policy Gradients

Notes on Deterministic Policy Gradient algorithms ...

December 2, 2022 · 12 min · Trung H. Nguyen

Trust Region Policy Optimization

Notes on policy optimization using trust region method. ...

November 23, 2022 · 12 min · Trung H. Nguyen

Deep Q-learning

Notes on DQN and its variants. ...

November 18, 2022 · 8 min · Trung H. Nguyen