Posts
Read-through: Probabilistic Graphical Models - Inference
Categorical Reparameterization with Gumbel-Softmax & Concrete Distribution
Notes on using Gumbel-Softmax & Concrete Distribution for categorical sampling ...
Maximum Entropy Reinforcement Learning via Soft Q-learning & Soft Actor-Critic
Notes on Entropy-Regularized Reinforcement Learning via SQL & SAC ...
Read-through: Probabilistic Graphical Models - Representation
Deterministic Policy Gradients
Notes on Deterministic Policy Gradient algorithms ...
Trust Region Policy Optimization
Notes on policy optimization using the trust region method. ...