Categorical Reparameterization with Gumbel-Softmax & Concrete Distribution
Notes on using Gumbel-Softmax & Concrete Distribution in Categorical sampling ...
Notes on using Gumbel-Softmax & Concrete Distribution in Categorical sampling ...
Notes on Entropy-Regularized Reinforcement Learning via SQL & SAC ...
Notes on Deterministic Policy Gradient algorithms ...
Notes on policy optimization using trust region method. ...
Notes on DQN and its variants. ...
Natural Evolution Strategies, or NES, are referred to a family of evolution strategies that throughout its generations update a search distribution repeatedly using an estimated gradient of its distribution parameters. ...