2024  3

May  1

Graph generation with predefined chromatic number

May 19, 2024 · 2 min · Trung H. Nguyen

March  1

Temporal consistency loss & Ape-X DQfD

March 12, 2024 · 4 min · Trung H. Nguyen

January  1

MuZero

January 2, 2024 · 5 min · Trung H. Nguyen

2023  6

October  1

AlphaZero

October 17, 2023 · 11 min · Trung H. Nguyen

May  2

Multi-agent Deep Deterministic Policy Gradient

May 25, 2023 · 5 min · Trung H. Nguyen

GAN

May 1, 2023 · 9 min · Trung H. Nguyen

February  2

Read-through: Probabilistic Graphical Models - Learning

February 19, 2023 · 16 min · Trung H. Nguyen

Read-through: Probabilistic Graphical Models - Inference

February 2, 2023 · 27 min · Trung H. Nguyen

January  1

Categorical Reparameterization with Gumbel-Softmax & Concrete Distribution

January 2, 2023 · 9 min · Trung H. Nguyen

2022  19

December  3

Maximum Entropy Reinforcement Learning via Soft Q-learning & Soft Actor-Critic

December 27, 2022 · 11 min · Trung H. Nguyen

Read-through: Probabilistic Graphical Models - Representation

December 10, 2022 · 44 min · Trung H. Nguyen

Deterministic Policy Gradients

December 2, 2022 · 12 min · Trung H. Nguyen

November  2

Trust Region Policy Optimization

November 23, 2022 · 12 min · Trung H. Nguyen

Deep Q-learning

November 18, 2022 · 8 min · Trung H. Nguyen

October  2

Natural Evolution Strategies

October 7, 2022 · 10 min · Trung H. Nguyen

Policy Gradient

October 6, 2022 · 4 min · Trung H. Nguyen

September  1

CMA Evolution Strategy

September 14, 2022 · 8 min · Trung H. Nguyen

August  2

Read-through: Measure theory - the Lebesgue integral

August 21, 2022 · 10 min · Trung H. Nguyen

Linear Models

August 13, 2022 · 32 min · Trung H. Nguyen

July  1

Read-through: Measure theory - Lebesgue measure

July 3, 2022 · 22 min · Trung H. Nguyen

June  1

Read-through: Measure theory - Elementary measure, Jordan measure & the Riemann integral

June 16, 2022 · 29 min · Trung H. Nguyen

May  3

Likelihood Ratio Policy Gradient via Importance Sampling

May 25, 2022 · 5 min · Trung H. Nguyen

Planning & Learning

May 19, 2022 · 7 min · Trung H. Nguyen

Policy Gradient Theorem

May 4, 2022 · 8 min · Trung H. Nguyen

April  1

The Exponential Family, Generalized Linear Models

April 4, 2022 · 14 min · Trung H. Nguyen

March  1

Eligible Traces

March 13, 2022 · 25 min · Trung H. Nguyen

February  1

Function Approximation

February 11, 2022 · 21 min · Trung H. Nguyen

January  1

Temporal-Difference Learning

January 31, 2022 · 21 min · Trung H. Nguyen

2021  10

November  1

Gaussian Distribution & Gaussian Network Models

November 22, 2021 · 15 min · Trung H. Nguyen

September  2

Power Series

September 21, 2021 · 15 min · Trung H. Nguyen

Infinite Series of Constants

September 6, 2021 · 20 min · Trung H. Nguyen

August  1

Monte Carlo Methods in Reinforcement Learning

August 21, 2021 · 20 min · Trung H. Nguyen

July  3

Solving MDPs with Dynamic Programming

July 25, 2021 · 9 min · Trung H. Nguyen

Optimal Policy Existence

July 10, 2021 · 7 min · Trung H. Nguyen

Measures

July 3, 2021 · 9 min · Trung H. Nguyen

June  3

Markov Decision Processes, Bellman equations

June 27, 2021 · 5 min · Trung H. Nguyen

Markov Chain

June 19, 2021 · 4 min · Trung H. Nguyen

My very first post

June 5, 2021 · 1 min · Trung H. Nguyen