Welcome to my place! 👨‍💻

This is where I document things I’ve learned.

Graph generation with predefined chromatic number

May 19, 2024 · 2 min · Trung H. Nguyen

Temporal consistency loss & Ape-X DQfD

An algorithm built from three components: the transformed Bellman operator, the temporal consistency (TC) loss, and the combination of Ape-X DQN and DQfD, aimed at learning a more consistent human-level policy. ...

March 12, 2024 · 4 min · Trung H. Nguyen

MuZero

January 2, 2024 · 5 min · Trung H. Nguyen

AlphaZero

October 17, 2023 · 11 min · Trung H. Nguyen

Multi-agent Deep Deterministic Policy Gradient

May 25, 2023 · 5 min · Trung H. Nguyen

GAN

Notes on Generative Adversarial Networks. ...

May 1, 2023 · 9 min · Trung H. Nguyen

Read-through: Probabilistic Graphical Models - Learning

February 19, 2023 · 16 min · Trung H. Nguyen