Learn, Implement, Master

Welcome to my blog focused on reinforcement learning with JAX! This resource is designed to help researchers and practitioners understand both the theoretical foundations of reinforcement learning and their efficient implementation using Google's JAX library.

Throughout this blog, you'll find tutorials, code examples, and in-depth explanations of key RL concepts—from basics like Markov Decision Processes and Q-learning to advanced topics such as policy gradient methods and model-based RL. Each article emphasizes practical implementation techniques that leverage JAX's automatic differentiation, vectorization, and GPU/TPU acceleration.

15-week comprehensive curriculum
Video tutorials and code implementations
Practical exercises and projects

Deep Reinforcement Learning

Reinforcement Learning Basics with JAX

Mastering RL algorithms through theory and efficient implementation with Google's high-performance numerical computing library

JAX • PyTree • Neural Networks Q-Learning • Policy Gradients

Ji Sue Lee Hanyang University

Explore Content Watch Tutorial

Key Features

Take advantage of these powerful features to accelerate your learning and implementation

Performance

JAX Acceleration

Learn how to leverage JAX's just-in-time compilation and automatic differentiation to speed up RL algorithm implementation and training.

Learn more

Learning

Interactive Notebooks

Access complete Jupyter notebooks with step-by-step implementations of RL algorithms, complete with detailed explanations and visualizations.

Explore notebooks

Implementation

Complete Code Repository

Access a GitHub repository with clean, modular implementations of all the algorithms discussed in the blog, designed for research and practical applications.

View repository

Learn, Implement, Master

15-Week Course Schedule

Introduction to RL

Markov Decision Processes

Dynamic Programming

Monte Carlo Methods

Temporal Difference Learning

n-step Bootstrapping

Planning and Learning

Function Approximation

Deep Q-Networks

Policy Gradient Methods

Trust Region Methods

Model-Based RL

Exploration vs. Exploitation