Academy - MLQ.ai (Page 2)

Fundamentals of Reinforcement Learning: Estimating the Action-Value Function

In this article, we introduce fundamental concepts of reinforcement learning, including the k-armed bandit problem, estimating the action-value function, and the exploration vs. exploitation dilemma.

By Peter Foy

• 5 years ago

Reinforcement Learning

Implementing Deep Reinforcement Learning with PyTorch: Deep Q-Learning

In this article we will look at several implementations of deep reinforcement learning with PyTorch.

By Peter Foy

• 5 years ago

Reinforcement Learning

Deep Reinforcement Learning: Twin Delayed DDPG Algorithm

In this article we review Twin Delayed DDPG, a deep reinforcement learning algorithm that can be applied to continuous action spaces.

By Peter Foy

• 6 years ago
