Deep reinforcement learning became prominent because of the work of combining Q-learning with DL. The combination is known as deep Q-learning or DQN for Deep Q Network. This algorithm has powered some of the cutting edge examples of DRL, when Google DeepMind used it to make classic Atari games better than humans in 2012. There are many implementations of this algorithm, and Google has even patented it. The current consensus is that Google patented such a base algorithm in order to thwart patent trolls striking at little guys or developers building commercial applications with DQN. It is unlikely that Google would exercise this legally or that it would have to since this algorithm is no longer considered state of the art.

Hands-On Reinforcement Learning for Games
By :

Hands-On Reinforcement Learning for Games
By:
Overview of this book
With the increased presence of AI in the gaming industry, developers are challenged to create highly responsive and adaptive games by integrating artificial intelligence into their projects. This book is your guide to learning how various reinforcement learning techniques and algorithms play an important role in game development with Python.
Starting with the basics, this book will help you build a strong foundation in reinforcement learning for game development. Each chapter will assist you in implementing different reinforcement learning techniques, such as Markov decision processes (MDPs), Q-learning, actor-critic methods, SARSA, and deterministic policy gradient algorithms, to build logical self-learning agents. Learning these techniques will enhance your game development skills and add a variety of features to improve your game agent’s productivity. As you advance, you’ll understand how deep reinforcement learning (DRL) techniques can be used to devise strategies to help agents learn from their actions and build engaging games.
By the end of this book, you’ll be ready to apply reinforcement learning techniques to build a variety of projects and contribute to open source applications.
Table of Contents (19 chapters)
Preface
Section 1: Exploring the Environment
Understanding Rewards-Based Learning
Dynamic Programming and the Bellman Equation
Monte Carlo Methods
Temporal Difference Learning
Exploring SARSA
Section 2: Exploiting the Knowledge
Going Deep with DQN
Going Deeper with DDQN
Policy Gradient Methods
Optimizing for Continuous Control
All about Rainbow DQN
Exploiting ML-Agents
DRL Frameworks
Section 3: Reward Yourself
3D Worlds
From DRL to AGI
Other Books You May Enjoy
How would like to rate this book
Customer Reviews