Sign In Start Free Trial

Book Overview & Buying
Table Of Contents
Feedback & Rating

Hands-On Deep Learning for Games

By : Micheal Lanham

3 (2)

Hands-On Deep Learning for Games

3 (2)

By: Micheal Lanham

Overview of this book

The number of applications of deep learning and neural networks has multiplied in the last couple of years. Neural nets has enabled significant breakthroughs in everything from computer vision, voice generation, voice recognition and self-driving cars. Game development is also a key area where these techniques are being applied. This book will give an in depth view of the potential of deep learning and neural networks in game development. We will take a look at the foundations of multi-layer perceptron’s to using convolutional and recurrent networks. In applications from GANs that create music or textures to self-driving cars and chatbots. Then we introduce deep reinforcement learning through the multi-armed bandit problem and other OpenAI Gym environments. As we progress through the book we will gain insights about DRL techniques such as Motivated Reinforcement Learning with Curiosity and Curriculum Learning. We also take a closer look at deep reinforcement learning and in particular the Unity ML-Agents toolkit. By the end of the book, we will look at how to apply DRL and the ML-Agents toolkit to enhance, test and automate your games or simulations. Finally, we will cover your possible next steps and possible areas for future learning.

Preface

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Free Chapter

Section 1: The Basics

Section 1: The Basics

Deep Learning for Games

Deep Learning for Games

The past, present, and future of DL

Neural networks – the foundation

Multilayer perceptron in TF

TensorFlow Basics

Training neural networks with backpropagation

Building an autoencoder with Keras

Exercises

Summary

Convolutional and Recurrent Networks

Convolutional and Recurrent Networks

Convolutional neural networks

Understanding convolution

Building a self-driving CNN

Memory and recurrent networks

Playing Rock, Paper, Scissors with LSTMs

Exercises

Summary

GAN for Games

GAN for Games

Introducing GANs

Coding a GAN in Keras

Wasserstein GAN

Generating textures with a GAN

A GAN for creating music

Exercises

Summary

Building a Deep Learning Gaming Chatbot

Building a Deep Learning Gaming Chatbot

Neural conversational agents

Sequence-to-sequence learning

DeepPavlov

Building the chatbot server

Running the chatbot in Unity

Exercises

Summary

Section 2: Deep Reinforcement Learning

Section 2: Deep Reinforcement Learning

Introducing DRL

Introducing DRL

Reinforcement learning

RL with the OpenAI Gym

A Q-Learning model

First DRL with Deep Q-learning

RL experiments

Exercises

Summary

Unity ML-Agents

Unity ML-Agents

Installing ML-Agents

Training an agent

What's in a brain?

Monitoring training with TensorBoard

Running an agent

Exercises

Summary

Agent and the Environment

Agent and the Environment

Exploring the training environment

Understanding state

Understanding visual state

Convolution and visual state

Recurrent networks for remembering series

Exercises

Summary

Understanding PPO

Understanding PPO

Marathon RL

The partially observable Markov decision process

Actor-Critic and continuous action spaces

Understanding TRPO and PPO

Learning to tune PPO

Exercises

Summary

Rewards and Reinforcement Learning

Rewards and Reinforcement Learning

Rewards and reward functions

Sparsity of rewards

Curriculum Learning

Understanding Backplay

Curiosity Learning

Exercises

Summary

Imitation and Transfer Learning

Imitation and Transfer Learning

IL, or behavioral cloning

Online training

Offline training

Transfer learning

Imitation Transfer Learning

Exercises

Summary

Building Multi-Agent Environments

Building Multi-Agent Environments

Adversarial and cooperative self-play

Adversarial self-play

Multi-brain play

Adding individuality with intrinsic rewards

Extrinsic rewards for individuality

Exercises

Summary

Section 3: Building Games

Section 3: Building Games

Debugging/Testing a Game with DRL

Debugging/Testing a Game with DRL

Introducing the game

Setting up ML-Agents

Overriding the Unity input system

Testing through imitation

Analyzing the testing process

Exercises

Summary

Obstacle Tower Challenge and Beyond

Obstacle Tower Challenge and Beyond

The Unity Obstacle Tower Challenge

Deep Learning for your game?

Building your game

More foundations of learning

Summary

Other Books You May Enjoy

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

3 (2)

5 star

50%

4 star

0

3 star

0

2 star

0

1 star

50%

Understanding TRPO and PPO

There are many variations to the policy-and model-free algorithms that have become popular for solving RL problems of optimizing predictions of future rewards. As we have seen, many of these algorithms use an advantage function, such as Actor-Critic, where we have two sides of the problem trying to converge to the optimum solution. In this case, the advantage function is trying to find the maximum expected discounted rewards. TRPO and PPO do this by using an optimization method called a Minorize-Maximization (MM) algorithm. An example of how the MM algorithm solves a problem is shown in the following diagram:

Using the MM algorithm

This diagram was extracted from a series of blogs by Jonathon Hui that elegantly describe the MM algorithm along with the TRPO and PPO methods in much greater detail. See the following link for the source: (https://medium...

Search

Your notes and bookmarks