Foundations of Deep Reinforcement Learning: Theory and Practice in Python

Addison-Wesley Professional, November 20, 2019 - 416 pages
The Contemporary Introduction to Deep Reinforcement Learning that Combines Theory and Practice

Deep reinforcement learning (deep RL) combines deep learning and reinforcement learning, in which artificial agents learn to solve sequential decision-making problems. In the past decade, deep RL has achieved remarkable results on a range of problems, from single-player and multiplayer games (such as Go, Atari games, and Dota 2) to robotics.

Foundations of Deep Reinforcement Learning is an introduction to deep RL that uniquely combines both theory and implementation. It starts with intuition, then carefully explains the theory of deep RL algorithms, discusses implementations in its companion software library SLM Lab, and finishes with the practical details of getting deep RL to work.
This guide is ideal for both computer science students and software engineers who are familiar with basic machine learning concepts and have a working understanding of Python.
  • Understand each key aspect of a deep RL problem
  • Explore policy- and value-based algorithms, including REINFORCE, SARSA, DQN, Double DQN, and Prioritized Experience Replay (PER)
  • Delve into combined algorithms, including Actor-Critic and Proximal Policy Optimization (PPO)
  • Understand how algorithms can be parallelized synchronously and asynchronously
  • Run algorithms in SLM Lab and learn the practical implementation details for getting deep RL to work
  • Explore algorithm benchmark results with tuned hyperparameters
  • Understand how deep RL environments are designed
Register your book for convenient access to downloads, updates, and/or corrections as they become available. See inside book for details.

Contents

Foreword
Acknowledgments
SARSA
Deep Q-Networks (DQN)
Combined Methods
Parallelization Methods
Algorithm Summary
SLM
Actions
Rewards
Transition Function
Epilogue
B Example Environments
References
Index
Improving
Network Architectures
Hardware
Environment Design
Proximal Policy Optimization (PPO)
Policy-Based and Value-Based Algorithms

About the authors (2019)

Laura Graesser is a research software engineer working in robotics at Google. She holds a master’s degree in computer science from New York University, where she specialized in machine learning.

Wah Loon Keng is an AI engineer at Machine Zone, where he applies deep reinforcement learning to industrial problems. He has a background in both theoretical physics and computer science.
