TensorFlow Agents:TensorFlow 强化学习基础设施

TensorFlow Agents

This project provides optimized infrastructure for reinforcement learning. It extends the OpenAI gym interface to multiple parallel environments and allows agents to be implemented in TensorFlow and perform batched computation. As a starting point, we provide BatchPPO, an optimized implementation of Proximal Policy Optimization.


Related posts

Leave a Comment