A heterogeneous RL runtime control platform for concurrent multi-actor execution.
Independent / Multi-Agent Proximal Policy Optimization (PPO) Algorithms for the RelayRL framework
Data types for the RelayRL framework.
Traits for training and testing environments in the RelayRL framework.
A deep reinforcement learning 2D gridworld environment with single- and multi-agent support.