Independent / Multi-Agent Proximal Policy Optimization (PPO) Algorithms for the RelayRL framework
A heterogeneous reinforcement learning (RL) runtime control platform for concurrent multi-actor execution.