← Back
PPO
technique
Endorsed by 1 creator
Proximal Policy Optimization, a reinforcement learning algorithm
Endorsed by
Topics
Used by
These creators have mentioned using this product
Greg Brockman
uses
"The algorithm we use called PPO, you plan over every single time step. There's no hierarchy."