OpenAI: Competitive Self-Play

in #deep-learning8 years ago (edited)

OpenAI set up competitions between 3D robots. Each robot has a goal (push the other robot, go to other side, kick the ball). The robots learn behaviors like tackling, ducking, faking, kicking and catching, and diving for the ball. Neural network of each agent was trained with PPO (Proximal Policy Optimization).

OpenAI

Coin Marketplace

STEEM 0.06
TRX 0.29
JST 0.046
BTC 64890.95
ETH 1885.94
USDT 1.00
SBD 0.51