John Schulman

admin
Popularity: 41 | 2023/5/12

Introduction

The AI researcher who proposed PPO (Proximal Policy Optimization), one of the most widely used algorithms in reinforcement learning, shares his knowledge of the field.
