ppo-LunarLander-v2 / results.json

Commit History

First implementation. mean_reward=235.67 +/- 43.70256071321255
0b88b27

eelang committed on