Atari Games On Atari 2600 Skiing
评估指标
Score
评测结果
各个模型在此基准测试上的表现结果
比较表格
模型名称 | Score |
---|---|
generalized-data-distribution-iteration | -6774 |
dna-proximal-policy-optimization-with-a-dual | -29974 |
fully-parameterized-quantile-function-for | -9085.3 |
distributional-reinforcement-learning-with-1 | -9324 |
recurrent-rational-networks | -23487 |
the-arcade-learning-environment-an-evaluation | 0 |
recurrent-rational-networks | -23582 |
noisy-networks-for-exploration | -7550 |
implicit-quantile-networks-for-distributional | -9289 |
gdi-rethinking-what-makes-reinforcement | -6774 |
train-a-real-world-local-path-planner-in-one | -8295.4 |
first-return-then-explore | -3660 |
distributed-prioritized-experience-replay | -10789.9 |
agent57-outperforming-the-atari-human | -4202.6 |
the-arcade-learning-environment-an-evaluation | 0 |
online-and-offline-reinforcement-learning-by | -30000 |
mastering-atari-go-chess-and-shogi-by | -29968.36 |
recurrent-experience-replay-in-distributed | -30021.7 |
evolving-simple-programs-for-playing-atari | -9011 |
increasing-the-action-gap-new-operators-for | -13264.51 |
mastering-atari-with-discrete-world-models-1 | -9299 |
impala-scalable-distributed-deep-rl-with | -10180.38 |
generalized-data-distribution-iteration | -6025 |