Skip to content Skip to sidebar Skip to footer

ppo-QbertNoFrameskip-v4

PPO Agent playing QbertNoFrameskip-v4 This is a trained model of a PPO agent playing QbertNoFrameskip-v4 using the stable-baselines3 library. The training report: https://wandb.ai/simoninithomas/HFxSB3/reports/Atari-HFxSB3-Benchmark--VmlldzoxNjI3NTIy Evaluation Results Mean_reward: 15685.00 +/- 115.217 Usage (with Stable-baselines3) …