PPO Agent playing LunarLander-v2
Usage (with SB3 RL Zoo)
Training (with the RL Zoo)
Hyperparameters
Environment Arguments
PPO Agent playing LunarLander-v2
This is a trained…
Decision Transformer model trained on medium-replay trajectories sampled from the Gym Walker2d environment
…
