sb3/ppo-Pendulum-v1

araffin committed on May 4, 2022

Commit 7ee9df9 (1 parent: 91234a1)

Add code

Files changed (1)

README.md +23 -1

@@ -24,5 +24,27 @@ model-index:
   This is a trained model of a **PPO** agent playing **Pendulum-v1** using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
   ## Usage (with Stable-baselines3)
-  TODO: Add your code
+```python
+from stable_baselines3 import PPO
+from stable_baselines3.common.env_util import make_vec_env
+# Create the environment
+env_id = "Pendulum-v1"
+env = make_vec_env(env_id, n_envs=1)
+# Instantiate the agent
+model = PPO(
+    "MlpPolicy",
+    env,
+    gamma=0.98,
+    use_sde=True,
+    sde_sample_freq=4,
+    learning_rate=1e-3,
+    verbose=1,
+)
+# Train the agent
+model.learn(total_timesteps=int(1e5))
+```
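
The added snippet sets `gamma=0.98` without comment. As a rough aid for reading that value, the sketch below computes a discounted return and the common rule-of-thumb "effective horizon" `1 / (1 - gamma)`. It is plain Python and does not call stable-baselines3; the helper names `discounted_return` and `effective_horizon` are hypothetical and not part of the commit or the library.

```python
from typing import Sequence


def discounted_return(rewards: Sequence[float], gamma: float) -> float:
    """Return sum_t gamma**t * rewards[t], accumulated from the last step backwards."""
    total = 0.0
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


def effective_horizon(gamma: float) -> float:
    """Rule-of-thumb number of steps that meaningfully contribute to the return."""
    if not 0.0 <= gamma < 1.0:
        raise ValueError("gamma must be in [0, 1)")
    return 1.0 / (1.0 - gamma)


if __name__ == "__main__":
    gamma = 0.98  # value used in the README snippet
    print(f"effective horizon: {effective_horizon(gamma):.1f} steps")
    # Illustrative constant reward of -1 per step (an assumption, not Pendulum-v1 data)
    print(f"return over 200 steps: {discounted_return([-1.0] * 200, gamma):.2f}")
```

With `gamma=0.98` the horizon comes out at roughly 50 steps, so under this heuristic the agent weights rewards from the next few dozen steps far more heavily than later ones.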