Francesco-A
commited on
Commit
•
a47abee
1
Parent(s):
9a1187f
Update README.md
Browse files
README.md
CHANGED
@@ -26,12 +26,10 @@ This is a trained model of a **PPO** agent playing **MountainCar-v0**
|
|
26 |
using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
|
27 |
|
28 |
# Model Details
|
29 |
-
```python
|
30 |
- Model Name: ppo-MountainCar-v0
|
31 |
- Model Type: Proximal Policy Optimization (PPO)
|
32 |
- Policy Architecture: MultiLayerPerceptron (MLPPolicy)
|
33 |
- Environment: MountainCar-v0
|
34 |
-
```
|
35 |
- Training Data: The model was trained using three consecutive training sessions:
|
36 |
- First training session: Total timesteps = 1,000,000
|
37 |
- Second training session: Total timesteps = 500,000
|
|
|
26 |
using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
|
27 |
|
28 |
# Model Details
|
|
|
29 |
- Model Name: ppo-MountainCar-v0
|
30 |
- Model Type: Proximal Policy Optimization (PPO)
|
31 |
- Policy Architecture: MultiLayerPerceptron (MLPPolicy)
|
32 |
- Environment: MountainCar-v0
|
|
|
33 |
- Training Data: The model was trained using three consecutive training sessions:
|
34 |
- First training session: Total timesteps = 1,000,000
|
35 |
- Second training session: Total timesteps = 500,000
|