Update README.md
Browse files
README.md
CHANGED
@@ -74,7 +74,7 @@ mean_reward, std_reward = evaluate_policy(model, eval_env, n_eval_episodes=10, d
|
|
74 |
print(f"mean_reward={mean_reward:.2f} +/- {std_reward}")
|
75 |
```
|
76 |
|
77 |
-
## 3. Re-train model (
|
78 |
|
79 |
```python
|
80 |
# Load a saved LunarLander model from the Hub and retrain
|
@@ -143,7 +143,7 @@ package_to_hub(model=model, # Our trained model
|
|
143 |
commit_message=commit_message)
|
144 |
```
|
145 |
|
146 |
-
## 4. Re-train model (
|
147 |
- Change `--repo_id` become your repo id :)
|
148 |
- `--id_retrain` and `--filename_retrain` in order to load my trained model, you can change to your trained model
|
149 |
```python
|
@@ -151,7 +151,7 @@ package_to_hub(model=model, # Our trained model
|
|
151 |
--commit_message "retrain model from hub 5m" \
|
152 |
--id_retrain "thien1892/LunarLander-v2-ppo-v5" \
|
153 |
--filename_retrain "ppo-LunarLander-v2.zip" \
|
154 |
-
--total_timesteps
|
155 |
-
--learning_rate
|
156 |
--n_envs 64
|
157 |
```
|
|
|
74 |
print(f"mean_reward={mean_reward:.2f} +/- {std_reward}")
|
75 |
```
|
76 |
|
77 |
+
## 3. Re-train model (Optional 1)
|
78 |
|
79 |
```python
|
80 |
# Load a saved LunarLander model from the Hub and retrain
|
|
|
143 |
commit_message=commit_message)
|
144 |
```
|
145 |
|
146 |
+
## 4. Re-train model (Optional 2)
|
147 |
- Change `--repo_id` become your repo id :)
|
148 |
- `--id_retrain` and `--filename_retrain` in order to load my trained model, you can change to your trained model
|
149 |
```python
|
|
|
151 |
--commit_message "retrain model from hub 5m" \
|
152 |
--id_retrain "thien1892/LunarLander-v2-ppo-v5" \
|
153 |
--filename_retrain "ppo-LunarLander-v2.zip" \
|
154 |
+
--total_timesteps 2000000 \
|
155 |
+
--learning_rate 3e-5 \
|
156 |
--n_envs 64
|
157 |
```
|