lightonai
/

alfred-40b-0723

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

WeightsnWizardry commited on Jul 31, 2023

Commit

e60fdf7

•

1 Parent(s): fb8bfcb

Update README.md

Files changed (1) hide show

README.md +2 -8

README.md CHANGED Viewed

@@ -118,19 +118,15 @@ Samples from each of the datasets have been programmatically formatted to chat,
 | Num Rollouts       | 1024       |
 | PPO Epochs         | 1          |
 | Value Epochs       | 1          |
-| Constant KL Coef   | `true`     |
-| Init KL Coef       | 0.01       |
-| Target KL          | 6.0        |
-| K Beta             | 0.1        |
 | Gamma              | 1.0        |
 | GAE Lambda         | 0.95       |
 | Clip Range         | 0.2        |
 | Clip Range Value   | 0.2        |
 | Whiten Advantages  | `true`     |
 | Whiten Rewards     | `false`    |
-| Loss on EPD        | `true`     |
 | Max Steps          | 200        |
-| microbatch_size    | 1          |
 | PPO steps/epoch    | 1          |
 | Value steps/epoch  | 8          |
@@ -141,8 +137,6 @@ Samples from each of the datasets have been programmatically formatted to chat,
 | Continuation Min Len | 0          |
 |                Top P | 1.0        |
 |          Temperature | 1.0        |
-|     # Cached Batches | 128        |
-|      Microbatch size | 1          |
 ## Evaluation

 | Num Rollouts       | 1024       |
 | PPO Epochs         | 1          |
 | Value Epochs       | 1          |
+| KL Coef            | 0.01       |
 | Gamma              | 1.0        |
 | GAE Lambda         | 0.95       |
 | Clip Range         | 0.2        |
 | Clip Range Value   | 0.2        |
 | Whiten Advantages  | `true`     |
 | Whiten Rewards     | `false`    |
+| Loss on EOD        | `true`     |
 | Max Steps          | 200        |
 | PPO steps/epoch    | 1          |
 | Value steps/epoch  | 8          |
 | Continuation Min Len | 0          |
 |                Top P | 1.0        |
 |          Temperature | 1.0        |
 ## Evaluation