jikaixuan commited on
Commit
e639da6
1 Parent(s): cf31064

Model save

Browse files
Files changed (2) hide show
  1. README.md +10 -10
  2. training_args.bin +2 -2
README.md CHANGED
@@ -15,15 +15,15 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 0.6400
19
- - Rewards/chosen: 0.0301
20
- - Rewards/rejected: -0.0273
21
- - Rewards/accuracies: 0.6370
22
- - Rewards/margins: 0.0574
23
- - Logps/rejected: -253.2124
24
- - Logps/chosen: -269.2556
25
- - Logits/rejected: -2.4963
26
- - Logits/chosen: -2.4945
27
 
28
  ## Model description
29
 
@@ -60,7 +60,7 @@ The following hyperparameters were used during training:
60
 
61
  | Training Loss | Epoch | Step | Validation Loss | Rewards/chosen | Rewards/rejected | Rewards/accuracies | Rewards/margins | Logps/rejected | Logps/chosen | Logits/rejected | Logits/chosen |
62
  |:-------------:|:-----:|:----:|:---------------:|:--------------:|:----------------:|:------------------:|:---------------:|:--------------:|:------------:|:---------------:|:-------------:|
63
- | 0.6442 | 1.0 | 955 | 0.6400 | 0.0301 | -0.0273 | 0.6370 | 0.0574 | -253.2124 | -269.2556 | -2.4963 | -2.4945 |
64
 
65
 
66
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 0.6409
19
+ - Rewards/chosen: 0.0197
20
+ - Rewards/rejected: -0.0229
21
+ - Rewards/accuracies: 0.6130
22
+ - Rewards/margins: 0.0426
23
+ - Logps/rejected: -253.1684
24
+ - Logps/chosen: -269.3594
25
+ - Logits/rejected: -2.4973
26
+ - Logits/chosen: -2.4954
27
 
28
  ## Model description
29
 
 
60
 
61
  | Training Loss | Epoch | Step | Validation Loss | Rewards/chosen | Rewards/rejected | Rewards/accuracies | Rewards/margins | Logps/rejected | Logps/chosen | Logits/rejected | Logits/chosen |
62
  |:-------------:|:-----:|:----:|:---------------:|:--------------:|:----------------:|:------------------:|:---------------:|:--------------:|:------------:|:---------------:|:-------------:|
63
+ | 0.6468 | 1.0 | 955 | 0.6409 | 0.0197 | -0.0229 | 0.6130 | 0.0426 | -253.1684 | -269.3594 | -2.4973 | -2.4954 |
64
 
65
 
66
  ### Framework versions
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:55735e718abcb162ead37e5dd69bbbcafb5bd7d79744ef90d96d7c4c7c4f2969
3
- size 4728
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6dab51201b4225ab9086b23809d9065b793c1c64bd39c30b7fcefee8ce4762f6
3
+ size 4792