rafaeloc15/mistral-small-v4

Files changed (5) hide show

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0237
 ## Model description
@@ -40,7 +40,7 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.0002
-- train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
@@ -52,8 +52,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.0373        | 1.0   | 110  | 0.0274          |
-| 0.0302        | 2.0   | 220  | 0.0237          |
 ### Framework versions

 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0259
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 0.0002
+- train_batch_size: 4
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.0434        | 1.0   | 226  | 0.0348          |
+| 0.0337        | 2.0   | 452  | 0.0259          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -6,6 +6,7 @@
   "fan_in_fan_out": false,
   "inference_mode": true,
   "init_lora_weights": true,
   "layers_pattern": null,
   "layers_to_transform": null,
   "loftq_config": {},

   "fan_in_fan_out": false,
   "inference_mode": true,
   "init_lora_weights": true,
+  "layer_replication": null,
   "layers_pattern": null,
   "layers_to_transform": null,
   "loftq_config": {},

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:96b67927fcb00c931555dd243722108043295fc73678578097ae65fe6e7ecdcf
 size 109069176

 version https://git-lfs.github.com/spec/v1
+oid sha256:3a3f17c1e81feb25a14b952be88d129cd178ecc3216d973613ec49a87eafa3b0
 size 109069176

runs/Apr21_22-55-26_07dc28601d23/events.out.tfevents.1713740164.07dc28601d23.675.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:adeb86cc378d52259fe58f2bcb07d4eddce2219298b8b7fdf7d953022d1be66f
+size 15404

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e8c798461c57d7010b041a17a668493a8d2410e8526dd949bcb3fb2d86ac4ae4
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:8fe6676f36f26bf78d4ec4230573df46f0815c8b5a21c68511928a36f1b08d87
 size 4920