dahara1
/

weblab-10b-instruction-sft-GPTQ

Text Generation

text-generation-inference

Model card Files Files and versions Community

dahara1 commited on Aug 26, 2023

Commit

38f5908

•

1 Parent(s): 17f36be

Update README.md

Files changed (1) hide show

README.md +9 -7

README.md CHANGED Viewed

@@ -74,11 +74,13 @@ Also, the score may change as a result of more tuning.
     - *model loading is performed with gptq_use_triton=True, and evaluation is performed with template version 0.3 using the few-shot in-context learning.*
     - *The number of few-shots is 3,3,3,2.*
-    | Model | Average | JCommonsenseQA | JNLI | MARC-ja | JSQuAD |
-    | :-- | :-- | :-- | :-- | :-- | :-- |
-    | weblab-10b-instruction-sft | 78.78 | 74.35 | 65.65 | 96.06 | 79.04 |
-    | weblab-10b | 66.38 | 65.86 | 54.19 | 84.49 | 60.98 |
-    | *weblab-10b-instruction-sft-GPTQ first tuning* | 69.72 | 74.53 | 41.70 | 89.95 | 72.69 |
-    | *weblab-10b-instruction-sft-GPTQ second tuning* | 74.59 | 74.08 | 60.72 | 91.85 | 71.70 |
-    | *weblab-10b-instruction-sft-GPTQ third tuning* | - | - | - | - | - |

     - *model loading is performed with gptq_use_triton=True, and evaluation is performed with template version 0.3 using the few-shot in-context learning.*
     - *The number of few-shots is 3,3,3,2.*
+    | Model | Average | JCommonsenseQA | JNLI | MARC-ja | JSQuAD | model |
+    | :-- | :-- | :-- | :-- | :-- | :-- | :-- |
+    | weblab-10b | 66.38 | 65.86 | 54.19 | 84.49 | 60.98 | [original model](https://huggingface.co/matsuo-lab/weblab-10b) |
+    | weblab-10b-instruction-sft | 78.78 | 74.35 | 65.65 | 96.06 | 79.04 | [original instruction model](https://huggingface.co/matsuo-lab/weblab-10b-instruction-sft) |
+    | *weblab-10b-instruction-sft-GPTQ first tuning* | 69.72 | 74.53 | 41.70 | 89.95 | 72.69 | deleted |
+    | *weblab-10b-instruction-sft-GPTQ second tuning* | 74.59 | 74.08 | 60.72 | 91.85 | 71.70 | deleted |
+    | *weblab-10b-instruction-sft-GPTQ third tuning* | - | 73.19 | 69.26 | - | 72.10 | current model. replaced on August 26th |
+    | *weblab-10b-instruction-sft-GPTQ 4th tuning* | - | - | - | - | - | - |