OuteAI
/

Lite-Mistral-150M-v2-Instruct

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

edwko commited on Jul 18

Commit

25f3b6b

•

1 Parent(s): 841b905

Update README.md

Files changed (1) hide show

README.md +12 -12

README.md CHANGED Viewed

@@ -16,6 +16,18 @@ The model was trained on ~8 billion tokens.
 - Extended Training: Further refinement of the model, resulting in improved benchmark performance and overall text generation quality.
 - Tokenizer changes.
 ## How coherent is the 150M model?
 Let's look at real-world examples:
@@ -132,18 +144,6 @@ The model shows some promise in understanding context related to simple requests
   </tr>
 </table>
-## Chat format
-This model uses a specific chat format for optimal performance.
-```
-<s>system
-[System message]</s>
-<s>user
-[Your question or message]</s>
-<s>assistant
-[The model's response]</s>
-```
 ## Usage with HuggingFace transformers
 The model can be used with HuggingFace's `transformers` library:
 ```python

 - Extended Training: Further refinement of the model, resulting in improved benchmark performance and overall text generation quality.
 - Tokenizer changes.
+## Chat format
+This model is **very sensitive** to the chat template used. Ensure you use the correct template:
+```
+<s>system
+[System message]</s>
+<s>user
+[Your question or message]</s>
+<s>assistant
+[The model's response]</s>
+```
 ## How coherent is the 150M model?
 Let's look at real-world examples:
   </tr>
 </table>
 ## Usage with HuggingFace transformers
 The model can be used with HuggingFace's `transformers` library:
 ```python