Update README.md
README.md CHANGED
@@ -17,7 +17,7 @@ We present Meltemi 7B Instruct v1.5 Large Language Model (LLM), a new and improv
 
 # Model Information
 
-- Vocabulary extension of the Mistral 7b tokenizer with Greek tokens
+- Vocabulary extension of the Mistral 7b tokenizer with Greek tokens for lower costs and faster inference (**1.52** vs. 6.80 tokens/word for Greek)
 - 8192 context length
 - Fine-tuning has been done with the [Odds Ratio Preference Optimization (ORPO)](https://arxiv.org/abs/2403.07691) algorithm using 97k preference data:
   * 89,730 Greek preference data which are mostly translated versions of high-quality datasets on Hugging Face
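
The fertility figure added in the changed line can be checked directly: tokenize a Greek sentence with both tokenizers and divide the token count by the word count. A minimal sketch, assuming the Hugging Face repo ids `mistralai/Mistral-7B-v0.1` and `ilsp/Meltemi-7B-Instruct-v1.5` (the exact ids are assumptions, not stated in this diff):

```python
# Minimal sketch: compare tokenizer fertility (tokens per word) on Greek text.
# Model ids are assumptions; substitute the actual repositories.
from transformers import AutoTokenizer

greek_text = "Το μελτέμι είναι ο ξηρός βόρειος άνεμος του Αιγαίου."
n_words = len(greek_text.split())

for model_id in ("mistralai/Mistral-7B-v0.1", "ilsp/Meltemi-7B-Instruct-v1.5"):
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    n_tokens = len(tokenizer.encode(greek_text, add_special_tokens=False))
    print(f"{model_id}: {n_tokens / n_words:.2f} tokens/word")
```

A tokenizer whose vocabulary lacks Greek subwords falls back to byte- and character-level pieces, which is what drives the 6.80 tokens/word figure and the corresponding cost and latency overhead.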
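The ORPO bullet in the diff describes preference-based fine-tuning without a separate reward model. A hypothetical sketch of that style of training using TRL's `ORPOTrainer` on prompt/chosen/rejected pairs; the base model id, dataset name, and hyperparameters below are placeholders, not the authors' actual setup:

```python
# Hypothetical ORPO fine-tuning sketch with TRL; values are illustrative only.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import ORPOConfig, ORPOTrainer

model_id = "ilsp/Meltemi-7B-v1.5"  # assumed base model id
model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Preference pairs: each row holds a "prompt", a preferred "chosen" response,
# and a dispreferred "rejected" response, the format ORPOTrainer expects.
dataset = load_dataset("your-org/greek-preference-data", split="train")  # placeholder

config = ORPOConfig(
    output_dir="meltemi-orpo",
    beta=0.1,                       # weight of the odds-ratio penalty term
    max_length=8192,                # matches the model's context length
    per_device_train_batch_size=1,
)

trainer = ORPOTrainer(
    model=model,
    args=config,
    train_dataset=dataset,
    tokenizer=tokenizer,
)
trainer.train()
```

Because ORPO folds the preference signal into the supervised objective via an odds-ratio term, a single pass over the 97k pairs suffices; no frozen reference model or separate RLHF stage is needed.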