rccmsu
/

ruadapt_saiga2_7b_v0.1

Text Generation

Model card Files Files and versions Community

rccmsu commited on Nov 26, 2023

Commit

01fa45b

•

1 Parent(s): 69b3004

Update README.md

Files changed (1) hide show

README.md +8 -3

README.md CHANGED Viewed

@@ -1,9 +1,14 @@
 ---
 library_name: peft
 ---
-## Training procedure
-### Framework versions
-- PEFT 0.5.0

 ---
 library_name: peft
+language:
+- ru
 ---
+Use in the same way as IlyaGusev/saiga2_7b_lora.
+WARNING! Load tokenizer as AutoTokenizer.from_pretrained(model_path, use_fast=True)
+Up to 60% faster generation and 35% training with HF because of different tokenizer.
+## Model description
+Instruction version (Saiga datasets) of Russian adaptation of LLaMa-2-7B by replacing the tokenizer.
+Paper: Tikhomirov M.M., Chernyshev D.I., Impact of Tokenization on LLaMa Russian Adaptation (will be soon)