rccmsu
/

ruadapt_saiga2_7b_v0.1

Text Generation

Model card Files Files and versions Community

rccmsu commited on Jan 15

Commit

082dbdf

•

1 Parent(s): 946002a

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -11,6 +11,8 @@ Up to 60% faster generation and 35% training (on identical russian text sequence
 Colab: https://colab.research.google.com/drive/109ZhEB6STy-0jO-Z_4ttkWr1jg_FCTRW?usp=sharing
 ## Model description
 Instruction version (Saiga datasets) of Russian adaptation of LLaMa-2-7B by replacing the tokenizer.

 Colab: https://colab.research.google.com/drive/109ZhEB6STy-0jO-Z_4ttkWr1jg_FCTRW?usp=sharing
+Paper: Tikhomirov M., Chernyshev D. Impact of Tokenization on LLaMa Russian Adaptation //arXiv preprint arXiv:2312.02598. – 2023.
 ## Model description
 Instruction version (Saiga datasets) of Russian adaptation of LLaMa-2-7B by replacing the tokenizer.