ai-forever commited on
Commit
341a438
1 Parent(s): 8902689

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -1
README.md CHANGED
@@ -1,2 +1,9 @@
1
- ## ruGPT-3.5 13B
2
 
 
 
 
 
 
 
 
 
1
+ # ruGPT-3.5 13B
2
 
3
+ Language model for Russian language. Model has 13B parameters as you can guess from it's name. This is our biggest model so far and it was used for trainig GigaChat (read more about it in the [article](https://habr.com/ru/companies/sberbank/articles/730108/)).
4
+
5
+ ## Dataset
6
+
7
+ Model was pretrained on a 300Gb of various domains, than additionaly trained on the 100 Gb of code and legal documets. Here is the dataset structure:
8
+
9
+ <img src="https://habrastorage.org/getpro/habr/upload_files/384/cd1/40f/384cd140fbd9b4e7dd5427801be13ca0.png" width="800px">