Edit model card

Descripton:

This is ruadapt version of upstage/SOLAR-10.7B-v1.0 model with tokenizer replacement. Additionally to previous work, the model was adapted in two stages: 1) vocabulary optimization, and 2) additional attention fine-tuning using LoRa.

How to cite:

Tikhomirov M., Chernyshev D. Impact of Tokenization on LLaMa Russian Adaptation //arXiv preprint arXiv:2312.02598. – 2023.

Downloads last month
35
Safetensors
Model size
10.7B params
Tensor type
FP16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for msu-rcc-lair/ruadapt_solar_10.7_darulm_unigram_proj_init_twostage_v1

Finetunes
1 model

Spaces using msu-rcc-lair/ruadapt_solar_10.7_darulm_unigram_proj_init_twostage_v1 4