bigmix
This is a merge of pre-trained language models created using mergekit.
Merge Details
Merge Method
This model was merged using the task arithmetic merge method using jeiku/Rosa_v1_3B as a base.
Models Merged
The following models were included in the merge:
- jeiku/Rosa_v1_3B + jeiku/Theory_of_Mind_128_StableLM
- jeiku/Rosa_v1_3B + jeiku/PIPPA_128_StableLM
- jeiku/Rosa_v1_3B + jeiku/LimaRP_StableLM
- jeiku/Rosa_v1_3B + jeiku/Theory_of_Mind_RP_128_StableLM
- jeiku/Rosa_v1_3B + jeiku/No_Robots_Alpaca_StableLM
- jeiku/Rosa_v1_3B + jeiku/Alpaca_128_StableLM
- jeiku/Rosa_v1_3B + jeiku/Everything_v3_128_StableLM
- jeiku/Rosa_v1_3B + jeiku/RPGPT_StableLM
- jeiku/Rosa_v1_3B + jeiku/Toxic_DPO_StableLM
- jeiku/Rosa_v1_3B + jeiku/Gnosis_256_StableLM
- jeiku/Rosa_v1_3B + jeiku/Bluemoon_cleaned_StableLM
Configuration
The following YAML configuration was used to produce this model:
merge_method: task_arithmetic
base_model: jeiku/Rosa_v1_3B
parameters:
normalize: true
models:
- model: jeiku/Rosa_v1_3B+jeiku/No_Robots_Alpaca_StableLM
parameters:
weight: 0.5
- model: jeiku/Rosa_v1_3B+jeiku/Toxic_DPO_StableLM
parameters:
weight: 0.5
- model: jeiku/Rosa_v1_3B+jeiku/Alpaca_128_StableLM
parameters:
weight: 0.4
- model: jeiku/Rosa_v1_3B+jeiku/Everything_v3_128_StableLM
parameters:
weight: 0.4
- model: jeiku/Rosa_v1_3B+jeiku/Gnosis_256_StableLM
parameters:
weight: 1
- model: jeiku/Rosa_v1_3B+jeiku/Theory_of_Mind_128_StableLM
parameters:
weight: 0.8
- model: jeiku/Rosa_v1_3B+jeiku/PIPPA_128_StableLM
parameters:
weight: 0.4
- model: jeiku/Rosa_v1_3B+jeiku/LimaRP_StableLM
parameters:
weight: 0.7
- model: jeiku/Rosa_v1_3B+jeiku/Theory_of_Mind_RP_128_StableLM
parameters:
weight: 0.6
- model: jeiku/Rosa_v1_3B+jeiku/Bluemoon_cleaned_StableLM
parameters:
weight: 0.8
- model: jeiku/Rosa_v1_3B+jeiku/RPGPT_StableLM
parameters:
weight: 0.4
dtype: float16
- Downloads last month
- 6
Inference API (serverless) does not yet support model repos that contain custom code.
Model tree for jeiku/Tofu_3B
Merge model
this model