AuriAetherwiing's picture
Update README.md
43edc11 verified
metadata
base_model:
  - nothingiisreal/MN-12B-Celeste-V1.9
  - intervitens/mini-magnum-12b-v1.1
library_name: transformers
tags:
  - mergekit
  - merge
license: cc-by-nc-nd-4.0

UPD: this model series is succeeded by EVA
Unprivated, to store for historical reasons
There's not much point in those merges, Celeste 70B 0.1 pretty much melded Celeste's and Magnum's datasets anyway
To be continued, but on a different base, under a different name, and actually trained this time, without shortcuts

MN-12B-Starcannon-v2

This is a merge of pre-trained language models created using mergekit. Turned out to be a bit more Magnum-esque, but still is very creative, and writing style is pretty nice, even if some slop words appear time to time. Might be a good fit for people wanting more variety than Magnum has, and more verbose prose than Celeste v1.9 has.

Dynamic FP8
Static GGUF (by Mradermacher)
EXL2 (by kingbri of RoyalLab)

Merge Details

Merge Method

This model was merged using the TIES merge method using nothingiisreal/MN-12B-Celeste-V1.9 as a base.

Merge fodder

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
    - model: intervitens/mini-magnum-12b-v1.1
      parameters:
        density: 0.3
        weight: 0.5
    - model: nothingiisreal/MN-12B-Celeste-V1.9
      parameters:
        density: 0.7
        weight: 0.5

merge_method: ties
base_model: nothingiisreal/MN-12B-Celeste-V1.9
parameters:
    normalize: true
    int8_mask: true
dtype: bfloat16