---
license: cc-by-nc-2.0
tags:
- not-for-all-audiences
---

```
e88 88e d8
d888 888b 8888 8888 ,"Y88b 888 8e d88
C8888 8888D 8888 8888 "8" 888 888 88b d88888
Y888 888P Y888 888P ,ee 888 888 888 888
"88 88" "88 88" "88 888 888 888 888
b
8b,

e88'Y88 d8 888
d888 'Y ,"Y88b 888,8, d88 ,e e, 888
C8888 "8" 888 888 " d88888 d88 88b 888
Y888 ,d ,ee 888 888 888 888 , 888
"88,d88 "88 888 888 888 "YeeP" 888

PROUDLY PRESENTS
```
# MN-12B-Tarsus-exl2-longcal

Quantized using 115 rows of 8192 tokens from the default ExLlamaV2 calibration dataset.

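For reference, the sketch below shows roughly how a branch like `6b6h` could be reproduced with ExLlamaV2's `convert.py`. It is an illustration, not the uploader's exact invocation: the paths are placeholders, and the flag names (`-i` input dir, `-o` working dir, `-cf` compiled output, `-b` bpw, `-hb` lm_head bits, `-r`/`-l` calibration rows/length) are assumptions based on the script as commonly documented; verify them with `python convert.py -h` in your exllamav2 checkout.

```python
# Hypothetical reproduction sketch -- run from an exllamav2 checkout.
# All paths and flag names are assumptions; verify with `python convert.py -h`.
import subprocess

subprocess.run(
    [
        "python", "convert.py",
        "-i", "/models/MN-12B-Tarsus",        # unquantized fp16 source model
        "-o", "/tmp/exl2-work",               # scratch/working directory
        "-cf", "/models/MN-12B-Tarsus-6b6h",  # compiled quant output
        "-b", "6.0",                          # target bits per weight ("6b")
        "-hb", "6",                           # lm_head bits ("6h")
        "-r", "115",                          # calibration rows, per this card
        "-l", "8192",                         # tokens per calibration row
    ],
    check=True,
)
```
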
Branches:
- `main` -- `measurement.json`
- `8b8h` -- 8bpw, 8bit lm_head
- `6b6h` -- 6bpw, 6bit lm_head
- `4b6h` -- 4bpw, 6bit lm_head
- `3b6h` -- 3bpw, 6bit lm_head
- `2.25b6h` -- 2.25bpw, 6bit lm_head

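Since each quant lives on its own branch, download with an explicit `revision`. A minimal sketch, assuming the `huggingface_hub` package; the repo id is inferred from this card's title and may differ:

```python
# Download one quant branch; pick the revision that fits your VRAM budget.
from huggingface_hub import snapshot_download

local_dir = snapshot_download(
    repo_id="rAIfle/MN-12B-Tarsus-exl2-longcal",  # assumed repo id
    revision="6b6h",                              # branch name from the list above
)
print(local_dir)  # point your ExLlamaV2 loader at this directory
```
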
Original model link: [Envoid/MN-12B-Tarsus](https://huggingface.co/Envoid/MN-12B-Tarsus)

Original model README below.

-----
## CAUTION: This model was finetuned on a corpus that includes adult content and may produce mature content without warning.
![](https://files.catbox.moe/1k5ama.jpg)

# MN-12B-Tarsus

A full-weight finetune of [mistralai/Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) that underwent several intermediate steps.

This finetune was made with chatting/roleplaying via SillyTavern in mind, and thus all of the testing was done there. The goals were to:

- Reduce shiver-slop
- Make the model more conversationally proactive
- Give it more human-like output (i.e. less gratuitous purple prose)
- Reduce overall positivity bias

It still responds well to Mistral-Instruct formatting.

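If you want to see exactly what that formatting looks like, one option (a sketch assuming the `transformers` package and access to the base model's tokenizer) is to render the chat template rather than hand-building prompt strings:

```python
# Render the Mistral-Instruct prompt format from the base model's tokenizer.
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("mistralai/Mistral-Nemo-Instruct-2407")
prompt = tok.apply_chat_template(
    [{"role": "user", "content": "Hello!"}],
    tokenize=False,
    add_generation_prompt=True,
)
print(prompt)  # shows the exact [INST] ... [/INST] wrapping to use
```
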
The results are imperfect, and its assistant capabilities suffered somewhat as a result, but in quick testing it definitely seems to have achieved all of the goals to varying degrees.

It sometimes fumbles with tokens in odd places, so it's certainly not perfect. Possibly best used as merge-fodder.

Trained using [qlora-pipe](https://github.com/tdrussell/qlora-pipe)