akaistormherald's picture
Update README.md
d1da8e9 verified
|
raw
history blame contribute delete
No virus
434 Bytes
metadata
language:
  - en
license: apache-2.0
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - mistral
  - gguf
base_model: unsloth/zephyr-sft-bnb-4bit
datasets:
  - unalignment/toxic-dpo-v0.2

Uploaded model

  • Developed by: akaistormherald
  • License: apache-2.0
  • Finetuned from model : unsloth/zephyr-sft-bnb-4bit

Mistral7b + SFT + 4bit DPO training with unalignment/toxic-dpo-v0.2 == ToxicMist? ☣🌫 (GGUF)