flammenai
/

Mahou-1.3-mistral-7B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Mahou-1.3-mistral-7B / README.md

nbeerbower's picture

Update README.md

3efc0d5 verified 4 months ago

|

No virus

1.3 kB

	---
	library_name: transformers
	license: apache-2.0
	base_model:
	- nbeerbower/Flammen-Mahou-mistral-7B
	datasets:
	- flammenai/MahouMix-v1
	---
	![image/png](https://huggingface.co/flammenai/Mahou-1.0-mistral-7B/resolve/main/mahou1.png)

	# Mahou-1.3-mistral-7B

	Mahou is our attempt to build a production-ready conversational/roleplay LLM.

	Future versions will be released iteratively and finetuned from flammen.ai conversational data.

	### Chat Format

	This model has been trained to use ChatML format.

	```
	<\|im_start\|>system
	{{system}}<\|im_end\|>
	<\|im_start\|>{{char}}
	{{message}}<\|im_end\|>
	<\|im_start\|>{{user}}
	{{message}}<\|im_end\|>
	```

	### Roleplay Format

	- Speech without quotes.
	- Actions in `asterisks`

	```
	leans against wall cooly so like, i just casted a super strong spell at magician academy today, not gonna lie, felt badass.
	```

	### ST Settings

	1. Use ChatML for the Context Template.
	2. Enable Instruct Mode.
	3. Use the [Mahou preset](https://huggingface.co/datasets/flammenai/Mahou-ST-ChatML-Instruct/raw/main/Mahou.json).
	4. Recommended: Add newline as a stopping string: `["\n"]`

	### Method

	Finetuned for 10 epochs using an A100 on Google Colab.

	[Fine-tune Llama 3 with ORPO](https://huggingface.co/blog/mlabonne/orpo-llama-3) - [Maxime Labonne](https://huggingface.co/mlabonne)