This is a simple experiment in German ORPO training: one epoch with QLoRA and Unsloth on Vezora/Mistral-22B-v0.2.
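
For readers unfamiliar with ORPO, the objective can be sketched in a few lines. This is a minimal illustration of the odds-ratio preference loss as generally described for ORPO, not the training code used for this model. The function names, the per-example scalar inputs, and the weight `lam=0.1` are assumptions made for the example.

```python
import math


def log_odds(avg_logp: float) -> float:
    """Return log(p / (1 - p)) for p = exp(avg_logp), where avg_logp < 0.

    avg_logp is the length-normalized (mean per-token) log-probability
    that the model assigns to a response.
    """
    # log1p(-exp(x)) evaluates log(1 - p) without forming 1 - p directly.
    return avg_logp - math.log1p(-math.exp(avg_logp))


def orpo_loss(nll_chosen: float,
              avg_logp_chosen: float,
              avg_logp_rejected: float,
              lam: float = 0.1) -> float:
    """SFT loss on the chosen response plus a weighted odds-ratio penalty.

    lam is a hypothetical weight chosen for illustration only.
    """
    log_odds_ratio = log_odds(avg_logp_chosen) - log_odds(avg_logp_rejected)
    # -log(sigmoid(x)) == log(1 + exp(-x))
    odds_ratio_term = math.log1p(math.exp(-log_odds_ratio))
    return nll_chosen + lam * odds_ratio_term


# The penalty shrinks when the chosen response is more likely than the rejected one.
preferred = orpo_loss(1.0, avg_logp_chosen=-0.2, avg_logp_rejected=-1.0)
reversed_pair = orpo_loss(1.0, avg_logp_chosen=-1.0, avg_logp_rejected=-0.2)
print(preferred < reversed_pair)
```

Because the penalty is added to the ordinary supervised loss, this formulation needs no separate reference model, which is one reason it pairs well with a memory-constrained QLoRA setup.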

Safetensors

Model size: 22.2B params

Tensor type: BF16
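
As a rough guide to what these numbers imply, the sketch below estimates the storage needed for the weights alone. It assumes exactly 22.2 billion parameters and ignores activations, the KV cache, optimizer state, and quantization metadata. The helper name `weight_gigabytes` is hypothetical, and the 4-bit figure is only an approximation of what a QLoRA-style quantized load would need.

```python
def weight_gigabytes(n_params: int, bits_per_param: int) -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bits_per_param / 8 / 1e9


N_PARAMS = 22_200_000_000  # 22.2B params, as listed above

bf16_gb = weight_gigabytes(N_PARAMS, 16)  # BF16 stores 2 bytes per parameter
four_bit_gb = weight_gigabytes(N_PARAMS, 4)  # assumed 4-bit quantized weights

print(f"BF16 weights:  ~{bf16_gb:.1f} GB")
print(f"4-bit weights: ~{four_bit_gb:.1f} GB")
```

Actual memory use will be higher than these estimates once runtime buffers are included.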

Datasets used to train johannhartmann/mistral22b_orpo_de