mlx-community/Qwen2.5-7B-Instruct-Uncensored-4bit

The Model mlx-community/Qwen2.5-7B-Instruct-Uncensored-4bit was converted to MLX format from Orion-zhen/Qwen2.5-7B-Instruct-Uncensored using mlx-lm version 0.19.1.

Use with mlx

pip install mlx-lm

from mlx_lm import load, generate

model, tokenizer = load("mlx-community/Qwen2.5-7B-Instruct-Uncensored-4bit")

prompt="hello"

if hasattr(tokenizer, "apply_chat_template") and tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, tokenize=False, add_generation_prompt=True
    )

response = generate(model, tokenizer, prompt=prompt, verbose=True)

Downloads last month: 217

Safetensors

Model size

1.19B params

Tensor type

FP16

U32

Inference Examples

Text Generation

Inference API (serverless) does not yet support mlx models for this pipeline type.

Model tree for mlx-community/Qwen2.5-7B-Instruct-Uncensored-4bit

Base model

Qwen/Qwen2.5-7B

Finetuned

Qwen/Qwen2.5-7B-Instruct

Finetuned

Orion-zhen/Qwen2.5-7B-Instruct-Uncensored

Quantized

(9)

this model

Datasets used to train mlx-community/Qwen2.5-7B-Instruct-Uncensored-4bit

Evaluation results

strict accuracy on IFEval (0-Shot)
Open LLM Leaderboard

72.040
normalized accuracy on BBH (3-Shot)
Open LLM Leaderboard

35.830
exact match on MATH Lvl 5 (4-Shot)
Open LLM Leaderboard

1.360
acc_norm on GPQA (0-shot)
Open LLM Leaderboard

7.050
acc_norm on MuSR (0-shot)
Open LLM Leaderboard

13.580
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard

38.070

View on Papers With Code