rombodawg's picture
Create README.md
0f57a61 verified
|
raw
history blame
No virus
868 Bytes
metadata
library_name: transformers
base_model:
  - Qwen/Qwen2.5-3B-Instruct
license: other
license_name: qwen-research
license_link: https://huggingface.co/Qwen/Qwen2.5-3B-Instruct/blob/main/LICENSE

Rombos-LLM-V2.5-Qwen-3b

image/jpeg

Rombos-LLM-V2.5-Qwen-3b is a continues finetuned version of Qwen2.5-3B. I noticed recently that the Qwen team did not learn from my methods of continuous finetuning, the great benefits, and no downsides of it. So I took it upon myself to merge the instruct model with the base model myself using the Ties merge method

This version of the model shows higher performance than the original instruct and base models.

Quants:

GGUF: https://huggingface.co/bartowski/Replete-LLM-V2.5-Qwen-3b-GGUF

Benchmarks: (Coming soon)