3B model?

by KatyTheCutie - opened Jan 4

Jan 4

Will you experiment with 3B models anytime soon? I think StableLM's 3B model has potential, the Zephyr model is outstanding for a 3B model but its not that good for Roleplay.

Walmart-the-bag

Owner Jan 4

Will you experiment with 3B models anytime soon? I think StableLM's 3B model has potential, the Zephyr model is outstanding for a 3B model but its not that good for Roleplay.

I will try to experiment with 3B models today. I will definitely try with StableLM's model for you.

KatyTheCutie

Jan 4

Thank you! You're the best ☺️

KatyTheCutie

Jan 4

HuggingFaceH4/no_robots: for more human-like chats.
Teknium/openhermes: for increased knowledge and intelligence.
Airoboros 3.1: for better writing ability.
Unalignment/toxic-dpo-v0.1: for uncensored responses.

Just some suggestions.

Walmart-the-bag

Owner Jan 4

Would you want me to experiment with a MoE or Finetune for 3B?

KatyTheCutie

Jan 5

A 3B finetune would be nice but anything would be good!

Walmart-the-bag

Owner Jan 5

A 3B finetune would be nice but anything would be good!

All files should be uploaded soon with information. https://huggingface.co/Walmart-the-bag/zephyr-quiklang-3b

KatyTheCutie

Jan 5

Thank you! Can you perhaps make a Q6 GGUF of it? I don't want to bother @TheBloke with it, Thanks again!

Walmart-the-bag

Owner Jan 5

Thank you! Can you perhaps make a Q6 GGUF of it? I don't want to bother @TheBloke with it, Thanks again!

I am attempting to, but keep failing. I will keep trying but I would suggest TheBloke to do it since I am unexperienced with llama.cpp and gguf quants.

Walmart-the-bag

Owner Jan 6

TheBloke has published GGUF version.

Walmart-the-bag changed discussion status to closed Jan 6

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment