wbag
Walmart-the-bag
AI & ML interests
Merging, Finetuning, and Pretraining LLM models.
Organizations
Walmart-the-bag's activity
posted an update
15 days ago
reacted to merve's post
29 days ago
Post
2818
This is not a drill!
HuggingChat is now multimodal with meta-llama/Llama-3.2-11B-Vision-Instruct!
This also comes with multimodal assistants. I have migrated my Marcus Aurelius advice assistant to Llama-Vision, and Marcus can see now!
Chat with Marcus: https://hf.co/chat/assistant/65bfed22022ba290531112f8
Start chatting with Llama-Vision 3.2 11B Instruct: https://huggingface.co/chat/models/meta-llama/Llama-3.2-11B-Vision-Instruct
reacted to KingNish's post
about 2 months ago
Post
3062
A super good and fast image inpainting demo is here.
It's super cool and realistic.
Demo by @OzzyGT (Must try):
OzzyGT/diffusers-fast-inpaint
reacted to KingNish's post
6 months ago
Post
4623
Microsoft Just Launched 3 Powerful Models
1. Phi 3 Medium (4k and 128k): 14B instruction-tuned models that outperform larger models like Command R+ (104B), GPT 3.5 Pro, and Gemini Pro, and are highly competitive with top models such as Mixtral 8x22B, Llama3 70B, and GPT 4.
microsoft/Phi-3-medium-4k-instruct
DEMO: https://huggingface.co/spaces/Walmart-the-bag/Phi-3-Medium
2. Phi 3 Mini Vision 128k: A 4.5-billion-parameter, instruction-tuned vision model that has outperformed models such as Llava3 and Claude 3 and gives stiff competition to Gemini 1.0 Pro Vision.
microsoft/Phi-3-vision-128k-instruct
3. Phi 3 Small (8k and 128k): Better than Llama3 8B, Mixtral 8x7B, and GPT 3.5 Turbo.
microsoft/Phi-3-small-128k-instruct
posted an update
6 months ago
Post
1617
Phi-3-Medium just came out! So far it's decent (it fails a few riddles). Try it for yourself and let me know how it is.
Original Model: microsoft/Phi-3-medium-128k-instruct
Test it out: https://huggingface.co/spaces/Walmart-the-bag/Phi-3-medium *running on ZERO gpu*
posted an update
6 months ago
Post
2152
Mm, what a good time for a new merge!
This is a merge of 6 models that were finetuned from Llama 3 8B. It has done pretty decently on some coding tasks for its parameter size. I have been looking through smaller models because a lot of people cannot run 33B models (DeepSeek) for coding.
Original Model: Walmart-the-bag/Llama-3-LizardCoder-8B
GGUF: Walmart-the-bag/Llama-3-LizardCoder-8B-GGUF
posted an update
6 months ago
Post
1389
Juggernaut X V10 is pretty good. It's a few weeks old but not very popular. Try it out and let me know what you think. I think it is pretty good for daily use.
Original Model: RunDiffusion/Juggernaut-X-v10
Test it out: Walmart-the-bag/Juggernaut-X-v10
Author: https://huggingface.co/RunDiffusion
posted an update
6 months ago
Post
1058
Replete-AI/code_bagel
Make the ultimate coding finetune to compete with closed-source models using the code_bagel dataset!
Made by @rombodawg of Replete-AI, the code_bagel dataset contains over 800 million tokens of deduplicated and uncensored code from only reputable sources on Hugging Face. The code is formatted in the Alpaca instruct format for ease of use in training.
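For anyone who hasn't used it, here is a rough sketch of what an Alpaca-style record looks like once it is turned into a training prompt. The template wording is the one from the original Stanford Alpaca release. The field names `instruction`, `input`, and `output`, and the helper `format_alpaca`, are assumptions for illustration, not something confirmed from the code_bagel dataset card.

```python
# Alpaca-style prompt templates (wording from the original Stanford Alpaca release).
PROMPT_WITH_INPUT = (
    "Below is an instruction that describes a task, paired with an input that "
    "provides further context. Write a response that appropriately completes "
    "the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Input:\n{input}\n\n### Response:\n"
)
PROMPT_NO_INPUT = (
    "Below is an instruction that describes a task. Write a response that "
    "appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:\n"
)


def format_alpaca(record: dict) -> str:
    """Turn one record into a single training string.

    Assumes the keys 'instruction', 'input' (optional), and 'output';
    these names are hypothetical and may differ in the real dataset.
    """
    instruction = (record.get("instruction") or "").strip()
    context = (record.get("input") or "").strip()
    output = (record.get("output") or "").strip()
    if context:
        prompt = PROMPT_WITH_INPUT.format(instruction=instruction, input=context)
    else:
        # Records without extra context use the shorter template.
        prompt = PROMPT_NO_INPUT.format(instruction=instruction)
    return prompt + output


example = {
    "instruction": "Write a function that adds two numbers.",
    "input": "",
    "output": "def add(a, b):\n    return a + b",
}
print(format_alpaca(example))
```

Check the dataset card for the actual column names before training on it.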