Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
dhruvabansalΒ 
posted an update May 8
Post
1637
πŸš€ Introducing RefuelLLM-2 and RefuelLLM-2-small, the next version of our large language models purpose built for data labeling, enrichment and cleaning.

RefuelLLM-2 (83.82%) outperforms all state-of-the-art LLMs, including GPT-4-Turbo (80.88%), Claude-3-Opus (79.19%) and Gemini-1.5-Pro (74.59%), across a benchmark of ~30 data labeling tasks.

RefuelLLM-2-small (79.67%), aka Llama-3-Refueled, outperforms all comparable LLMs including Claude3-Sonnet (70.99%), Haiku (69.23%) and GPT-3.5-Turbo (68.13%).

πŸ“– Open sourcing the model weights: refuelai/Llama-3-Refueled
πŸ“ Detailed blog post: https://www.refuel.ai/blog-posts/announcing-refuel-llm-2
πŸ§ͺ Try out the model here: https://labs.refuel.ai/playground

Just saying that I really like the ability to quickly test your model against the monster ones! It's amazing how well it performs against Claude. 🀯

Β·

Thank you so much @radames ! We were very intentional about making this a smooth experience for users :)

very cool! for more visibility, feel free to repost your blog in https://huggingface.co/blog-explorers !