view article Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK By davidberenstein1957 • about 5 hours ago • 8
view article Article Halo: Open Source Health Tracking with Wearables By cyrilzakka • 2 days ago • 60
Aligning Large Language Models via Self-Steering Optimization Paper • 2410.17131 • Published 30 days ago • 21
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais • 8 days ago • 94
view article Article Releasing Outlines-core 0.1.0: structured generation in Rust and Python about 1 month ago • 41
view article Article How to build a custom text classifier without days of human labeling By sdiazlor • Oct 17 • 55
view article Article How to optimize your data labelling project with custom interfaces By burtenshaw • Oct 16 • 18
DSBench: How Far Are Data Science Agents to Becoming Data Science Experts? Paper • 2409.07703 • Published Sep 12 • 66
view article Article Fine-tuning a token classification model for legal data using Argilla and AutoTrain By bikashpatra • Sep 7 • 14
Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models Paper • 2408.02442 • Published Aug 5 • 21
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment Paper • 2408.06266 • Published Aug 12 • 9
view article Article 🔥 Argilla 2.0: the data-centric tool for AI makers 🤗 By dvilasuero • Jul 30 • 37
Llama 3.1 GPTQ, AWQ, and BNB Quants Collection Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & VLLM 🤗 • 9 items • Updated Sep 26 • 55
NuminaMath Collection Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 6 items • Updated Jul 21 • 63