Article Halo: Open Source Health Tracking with Wearables By cyrilzakka • 2 days ago • 60
Thinking LLMs: General Instruction Following with Thought Generation Paper • 2410.10630 • Published Oct 14 • 16
TPO: Aligning Large Language Models with Multi-branch & Multi-step Preference Trees Paper • 2410.12854 • Published Oct 10 • 1
Article Synthetic dataset generation techniques: Self-Instruct By davanstrien • May 15 • 12
Article Releasing the largest multilingual open pretraining dataset By Pclanglais • 8 days ago • 94
🇫🇷 Calme-3 Collection Here you can find all the new Calme-3 models • 26 items • Updated 2 days ago • 7
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning Paper • 2410.02089 • Published Oct 2 • 12
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22 • 126
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 8 items • Updated 14 days ago • 95
C4AI Aya Expanse Collection Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated 28 days ago • 26
AutoTrain: No-code training for state-of-the-art models Paper • 2410.15735 • Published Oct 21 • 57
MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages Paper • 2410.01036 • Published Oct 1 • 14
Article wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR?? By catherinearnett • Sep 27 • 35