๐ฎ๐น๐ฏ๐ต๐ง๐ท Generating multilingual instruction datasets with Magpie ๐ฆโโฌ Oct 21 โข 18
LoRA vs Full Fine-tuning: An Illusion of Equivalence Paper โข 2410.21228 โข Published 24 days ago โข 2
Stronger Models are NOT Stronger Teachers for Instruction Tuning Paper โข 2411.07133 โข Published 10 days ago โข 28
view article Article SauerkrautLM's Multi-Phase Spectrum Training: A Technical Deep Dive By DavidGF โข 13 days ago โข 9
view article Article ๐ฎ๐น๐ฏ๐ต๐ง๐ท Generating multilingual instruction datasets with Magpie ๐ฆโโฌ By anakin87 โข Oct 21 โข 18
view article Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer By Pringled โข Oct 14 โข 55
Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses Paper โข 2408.00584 โข Published Aug 1 โข 6
view article Article Selective fine-tuning of Language Models with Spectrum By anakin87 โข Sep 3 โข 29
๐งฉ Verbalized Rebus @ CLiC-it 2024 Collection Materials for the paper "Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses" โข 13 items โข Updated Aug 5 โข 3
view article Article ๐ฅ Argilla 2.0: the data-centric tool for AI makers ๐ค By dvilasuero โข Jul 30 โข 37
view article Article Mixedbread ๐ค deepset: Announcing our New German/English Embedding Model By shadeMe โข Jul 19 โข 15
view article Article ๐ฆโ๏ธ Using Llama3 and distilabel to build fine-tuning datasets By dvilasuero โข Jun 4 โข 73
Refusal in Language Models Is Mediated by a Single Direction Paper โข 2406.11717 โข Published Jun 17 โข 2
abliterated-v3 Collection Latest gen of the abliterated models I've produced โข 17 items โข Updated Jun 3 โข 97
view article Article โ๏ธ ๐ฅ Building High-Quality Datasets with distilabel and Prometheus 2 By burtenshaw โข Jun 3 โข 26
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28 โข 158