Stefano Fiorucci PRO

anakin87

https://stefano-fiorucci.netlify.app

AI & ML interests

Contributing to Haystack, the LLM Framework 🏗️. NLP / LLMs.

Articles

🇮🇹🇯🇵🇧🇷 Generating multilingual instruction datasets with Magpie 🐦‍⬛

Oct 21

• 18

Selective fine-tuning of Language Models with Spectrum

Sep 3

• 29

anakin87's activity

upvoted 3 papers 6 days ago

upvoted an article 12 days ago

Article

SauerkrautLM's Multi-Phase Spectrum Training: A Technical Deep Dive

13 days ago

• 9

upvoted an article 20 days ago

Article

Introducing GGUF-my-LoRA

20 days ago

• 11

upvoted 2 articles about 1 month ago

Article

🇮🇹🇯🇵🇧🇷 Generating multilingual instruction datasets with Magpie 🐦‍⬛

Oct 21

• 18

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Oct 14

• 55

upvoted a paper about 2 months ago

Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses

Paper • 2408.00584 • Published Aug 1 • 6

upvoted an article 3 months ago

Article

Selective fine-tuning of Language Models with Spectrum

Sep 3

• 29

upvoted a paper 3 months ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7 • 7

upvoted a collection 4 months ago

🧩 Verbalized Rebus @ CLiC-it 2024

Collection

Materials for the paper "Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses" • 13 items • Updated Aug 5 • 3

upvoted 4 articles 4 months ago

Article

🔥 Argilla 2.0: the data-centric tool for AI makers 🤗

Jul 30

• 37

Article

MMLU-PRO-ITA a new eval for Italian LLMs

Jul 23

• 3

Article

Mixedbread 🤝 deepset: Announcing our New German/English Embedding Model

Jul 19

• 15

Article

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

Jun 4

• 73

upvoted a paper 5 months ago

Refusal in Language Models Is Mediated by a Single Direction

Paper • 2406.11717 • Published Jun 17 • 2

upvoted a collection 5 months ago

abliterated-v3

Collection

Latest gen of the abliterated models I've produced • 17 items • Updated Jun 3 • 97

upvoted 3 articles 6 months ago

Article

Uncensor any LLM with abliteration

Jun 13

• 369

Article

⚗️ 🔥 Building High-Quality Datasets with distilabel and Prometheus 2

Jun 3

• 26

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28

• 158