Gabriel Martín Blázquez's picture

Gabriel Martín Blázquez

gabrielmbmb

·

https://gabrielmb.com

AI & ML interests

ML Engineer

Recent Activity

upvoted a paper 4 days ago

liked a dataset 6 days ago

microsoft/orca-agentinstruct-1M-v1

liked a model 6 days ago

numind/NuExtract-1.5-smol

Articles

How we leveraged distilabel to create an Argilla 2.0 Chatbot

Organizations

gabrielmbmb's activity

upvoted a paper 4 days ago

Thinking LLMs: General Instruction Following with Thought Generation

Paper • 2410.10630 • Published Oct 14 • 16

upvoted a paper 7 days ago

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published 9 days ago • 58

upvoted a collection 11 days ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated 3 days ago • 223

upvoted a collection 21 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated about 4 hours ago • 172

upvoted an article 22 days ago

Article

Code a simple RAG from scratch

By

•

23 days ago

• 8

upvoted a collection 29 days ago

🍓 Ichigo v0.3

The experimental family designed to train LLMs to understand sound natively. • 6 items • Updated 11 days ago • 17

upvoted an article 30 days ago

Article

Releasing Outlines-core 0.1.0: structured generation in Rust and Python

about 1 month ago

• 41

upvoted 5 articles about 1 month ago

Article

🇮🇹🇯🇵🇧🇷 Generating multilingual instruction datasets with Magpie 🐦‍⬛

By

•

Oct 21

• 18

Article

How to build a custom text classifier without days of human labeling

By

•

Oct 17

• 55

Article

Fixing Gradient Accumulation

Oct 16

• 41

Article

How to optimize your data labelling project with custom interfaces

By

•

Oct 16

• 18

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

By

•

Oct 14

• 55

upvoted a paper about 1 month ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7 • 166

upvoted an article about 1 month ago

Article

Improving Parquet Dedupe on Hugging Face Hub

Oct 5

• 30

upvoted a paper about 2 months ago

Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

Paper • 2410.02740 • Published Oct 3 • 52

upvoted an article about 2 months ago

Article

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

By

•

Sep 27

• 35

upvoted a paper about 2 months ago

Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization

Paper • 2409.12903 • Published Sep 19 • 21

upvoted a collection about 2 months ago

Useful Spaces

20 items • Updated 5 days ago • 5

upvoted 2 articles about 2 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25

• 169

Article

Exploring the Daily Papers Page on Hugging Face

Sep 23

• 39