205516.8 TFLOPS

Leandro von Werra

lvwerra

https://github.com/lvwerra

AI & ML interests

NLP and RL

Articles

CinePile 2.0 - making stronger datasets with adversarial refinement

FineVideo: behind the scenes

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

A failed experiment: Infini-Attention, and why we should keep trying?

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

Welcome Llama 3 - Meta's new open LLM

StarCoder2 and The Stack v2

Constitutional AI with Open LLMs

Preference Tuning LLMs with Direct Preference Optimization Methods

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

The N Implementation Details of RLHF with PPO

Finetune Stable Diffusion Models with DDPO via TRL

Spread Your Wings: Falcon 180B is here

Code Llama: Llama 2 learns to code

Fine-tune Llama 2 with DPO

The Falcon has landed in the Hugging Face ecosystem

Creating a Coding Assistant with StarCoder

StarCoder: A State-of-the-Art LLM for Code

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Evaluating Language Model Bias with 🤗 Evaluate

Announcing Evaluation on the Hub

lvwerra's activity

upvoted a paper 8 days ago

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published 9 days ago • 20

upvoted an article about 1 month ago

FineVideo: behind the scenes

Article • Sep 23 • 23

upvoted a paper about 2 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18 • 125

upvoted a paper 3 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22 • 116

upvoted 3 articles 3 months ago

Tool Use, Unified

Article • Aug 12 • 62

A failed experiment: Infini-Attention, and why we should keep trying?

Article • Aug 14 • 48

XetHub is joining Hugging Face!

Article • Aug 8 • 79

upvoted 4 articles 4 months ago

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Article • Jul 23 • 213

Docmatix - a huge dataset for Document Visual Question Answering

Article • Jul 18 • 66

SmolLM - blazingly fast and remarkably powerful

Article • Jul 16 • 258

Our Transformers Code Agent beats the GAIA benchmark!

Article • Jul 1 • 46

upvoted a paper 4 months ago

Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1 • 42

upvoted 2 papers 5 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25 • 86

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Paper • 2406.15877 • Published Jun 22 • 45

upvoted 2 articles 5 months ago

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Article • Jun 24 • 177

Putting RL back in RLHF

Article • Jun 12 • 62

upvoted a paper 5 months ago

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

Paper • 2405.18392 • Published May 28 • 12

upvoted a collection 6 months ago

Leaderboards and benchmarks ✨

Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 73 items • Updated 4 days ago • 88

upvoted 2 articles 6 months ago

2024-04-22 - Hub Incident Post Mortem

Article • May 17 • 17

License to Call: Introducing Transformers Agents 2.0

Article • May 13 • 116