AutoTrain: No-code training for state-of-the-art models Paper • 2410.15735 • Published Oct 21 • 56
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models Paper • 2411.04996 • Published Nov 7 • 43
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices Paper • 2410.00531 • Published Oct 1 • 28
Gated Slot Attention for Efficient Linear-Time Sequence Modeling Paper • 2409.07146 • Published Sep 11 • 19
Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization Paper • 2409.12903 • Published Sep 19 • 21
Configurable Foundation Models: Building LLMs from a Modular Perspective Paper • 2409.02877 • Published Sep 4 • 27
Transformer Explainer: Interactive Learning of Text-Generative Models Paper • 2408.04619 • Published Aug 8 • 155
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12 • 115
To Code, or Not To Code? Exploring Impact of Code in Pre-training Paper • 2408.10914 • Published Aug 20 • 40
GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS Paper • 2408.01584 • Published Aug 2 • 7