Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published 29 days ago • 109
Article Improving Hugging Face Training Efficiency Through Packing with Flash Attention about 1 month ago • 19
Article How to train a new language model from scratch using Transformers and Tokenizers Feb 14, 2020 • 16
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated Jul 31 • 325
Whisper Release Collection Whisper includes both English-only and multilingual checkpoints for ASR and ST, ranging from 38M params for the tiny models to 1.5B params for large. • 12 items • Updated Sep 13, 2023 • 74