MagicQuill: An Intelligent Interactive Image Editing System Paper • 2411.09703 • Published 7 days ago • 50
OpenCulture Collection A multilingual dataset of public domain books and newspapers. • 27 items • Updated 15 days ago • 117
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais • 8 days ago • 94
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated about 2 hours ago • 172
CLEAR: Character Unlearning in Textual and Visual Modalities Paper • 2410.18057 • Published 29 days ago • 199
Mapping the Media Landscape: Predicting Factual Reporting and Political Bias Through Web Interactions Paper • 2410.17655 • Published 29 days ago • 5
How Many Van Goghs Does It Take to Van Gogh? Finding the Imitation Threshold Paper • 2410.15002 • Published Oct 19 • 6
C4AI Aya Expanse Collection Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated 28 days ago • 26
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 198
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines Paper • 2410.12705 • Published Oct 16 • 29
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation Paper • 2410.13848 • Published Oct 17 • 27
Llama-3.1-Nemotron-70B Collection SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Oct 15 • 139
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Sep 18 • 369