Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2409.18869

📑Trending Papers - September 9⃣️

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published 28 days ago • 123
Attention Heads of Large Language Models: A Survey

Paper • 2409.03752 • Published Sep 5 • 86
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency

Paper • 2409.02634 • Published Sep 4 • 86
OmniGen: Unified Image Generation

Paper • 2409.11340 • Published 29 days ago • 82

about 17 hours ago

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published 19 days ago • 82
Pixtral 12B

Paper • 2410.07073 • Published 7 days ago • 54

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published 15 days ago • 130
Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published 19 days ago • 82
An accurate detection is not all you need to combat label noise in web-noisy datasets

Paper • 2407.05528 • Published Jul 8 • 3
Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP

Paper • 2407.00402 • Published Jun 29 • 22

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published 15 days ago • 130
Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published 19 days ago • 82
Phantom of Latent for Large Language and Vision Models

Paper • 2409.14713 • Published 24 days ago • 27
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Paper • 2409.12183 • Published 28 days ago • 36

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published 19 days ago • 82

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published 19 days ago • 82

MIO: A Foundation Model on Multimodal Tokens

Paper • 2409.17692 • Published 20 days ago • 47
Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published 19 days ago • 82

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published 19 days ago • 82

random interest papers

MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models

Paper • 2409.17481 • Published 21 days ago • 46
Reducing the Footprint of Multi-Vector Retrieval with Minimal Performance Impact via Token Pooling

Paper • 2409.14683 • Published 24 days ago • 8
Discovering the Gems in Early Layers: Accelerating Long-Context LLMs with 1000x Input Token Reduction

Paper • 2409.17422 • Published 21 days ago • 23
Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published 19 days ago • 82

Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection

Paper • 2409.08513 • Published Sep 13 • 10
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published Sep 12 • 42
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published 28 days ago • 72
LLMs + Persona-Plug = Personalized LLMs

Paper • 2409.11901 • Published 28 days ago • 30

Previous
1
2
3
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs