-
Qwen2.5-Coder Technical Report
Paper • 2409.12186 • Published • 123 -
Attention Heads of Large Language Models: A Survey
Paper • 2409.03752 • Published • 86 -
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency
Paper • 2409.02634 • Published • 86 -
OmniGen: Unified Image Generation
Paper • 2409.11340 • Published • 82
Collections
Discover the best community collections!
Collections including paper arxiv:2409.18869
-
Addition is All You Need for Energy-efficient Language Models
Paper • 2410.00907 • Published • 130 -
Emu3: Next-Token Prediction is All You Need
Paper • 2409.18869 • Published • 82 -
An accurate detection is not all you need to combat label noise in web-noisy datasets
Paper • 2407.05528 • Published • 3 -
Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP
Paper • 2407.00402 • Published • 22
-
Addition is All You Need for Energy-efficient Language Models
Paper • 2410.00907 • Published • 130 -
Emu3: Next-Token Prediction is All You Need
Paper • 2409.18869 • Published • 82 -
Phantom of Latent for Large Language and Vision Models
Paper • 2409.14713 • Published • 27 -
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Paper • 2409.12183 • Published • 36
-
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
Paper • 2409.17481 • Published • 46 -
Reducing the Footprint of Multi-Vector Retrieval with Minimal Performance Impact via Token Pooling
Paper • 2409.14683 • Published • 8 -
Discovering the Gems in Early Layers: Accelerating Long-Context LLMs with 1000x Input Token Reduction
Paper • 2409.17422 • Published • 23 -
Emu3: Next-Token Prediction is All You Need
Paper • 2409.18869 • Published • 82
-
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection
Paper • 2409.08513 • Published • 10 -
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale
Paper • 2409.08264 • Published • 42 -
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution
Paper • 2409.12191 • Published • 72 -
LLMs + Persona-Plug = Personalized LLMs
Paper • 2409.11901 • Published • 30