RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published 2 days ago • 35
Rapid Response: Mitigating LLM Jailbreaks with a Few Examples Paper • 2411.07494 • Published 10 days ago • 1
Stronger Models are NOT Stronger Teachers for Instruction Tuning Paper • 2411.07133 • Published 10 days ago • 28
FineTuneBench: How well do commercial fine-tuning APIs infuse knowledge into LLMs? Paper • 2411.05059 • Published 14 days ago • 1
mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding Paper • 2409.03420 • Published Sep 5 • 25
Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks? Paper • 2411.05000 • Published 14 days ago • 21
TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation Paper • 2411.04709 • Published 16 days ago • 25
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models Paper • 2411.04996 • Published 14 days ago • 48
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published 14 days ago • 108
From Medprompt to o1: Exploration of Run-Time Strategies for Medical Challenge Problems and Beyond Paper • 2411.03590 • Published 16 days ago • 9
Language Models can Self-Lengthen to Generate Long Texts Paper • 2410.23933 • Published 21 days ago • 16
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders Paper • 2410.22366 • Published 24 days ago • 73
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding Paper • 2410.17434 • Published 30 days ago • 24
EMMA: End-to-End Multimodal Model for Autonomous Driving Paper • 2410.23262 • Published 22 days ago • 2
LongReward: Improving Long-context Large Language Models with AI Feedback Paper • 2410.21252 • Published 24 days ago • 16