Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention • Paper • 2410.10774 • Published 2 days ago • 23
Animate-X: Universal Character Image Animation with Enhanced Motion Representation • Paper • 2410.10306 • Published 3 days ago • 39
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis • Paper • 2410.08261 • Published 6 days ago • 43
Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation • Paper • 2410.05363 • Published 9 days ago • 42
BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way • Paper • 2410.06241 • Published 8 days ago • 10
Pyramidal Flow Matching for Efficient Video Generative Modeling • Paper • 2410.05954 • Published 8 days ago • 32
Lumina Family • Collection • Lumina-T2X is a unified framework for Text to Any Modality Generation • 8 items • Updated Jul 30 • 4
Meta Llama 3 • Collection • This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated 21 days ago • 677