ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning Paper • 2411.05003 • Published 5 days ago • 63
VEnhancer: Generative Space-Time Enhancement for Video Generation Paper • 2407.07667 • Published Jul 10 • 13
Training-free Regional Prompting for Diffusion Transformers Paper • 2411.02395 • Published 8 days ago • 23
NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks Paper • 2410.20650 • Published 16 days ago • 15
MarDini: Masked Autoregressive Diffusion for Video Generation at Scale Paper • 2410.20280 • Published 17 days ago • 21
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation Paper • 2410.18666 • Published 20 days ago • 17
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer Paper • 2410.10812 • Published 29 days ago • 14
DragAnything: Motion Control for Anything using Entity Representation Paper • 2403.07420 • Published Mar 12 • 13
Improving Long-Text Alignment for Text-to-Image Diffusion Models Paper • 2410.11817 • Published 28 days ago • 14
Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations Paper • 2410.10792 • Published 29 days ago • 26
Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention Paper • 2410.10774 • Published 29 days ago • 23
ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion Paper • 2410.08168 • Published Oct 10 • 7
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis Paper • 2410.08261 • Published Oct 10 • 48
An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control Paper • 2403.04880 • Published Mar 7 • 53
DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models Paper • 2410.08207 • Published Oct 10 • 18
ViBiDSampler: Enhancing Video Interpolation Using Bidirectional Diffusion Sampler Paper • 2410.05651 • Published Oct 8 • 13
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation Paper • 2410.07171 • Published Oct 9 • 41