- One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning
  Paper • 2306.07967 • Published • 24
- Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
  Paper • 2306.07954 • Published • 113
- TryOnDiffusion: A Tale of Two UNets
  Paper • 2306.08276 • Published • 73
- Seeing the World through Your Eyes
  Paper • 2306.09348 • Published • 33
Collections
Collections including paper arxiv:2311.04145
- Falah/3d-birds_animals_prompts
  Viewer • Updated • 100k • 40
- Single-Shot Implicit Morphable Faces with Consistent Texture Parameterization
  Paper • 2305.03043 • Published • 5
- nvidia/HelpSteer
  Viewer • Updated • 37.1k • 2.63k • 217
- gradio/custom-component-gallery-backups
  Viewer • Updated • 57 • 311 • 3

- A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image Generation
  Paper • 2310.16656 • Published • 40
- CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images
  Paper • 2310.16825 • Published • 32
- Matryoshka Diffusion Models
  Paper • 2310.15111 • Published • 40
- I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
  Paper • 2311.04145 • Published • 32

- FreeU: Free Lunch in Diffusion U-Net
  Paper • 2309.11497 • Published • 64
- Imagic: Text-Based Real Image Editing with Diffusion Models
  Paper • 2210.09276 • Published
- On Architectural Compression of Text-to-Image Diffusion Models
  Paper • 2305.15798 • Published • 4
- Wuerstchen: Efficient Pretraining of Text-to-Image Models
  Paper • 2306.00637 • Published • 12

- OmnimatteRF: Robust Omnimatte with 3D Background Modeling
  Paper • 2309.07749 • Published • 7
- AudioSR: Versatile Audio Super-resolution at Scale
  Paper • 2309.07314 • Published • 25
- Generative Image Dynamics
  Paper • 2309.07906 • Published • 52
- MagiCapture: High-Resolution Multi-Concept Portrait Customization
  Paper • 2309.06895 • Published • 27

- VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation
  Paper • 2309.00398 • Published • 20
- AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
  Paper • 2307.04725 • Published • 64
- LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance
  Paper • 2307.00522 • Published • 32
- VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning
  Paper • 2309.15091 • Published • 32