-
TC4D: Trajectory-Conditioned Text-to-4D Generation
Paper • 2403.17920 • Published • 16 -
VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation
Paper • 2403.17001 • Published • 6 -
GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation
Paper • 2403.12365 • Published • 10 -
IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models
Paper • 2403.13535 • Published • 22
Collections
Discover the best community collections!
Collections including paper arxiv:2403.12365
-
GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation
Paper • 2403.12365 • Published • 10 -
Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Paper • 2407.11398 • Published • 8 -
SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency
Paper • 2407.17470 • Published • 14
-
GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation
Paper • 2403.12365 • Published • 10 -
VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation
Paper • 2403.17001 • Published • 6 -
TC4D: Trajectory-Conditioned Text-to-4D Generation
Paper • 2403.17920 • Published • 16 -
DreamPolisher: Towards High-Quality Text-to-3D Generation via Geometric Diffusion
Paper • 2403.17237 • Published • 9
-
GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation
Paper • 2403.12365 • Published • 10 -
RadSplat: Radiance Field-Informed Gaussian Splatting for Robust Real-Time Rendering with 900+ FPS
Paper • 2403.13806 • Published • 18 -
Gaussian Frosting: Editable Complex Radiance Fields with Real-Time Rendering
Paper • 2403.14554 • Published • 12 -
DreamPolisher: Towards High-Quality Text-to-3D Generation via Geometric Diffusion
Paper • 2403.17237 • Published • 9
-
Video as the New Language for Real-World Decision Making
Paper • 2402.17139 • Published • 18 -
Learning and Leveraging World Models in Visual Representation Learning
Paper • 2403.00504 • Published • 31 -
MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
Paper • 2403.01422 • Published • 26 -
VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models
Paper • 2403.05438 • Published • 18
-
Seamless Human Motion Composition with Blended Positional Encodings
Paper • 2402.15509 • Published • 14 -
TripoSR: Fast 3D Object Reconstruction from a Single Image
Paper • 2403.02151 • Published • 12 -
3D-VLA: A 3D Vision-Language-Action Generative World Model
Paper • 2403.09631 • Published • 7 -
Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting
Paper • 2403.09981 • Published • 6
-
TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion
Paper • 2401.09416 • Published • 9 -
SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild
Paper • 2401.10171 • Published • 12 -
DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model
Paper • 2311.09217 • Published • 21 -
GALA: Generating Animatable Layered Assets from a Single Scan
Paper • 2401.12979 • Published • 6
-
One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning
Paper • 2306.07967 • Published • 24 -
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Paper • 2306.07954 • Published • 113 -
TryOnDiffusion: A Tale of Two UNets
Paper • 2306.08276 • Published • 73 -
Seeing the World through Your Eyes
Paper • 2306.09348 • Published • 33