FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs Paper • 2407.04051 • Published Jul 4, 2024 • 35
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning Paper • 2307.04725 • Published Jul 10, 2023 • 64
Real-time Monocular Full-body Capture in World Space via Sequential Proxy-to-Motion Learning Paper • 2307.01200 • Published Jul 3, 2023 • 9