-
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Paper โข 2404.10667 โข Published โข 17 -
AniClipart: Clipart Animation with Text-to-Video Priors
Paper โข 2404.12347 โข Published โข 12 -
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
Paper โข 2404.09967 โข Published โข 20 -
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Paper โข 2404.05014 โข Published โข 53
Collections
Discover the best community collections!
Collections including paper arxiv:2404.10667
-
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Paper โข 2404.10667 โข Published โข 17 -
CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention
Paper โข 2409.01876 โข Published โข 1 -
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
Paper โข 2312.13578 โข Published โข 27 -
Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
Paper โข 2312.03029 โข Published โข 23
-
Rho-1: Not All Tokens Are What You Need
Paper โข 2404.07965 โข Published โข 84 -
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Paper โข 2404.10667 โข Published โข 17 -
Instruction-tuned Language Models are Better Knowledge Learners
Paper โข 2402.12847 โข Published โข 24 -
DoRA: Weight-Decomposed Low-Rank Adaptation
Paper โข 2402.09353 โข Published โข 26
-
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper โข 2402.17485 โข Published โข 188 -
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
Paper โข 2312.01841 โข Published โข 1 -
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Paper โข 2311.16498 โข Published โข 1 -
GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
Paper โข 2312.02134 โข Published โข 2
-
Can Large Language Models Understand Context?
Paper โข 2402.00858 โข Published โข 21 -
OLMo: Accelerating the Science of Language Models
Paper โข 2402.00838 โข Published โข 80 -
Self-Rewarding Language Models
Paper โข 2401.10020 โข Published โข 144 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper โข 2401.17072 โข Published โข 25
-
PaLI-3 Vision Language Models: Smaller, Faster, Stronger
Paper โข 2310.09199 โข Published โข 24 -
A Zero-Shot Language Agent for Computer Control with Structured Reflection
Paper โข 2310.08740 โข Published โข 14 -
Personality Traits in Large Language Models
Paper โข 2307.00184 โข Published โข 20 -
An Emulator for Fine-Tuning Large Language Models using Small Language Models
Paper โข 2310.12962 โข Published โข 14