VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation Paper • 2309.00398 • Published Sep 1, 2023 • 20
Teach LLMs to Personalize -- An Approach inspired by Writing Education Paper • 2308.07968 • Published Aug 15, 2023 • 25
Dual-Stream Diffusion Net for Text-to-Video Generation Paper • 2308.08316 • Published Aug 16, 2023 • 23
TeCH: Text-guided Reconstruction of Lifelike Clothed Humans Paper • 2308.08545 • Published Aug 16, 2023 • 33
Thinking Like an Annotator: Generation of Dataset Labeling Instructions Paper • 2306.14035 • Published Jun 24, 2023 • 8
Faster Segment Anything: Towards Lightweight SAM for Mobile Applications Paper • 2306.14289 • Published Jun 25, 2023 • 15
DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing Paper • 2306.14435 • Published Jun 26, 2023 • 20
Kosmos-2: Grounding Multimodal Large Language Models to the World Paper • 2306.14824 • Published Jun 26, 2023 • 34
PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment Paper • 2306.15667 • Published Jun 27, 2023 • 7
SketchMetaFace: A Learning-based Sketching Interface for High-fidelity 3D Character Face Modeling Paper • 2307.00804 • Published Jul 3, 2023 • 5