Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs Paper • 2401.11708 • Published Jan 22 • 30
PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large Language Models Paper • 2402.01118 • Published Feb 2 • 29
Animated Stickers: Bringing Stickers to Life with Video Diffusion Paper • 2402.06088 • Published Feb 8 • 9