Alex Yang

yinfeiy86

AI & ML interests

None yet

Organizations

None yet

yinfeiy86's activity

upvoted a paper 24 days ago

Improve Vision Language Model Chain-of-thought Reasoning

Paper • 2410.16198 • Published 26 days ago • 17

upvoted 4 papers about 1 month ago

DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation

Paper • 2410.08159 • Published Oct 10 • 23

Progressive Autoregressive Video Diffusion Models

Paper • 2410.08151 • Published Oct 10 • 15

MM-Ego: Towards Building Egocentric Multimodal LLMs

Paper • 2410.07177 • Published Oct 9 • 20

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

Paper • 2410.02073 • Published Oct 2 • 40

upvoted a paper 5 months ago

Understanding Alignment in Multimodal LLMs: A Comprehensive Study

Paper • 2407.02477 • Published Jul 2 • 21

upvoted a paper 7 months ago

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Paper • 2404.05719 • Published Apr 8 • 80