Teach Multimodal LLMs to Comprehend Electrocardiographic Images Paper โข 2410.19008 โข Published 19 days ago โข 22 โข 2
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages Paper โข 2410.16153 โข Published 19 days ago โข 42 โข 3
Harnessing Webpage UIs for Text-Rich Visual Understanding Paper โข 2410.13824 โข Published 23 days ago โข 29 โข 2
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark Paper โข 2409.02813 โข Published Sep 4 โข 28 โข 3