-
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages
Paper • 2410.16153 • Published • 42 -
AutoTrain: No-code training for state-of-the-art models
Paper • 2410.15735 • Published • 54 -
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
Paper • 2410.12787 • Published • 30 -
LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks
Paper • 2410.01744 • Published • 25
shanshan wang
cooleel
AI & ML interests
None yet
Organizations
Collections
2
models
None public yet