Marcus Gawronsky
marcusinthesky
AI & ML interests
Representation Learning
Organizations
Collections
9
-
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Paper • 2410.10139 • Published • 50 -
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
Paper • 2410.10563 • Published • 36 -
LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content
Paper • 2410.10783 • Published • 25 -
TVBench: Redesigning Video-Language Evaluation
Paper • 2410.07752 • Published • 5
Papers
1
models
None public yet
datasets
None public yet