Maxi's picture

7 5 58

Maxi PRO

maxiw

·

AI & ML interests

Computer Agents | VLMs

Recent Activity

New activity 1 day ago

OS-Copilot/ScreenSpot-v2

updated a collection 1 day ago

liked a model 1 day ago

OS-Copilot/OS-Atlas-Pro-7B

Organizations

maxiw's activity

upvoted 3 papers 2 months ago

UI-JEPA: Towards Active Perception of User Intent through Onscreen User Activity

Paper • 2409.04081 • Published Sep 6 • 3

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published Sep 12 • 43

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3 • 82

upvoted a paper 3 months ago

JPEG-LM: LLMs as Image Generators with Canonical Codec Representations

Paper • 2408.08459 • Published Aug 15 • 44

upvoted a collection 3 months ago

XGen-MM-1 models and datasets

A collection of all XGen-MM (Foundation LMM) models! • 15 items • Updated 17 days ago • 34