Tianbao Xie's picture

Tianbao Xie

tianbaoxiexxx

·

https://tianbaoxie.com

AI & ML interests

NLP, AI, RL, Robotics

Recent Activity

liked a dataset 1 day ago

OS-Copilot/OS-Atlas-data

upvoted a paper 3 days ago

liked a model 6 days ago

OpenGVLab/InternVL2-8B-MPO

Organizations

tianbaoxiexxx's activity

upvoted a paper 3 days ago

The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use

Paper • 2411.10323 • Published 6 days ago • 26

upvoted a paper 17 days ago

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Paper • 2410.23218 • Published 22 days ago • 46

upvoted a paper 21 days ago

AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant

Paper • 2410.18603 • Published 29 days ago • 30

upvoted a paper about 2 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25 • 59

upvoted 2 papers 2 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18 • 136

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published Sep 12 • 43

upvoted 3 papers 3 months ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29 • 136

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16 • 97

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Paper • 2408.10188 • Published Aug 19 • 51

upvoted 2 papers 4 months ago

AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents

Paper • 2407.18901 • Published Jul 26 • 32

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Paper • 2407.10956 • Published Jul 15 • 6

upvoted a paper 5 months ago

Needle In A Multimodal Haystack

Paper • 2406.07230 • Published Jun 11 • 52

upvoted an article 6 months ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15

• 166

upvoted a paper 6 months ago

FIFO-Diffusion: Generating Infinite Videos from Text without Training

Paper • 2405.11473 • Published May 19 • 53

upvoted a paper 7 months ago

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11 • 46

upvoted 3 papers 8 months ago

UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models

Paper • 2201.05966 • Published Jan 16, 2022 • 1

In-Context Learning for Few-Shot Dialogue State Tracking

Paper • 2203.08568 • Published Mar 16, 2022 • 1

Binding Language Models in Symbolic Languages

Paper • 2210.02875 • Published Oct 6, 2022 • 1

upvoted 2 papers about 1 year ago

Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning

Paper • 2309.11489 • Published Sep 20, 2023 • 2

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

Paper • 1912.13318 • Published Dec 31, 2019 • 2