Stoney Kang

sikang99

AI & ML interests

Remote Control based on Vision

Recent Activity

reacted to merve's post with ❤️ about 1 month ago

liked a model about 1 month ago

rain1011/pyramid-flow-sd3

Organizations

None yet

sikang99's activity

upvoted 2 papers 2 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18 • 136

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18 • 74

upvoted a collection 2 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 218

upvoted an article 3 months ago

Article

Train Custom Models on Hugging Face Spaces with AutoTrain SpaceRunner

Published May 9 • 11

upvoted 2 papers 3 months ago

UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond Scaling

Paper • 2408.04810 • Published Aug 9 • 22

VITA: Towards Open-Source Interactive Omni Multimodal LLM

Paper • 2408.05211 • Published Aug 9 • 46