Zhao

Hanyu66

ZZHanyu

AI & ML interests

CV, NLP

Organizations

None yet

Collections 1

Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts
Paper • 2309.15915 • Published Sep 27, 2023 • 2

Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants
Paper • 2310.00653 • Published Oct 1, 2023 • 3

Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities
Paper • 2308.12966 • Published Aug 24, 2023 • 6

An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models
Paper • 2309.09958 • Published Sep 18, 2023 • 18

Models 1

Hanyu66/sd_controlNet

Updated Dec 20, 2023

Datasets

None public yet
