Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

liu's picture

6 4 7

liu

che111

NeuralKartMocker's profile picture

·

cheliu-computation

AI & ML interests

None yet

Organizations

Collections 8

Distilling Vision-Language Models on Millions of Videos

Paper • 2401.06129 • Published Jan 11 • 14
Koala: Key frame-conditioned long video-LLM

Paper • 2404.04346 • Published Apr 5 • 5
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Paper • 2404.05726 • Published Apr 8 • 20
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding

Paper • 2406.07471 • Published Jun 11 • 1

Work for 3D Medical Vision

VoCo-LLaMA: Towards Vision Compression with Large Language Models

Paper • 2406.12275 • Published Jun 18 • 29
ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models

Paper • 2405.15738 • Published May 24 • 43
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16 • 97

models 1

che111/my_model

Updated 8 days ago • 302

datasets

None public yet

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs