Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
ytol
's Collections
Multimodal agents (robotics)
Robotics stack
Vision-Language-Action Models
Robotics stack
updated
Apr 23
Upvote
-
openai/whisper-base
Automatic Speech Recognition
•
Updated
Feb 29
•
478k
•
188
HuggingFaceM4/idefics2-8b-AWQ
Image-Text-to-Text
•
Updated
May 6
•
248
•
26
parler-tts/parler_tts_mini_v0.1
Text-to-Speech
•
Updated
Apr 30
•
24.8k
•
346
dora-rs/dora-idefics2
Updated
May 5
•
264
•
5
MIT/ast-finetuned-speech-commands-v2
Audio Classification
•
Updated
Sep 10, 2023
•
54.9k
•
13
jxu124/OpenX-Embodiment
Updated
29 days ago
•
4.32k
•
43
LiheYoung/depth-anything-small-hf
Depth Estimation
•
Updated
Jan 25
•
135k
•
26
ybelkada/segment-anything
Updated
Dec 26, 2023
•
96
Upvote
-
Share collection
View history
Collection guide
Browse collections