Visual Question Answering
Transformers
Safetensors
English
videollama2_mistral
text-generation
multimodal large language model
large video-language model
Inference Endpoints
New discussion

How can I fine-tune the model?

#2 opened about 2 months ago by vigneshwar472

Add new task tag

#1 opened 4 months ago by merve