Tomi Toivio

Ukuli

TomiToivio

AI & ML interests

Stable Diffusion, NLP, OpenCV etc.

Recent Activity

liked a model 7 days ago

lmms-lab/LLaVA-NeXT-Video-7B-DPO

liked a model 9 days ago

meta-llama/Llama-3.2-11B-Vision

liked a model 10 days ago

llava-hf/LLaVA-NeXT-Video-34B-hf

Ukuli's activity

upvoted a collection 10 days ago

LLaVa-NeXT-Video

LLaVa-NeXT-Video extends LLaVa-NeXT for video understanding. • 5 items • Updated Jun 10 • 6

upvoted an article about 1 month ago

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18 • 278

upvoted 2 collections about 1 month ago

LLaVA-1.6

A collection of LLaVA-1.6 checkpoints • 4 items • Updated Jan 31 • 65

LLaVA-Video

Models focused on video understanding (previously known as LLaVA-NeXT-Video). • 6 items • Updated Oct 5 • 53

upvoted a paper about 2 months ago

Robust Speech Recognition via Large-Scale Weak Supervision

Paper • 2212.04356 • Published Dec 6, 2022 • 23

upvoted a collection about 2 months ago

Llama 3.2

This collection hosts the transformers and original repos of Llama 3.2 and Llama Guard 3 • 15 items • Updated 28 days ago • 482

upvoted a collection 3 months ago

Sapiens

Foundation models for human tasks. Code: https://github.com/facebookresearch/sapiens • 72 items • Updated Sep 18 • 45

upvoted a collection 4 months ago

LLaVa-Interleave

LLaVa models that extend the model's capabilities to multi-image, multi-frame (video), and multi-patch (single-image) scenarios. • 3 items • Updated Jul 10 • 14

upvoted 2 collections 8 months ago

BLIP2 models

A collection of all BLIP2 models! • 5 items • Updated 22 days ago • 16

BLIP models

A collection of all BLIP models • 8 items • Updated 22 days ago • 19

upvoted a paper 12 months ago

SpeechBrain: A General-Purpose Speech Toolkit

Paper • 2106.04624 • Published Jun 8, 2021 • 1