VILARIN

vilarin

AI & ML interests

Pantheon

vilarin's activity

reacted to merve's post with 👀 25 days ago
reacted to merve's post with 🔥 2 months ago
I have put together a notebook on Multimodal RAG, where we do not process the documents with hefty pipelines but natively use:
- vidore/colpali for retrieval 📖 it doesn't need indexing with image-text pairs but just images!
- Qwen/Qwen2-VL-2B-Instruct for generation 💬 directly feed images as is to a vision language model with no processing to text!
I used the ColPali implementation of the new Byaldi library by @bclavie 🤗
https://github.com/answerdotai/byaldi
Link to notebook: https://github.com/merveenoyan/smol-vision/blob/main/ColPali_%2B_Qwen2_VL.ipynb
reacted to clem's post with 🔥 2 months ago
posted an update 2 months ago
posted an update 3 months ago
🤩 Amazing day. AWPortrait-FL finally here!
🦖 AWPortrait-FL is finetuned on FLUX.1-dev using the training set of AWPortrait-XL and nearly 2,000 fashion photography photos with extremely high aesthetic quality.

🤗 Model: Shakker-Labs/AWPortrait-FL

🙇 Demo: vilarin/flux-labs

posted an update 3 months ago
posted an update 4 months ago
Black Forest Labs, BASED!
FLUX.1 is more delightful, with good instruction following.
FLUX.1 dev (black-forest-labs/FLUX.1-dev) is a 12B-parameter distilled model, second only to Black Forest Labs' state-of-the-art model FLUX.1 pro. 🙀

Update 🤙 Official demo:
black-forest-labs/FLUX.1-dev
replied to merve's post 5 months ago

Thank you :) I updated the demo to support file.

reacted to merve's post with โค๏ธ 5 months ago
THUDM has released GLM-4V-9B and it's.. chatty! 😂
I asked it to describe my favorite Howl's Moving Castle scene and here's how it went 👇🏻

Jokes aside, it seems to outperform previous VLMs. However, the license isn't open-source 📈
model repo: THUDM/glm-4v-9b
a community member has built a demo: vilarin/VL-Chatbox