Théo Gigant

gigant

https://giganttheo.github.io/

AI & ML interests

multimodal summarization, generative models

Recent Activity

liked a model about 20 hours ago

google/siglip-so400m-patch16-256-i18n

updated a Space 14 days ago

huggan/wikiart-diffusion-mini

New activity 14 days ago

huggan/wikiart-diffusion-mini

Articles

Design choices for Vision Language Models in 2024

Apr 16

• 25

gigant's activity

upvoted a paper about 2 months ago

EuroLLM: Multilingual Language Models for Europe

Paper • 2409.16235 • Published Sep 24 • 24

upvoted 2 papers 3 months ago

Contextual Position Encoding: Learning to Count What's Important

Paper • 2405.18719 • Published May 29 • 5

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22 • 118

upvoted 3 papers 4 months ago

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3 • 78

Harvesting Textual and Structured Data from the HAL Publication Repository

Paper • 2407.20595 • Published Jul 30 • 21

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

Paper • 2406.11271 • Published Jun 17 • 20

upvoted 3 articles 4 months ago

Article

Docmatix - a huge dataset for Document Visual Question Answering

Jul 18

• 67

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 265

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Jul 5

• 161

upvoted a paper 4 months ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10 • 67

upvoted 4 papers 5 months ago

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3 • 92

upvoted 2 articles 5 months ago

Article

An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct

Jun 11

• 48

Article

Vision Language Models Explained

Apr 11

• 214

upvoted 4 articles 6 months ago

Article

Uncensor any LLM with abliteration

Jun 13

• 369

Article

Explaining the SDXL latent space

May 20

• 33

Article

AI has a problem with objectifying women

May 24

• 55

Article

MobileNet-V4 (now in timm)

Jun 17

• 39