Falcon has landed... again!
And now it not only reads but sees as well.
Here is a summary of the Falcon-11B-VLM model:
Model Type: Causal decoder-only model.
Parameters: 11 billion.
Vision Integration: Combines the pretrained CLIP ViT-L/14 vision encoder with the recently released Falcon2-11B chat-finetuned model, trained on image-text data.
Training: Pretrained on over 5,000 billion tokens from RefinedWeb, supplemented with curated corpora.
Dynamic Encoding: Enhances perception of fine-grained details in images.
Training Hardware: 16 A100 80GB GPUs with ZeRO and Flash-Attention 2.
Tokenizer: Falcon-7B/11B tokenizer.
Languages Supported: Primarily English, with capabilities in German, Spanish, French, Italian, Dutch, Romanian, Czech, Swedish, and more.
License: Open Source - TII Falcon License 2.0, based on Apache 2.0.
Model: tiiuae/falcon-11B-vlm
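For anyone who wants to try it, here is a rough sketch of how the checkpoint might be loaded with the `transformers` LLaVA-NeXT classes. Treat the details as assumptions, not the official recipe: the `User:<image>\n... Falcon:` prompt layout is what I recall from the model card, `build_prompt` and `describe_image` are hypothetical helper names, and the model card may pass extra loading arguments that are omitted here.

```python
def build_prompt(instruction: str) -> str:
    """Wrap an instruction in the chat layout assumed for Falcon-11B-VLM."""
    # Assumption: one <image> placeholder after "User:", answer cued by "Falcon:".
    return f"User:<image>\n{instruction} Falcon:"


def describe_image(image_path: str, instruction: str, max_new_tokens: int = 256) -> str:
    """Run one image + instruction through the model and return the decoded text."""
    # Heavy imports stay local so build_prompt is usable without them installed.
    import torch
    from PIL import Image
    from transformers import LlavaNextForConditionalGeneration, LlavaNextProcessor

    model_id = "tiiuae/falcon-11B-vlm"
    processor = LlavaNextProcessor.from_pretrained(model_id)
    model = LlavaNextForConditionalGeneration.from_pretrained(
        model_id, torch_dtype=torch.bfloat16
    )
    model.to("cuda" if torch.cuda.is_available() else "cpu")

    image = Image.open(image_path).convert("RGB")
    inputs = processor(
        text=build_prompt(instruction), images=image, return_tensors="pt"
    )
    # Move tensors to the model's device; only floating tensors are cast to bfloat16.
    inputs = inputs.to(model.device, torch.bfloat16)

    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return processor.decode(output_ids[0], skip_special_tokens=True)
```

Calling `describe_image("photo.jpg", "Write a long paragraph about this picture.")` would download the full 11B checkpoint, so expect a large download and a GPU with enough memory for bfloat16 weights.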