Libra
Collection
The official repo for the ICML2024 paper: Libra: Building Decoupled Vision System on Large Language Models
β’
3 items
β’
Updated
Libra: Building Decoupled Vision System on Large Language Models
This repo provides the pretrained weight of Libra vision tokenizer trained with lookup-free quantization.
Please merge the weights into llama-2-7b-chat-hf-libra
(huggingface version of LLaMA2-7B-Chat).
Please download the pretrained CLIP model in huggingface and merge it into the path. The CLIP model can be downloaded here.
The files should be organized as:
llama-2-7b-chat-hf-libra/
|
β # original llama files
|
βββ ...
β
β # newly added vision tokenizer
β
βββ vision_tokenizer_config.yaml
βββ vqgan.ckpt
β
β # CLIP model
β
βββ openai-clip-vit-large-patch14-336/
βββ ...