Commit dc2dba4 by bczhou
Parent: fd01149

Upload LlavaForConditionalGeneration

config.json CHANGED
@@ -19,7 +19,7 @@
  "num_key_value_heads": 4,
  "rms_norm_eps": 1e-05,
  "torch_dtype": "bfloat16",
- "vocab_size": 32128
+ "vocab_size": 32064
  },
  "torch_dtype": "float32",
  "transformers_version": "4.36.2",
@@ -36,5 +36,5 @@
  },
  "vision_feature_layer": -2,
  "vision_feature_select_strategy": "default",
- "vocab_size": 32128
+ "vocab_size": 32064
  }
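Both hunks change `vocab_size` from 32128 to 32064, once inside a nested block and once in what appears to be the top level of the file. The following is a minimal sketch of a check that every `vocab_size` in a parsed config agrees. The key name `text_config` and the in-memory `SAMPLE` fragment are assumptions for illustration; the diff does not show the name of the nested block.

```python
import json


def vocab_sizes(config):
    """Collect every `vocab_size` in a parsed config, keyed by dotted path."""
    found = {}

    def walk(node, prefix):
        # Only dicts can contain further keys; scalars and lists are ignored.
        if isinstance(node, dict):
            for key, value in node.items():
                path = f"{prefix}.{key}" if prefix else key
                if key == "vocab_size":
                    found[path] = value
                else:
                    walk(value, path)

    walk(config, "")
    return found


# Hypothetical fragment mirroring the two changed lines; `text_config` is an
# assumed name for the nested block, not something shown in the diff.
SAMPLE = json.loads('{"text_config": {"vocab_size": 32064}, "vocab_size": 32064}')

sizes = vocab_sizes(SAMPLE)
consistent = len(set(sizes.values())) == 1
```

To run the same check on a real checkout, replace `SAMPLE` with `json.load(open("config.json"))`.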
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:96fc22c0b623716dbfb2ab18c917d28ef89dee75cc13f909248f1ace85f18cca
+ oid sha256:5757b71668b0f69d61185ff018ada04d490dac417f998bbd6455ec8a84fa1c72
  size 4979346680
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:426bf0fd97e3def0167ae587a82602c62afe49c1ec69f50eb25d1bebc4fa2861
+ oid sha256:c163e6bae7ffadf52ebc7ccae0763f43055e62cadd6dd78c408d6b2f840021ce
  size 661187448
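
The two `.safetensors` entries are Git LFS pointer files: each records a spec version, a SHA-256 `oid`, and a byte `size`. In this commit both sizes are unchanged and only the hashes differ. A minimal sketch of how a downloaded shard could be checked against its pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, not part of any library, and the pointer text is copied from the new side of the first shard's diff.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse Git LFS pointer text into its version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and SHA-256 recorded in `pointer`."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    # Cheap size check first, so a wrong-sized file is rejected without hashing.
    if os.path.getsize(path) != pointer["size"]:
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in chunks so multi-gigabyte shards are never loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == pointer["digest"]


# Pointer contents for model-00001-of-00002.safetensors after this commit.
NEW_SHARD_1 = """version https://git-lfs.github.com/spec/v1
oid sha256:5757b71668b0f69d61185ff018ada04d490dac417f998bbd6455ec8a84fa1c72
size 4979346680
"""

pointer = parse_lfs_pointer(NEW_SHARD_1)
```

Calling `matches_pointer("model-00001-of-00002.safetensors", pointer)` on a local copy would then report whether the file matches this commit's version of the shard.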