pharaouk commited on
Commit
a31585c
1 Parent(s): baab364

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -0
README.md CHANGED
@@ -28,6 +28,13 @@ BakLLaVA 2 is cooking with a significantly larger (commercially viable) dataset
28
 
29
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64b7e345f92b20f7a38bf47a/qdYubrBmF7ztAHgdfkkwG.png)
30
 
 
 
 
 
 
 
 
31
 
32
 
33
 
 
28
 
29
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64b7e345f92b20f7a38bf47a/qdYubrBmF7ztAHgdfkkwG.png)
30
 
31
+ # Training dataset
32
+
33
+ -558K filtered image-text pairs from LAION/CC/SBU, captioned by BLIP.
34
+ -158K GPT-generated multimodal instruction-following data.
35
+ -450K academic-task-oriented VQA data mixture.
36
+ -40K ShareGPT data.
37
+ -Additional private data (permissive)
38
 
39
 
40