bokesyo committed on
Commit
80c6598
1 Parent(s): a2f5410

Create README.md

  1. README.md +20 -0
README.md ADDED
@@ -0,0 +1,20 @@
+ ---
+ language:
+ - en
+ ---
+
+ # MiniCPM-Visual-Embedding: An OCR-free Visual Document Embedding Model Based on MiniCPM-V-2.0
+
+ With MiniCPM-Visual-Embedding, you can build a knowledge base directly from raw PDFs, books, and documents without any OCR technique or OCR pipeline. The model takes only images as document-side inputs and produces vectors representing document pages.
+
+ [GitHub Repo](https://github.com/bokesyo/minicpm-visual-embedding)
+
+
+ ![Memex Architecture](images/memex.png)
+
+ # News
+
+ - 2024-06-27: We released our first visual embedding model on Hugging Face.
+
+ - 2024-05-08: We released our training code (full-parameter tuning with GradCache and DeepSpeed, supporting large batch sizes across multiple GPUs with ZeRO stage 1) and evaluation code.
+
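
The README says the model turns page images into vectors that represent document pages. The sketch below is not part of this commit and does not call the model. It only illustrates how such page vectors might be searched once they exist, assuming cosine similarity as the ranking metric (the README does not state which metric is used). The function names `cosine_similarity` and `rank_pages` and the toy three-dimensional vectors are hypothetical placeholders.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_pages(query_vec, page_vecs, top_k=3):
    """Return (page_index, score) pairs for the top_k most similar pages."""
    scored = [(i, cosine_similarity(query_vec, v)) for i, v in enumerate(page_vecs)]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy stand-ins for page embeddings (assumption: real embeddings would come
# from the model and have many more dimensions).
page_vectors = [
    [0.9, 0.1, 0.0],
    [0.1, 0.8, 0.1],
    [0.0, 0.2, 0.9],
]
# Toy stand-in for a query embedding.
query_vector = [0.2, 0.9, 0.0]

ranking = rank_pages(query_vector, page_vectors, top_k=2)
for page_index, score in ranking:
    print(f"page {page_index}: similarity {score:.3f}")
```

In a real knowledge base, the page vectors would be computed once per page and stored, and only the query would be embedded at search time.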