JB-Bai commited on
Commit
c0ac455
1 Parent(s): 046f471
.DS_Store ADDED
Binary file (8.2 kB). View file
 
README.md CHANGED
@@ -2,3 +2,25 @@
2
  license: apache-2.0
3
  ---
4
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
  license: apache-2.0
3
  ---
4
 
5
+ # Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
6
+
7
+ ### [Paper](https://arxiv.org/abs/) | [Code](https://github.com/viiika/Meissonic)
8
+
9
+
10
+ ## Introduction
11
+ Meissonic is a non-autoregressive mask image modeling text-to-image synthesis model that can generate high-resolution images. It is designed to run on consumer graphics cards.
12
+
13
+ ## Usage
14
+
15
+ Under Construction. Please check back later.
16
+
17
+ ## Citation
18
+ If you find this work helpful, please consider citing:
19
+ ```bibtex
20
+ @article{bai2024meissonic,
21
+ title={Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis},
22
+ author={Bai, Jinbin and Ye, Tian and Chow, Wei and Song, Enxin and Chen, Qing-Guo and Li, Xiangtai and Dong, Zhen and Zhu, Lei and Yan, Shuicheng},
23
+ journal={arXiv preprint arXiv},
24
+ year={2024}
25
+ }
26
+ ```
model_index.json CHANGED
@@ -2,7 +2,7 @@
2
  "_class_name": "Pipeline",
3
  "_diffusers_version": "0.30.2",
4
  "scheduler": [
5
- "scheduler",
6
  "Scheduler"
7
  ],
8
  "text_encoder": [
@@ -14,7 +14,7 @@
14
  "CLIPTokenizer"
15
  ],
16
  "transformer": [
17
- "transformer",
18
  "Transformer2DModel"
19
  ],
20
  "vqvae": [
 
2
  "_class_name": "Pipeline",
3
  "_diffusers_version": "0.30.2",
4
  "scheduler": [
5
+ "src.scheduler",
6
  "Scheduler"
7
  ],
8
  "text_encoder": [
 
14
  "CLIPTokenizer"
15
  ],
16
  "transformer": [
17
+ "src.transformer",
18
  "Transformer2DModel"
19
  ],
20
  "vqvae": [
text_encoder/model.fp16.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:42a6a63bcfcb0d7cc9e2a687134ceb7cb83d0346285636ec8547e7ffa2bcd224
3
+ size 708111984