ZeroCool94
commited on
Commit
•
0fe2326
1
Parent(s):
5d5d3e9
Update README.md
Browse files
README.md
CHANGED
@@ -41,8 +41,8 @@ This model is still in its infancy and it's meant to be constantly updated and t
|
|
41 |
- #### Stable:
|
42 |
- None
|
43 |
- #### Beta:
|
44 |
-
- [vae.
|
45 |
-
- [maskgit.
|
46 |
|
47 |
Note: Checkpoints under the Beta section are updated daily or at least 3-4 times a week. While the beta checkpoints can be used as they are, only the latest version is kept on the repo and the older checkpoints are removed when a new one
|
48 |
is uploaded to keep the repo clean.
|
@@ -57,22 +57,24 @@ The model was trained on the following dataset:
|
|
57 |
- **Hardware:** 1 x Nvidia RTX 3050 GPU
|
58 |
- **Hours Trained:** NaN.
|
59 |
- **Gradient Accumulations**: 10
|
60 |
-
- **Batch:** 1
|
61 |
- **Learning Rate:** 1e-5
|
62 |
-
- **Learning Rate Scheduler:** `
|
63 |
- **Scheduler Power:** 1.0
|
64 |
- **Optimizer:** Adam
|
65 |
- **Warmup Steps:** 10,000
|
66 |
-
- **Number of Cycles:**
|
67 |
- **Resolution/Image Size**: First trained at a resolution of 64x64, then increased to 256x256 and then to 512x512. Check the notes down below for more details on this.
|
68 |
- **Dimension:** 32
|
69 |
- **vq_codebook_dim:** 8192
|
70 |
- **vq_codebook_size:** 8192
|
|
|
|
|
71 |
- **heads:** 8
|
72 |
- **depth:** 4
|
73 |
- **Random Crop:** True
|
74 |
-
- **Total MaskGit Training Steps:**
|
75 |
-
- **Total VAE Training Steps:**
|
76 |
|
77 |
Note: On Muse we can change the image_size or resolution at any time without having to train the model from scratch again, this allows us to first train the model at low resolution using the same `dim` and `vq_codebook_size` to train faster and then we can increase the `image_size` and use a higher resolution once the model has trained enough.
|
78 |
|
|
|
41 |
- #### Stable:
|
42 |
- None
|
43 |
- #### Beta:
|
44 |
+
- [vae.12145000.pt](https://huggingface.co/Sygil/Sygil-Muse/blob/main/vae.12145000.pt): Trained from scratch for 12.14M steps with **dim: 32**,**vq_codebook_dim: 8192** and **vq_codebook_size: 8192**.
|
45 |
+
- [maskgit.5125000.pt](https://huggingface.co/Sygil/Sygil-Muse/blob/main/maskgit.5125000.pt): Maskgit trained from the beta VAE for 5.12M steps.
|
46 |
|
47 |
Note: Checkpoints under the Beta section are updated daily or at least 3-4 times a week. While the beta checkpoints can be used as they are, only the latest version is kept on the repo and the older checkpoints are removed when a new one
|
48 |
is uploaded to keep the repo clean.
|
|
|
57 |
- **Hardware:** 1 x Nvidia RTX 3050 GPU
|
58 |
- **Hours Trained:** NaN.
|
59 |
- **Gradient Accumulations**: 10
|
60 |
+
- **Batch Size:** 1
|
61 |
- **Learning Rate:** 1e-5
|
62 |
+
- **Learning Rate Scheduler:** `constant_with_warmup`
|
63 |
- **Scheduler Power:** 1.0
|
64 |
- **Optimizer:** Adam
|
65 |
- **Warmup Steps:** 10,000
|
66 |
+
- **Number of Cycles:** 200
|
67 |
- **Resolution/Image Size**: First trained at a resolution of 64x64, then increased to 256x256 and then to 512x512. Check the notes down below for more details on this.
|
68 |
- **Dimension:** 32
|
69 |
- **vq_codebook_dim:** 8192
|
70 |
- **vq_codebook_size:** 8192
|
71 |
+
- **num_tokens:** 8192
|
72 |
+
- **seq_len:** 1024
|
73 |
- **heads:** 8
|
74 |
- **depth:** 4
|
75 |
- **Random Crop:** True
|
76 |
+
- **Total MaskGit Training Steps:** 5,125,000
|
77 |
+
- **Total VAE Training Steps:** 12,145,000
|
78 |
|
79 |
Note: On Muse we can change the image_size or resolution at any time without having to train the model from scratch again, this allows us to first train the model at low resolution using the same `dim` and `vq_codebook_size` to train faster and then we can increase the `image_size` and use a higher resolution once the model has trained enough.
|
80 |
|