Upload folder using huggingface_hub

#1
by Kardbord - opened
README.md ADDED
@@ -0,0 +1,612 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: creativeml-openrail-m
3
+ tags:
4
+ - stable-diffusion
5
+ - stable-diffusion-diffusers
6
+ - text-to-image
7
+ inference: true
8
+ extra_gated_prompt: "This model is open access and available to all, with a CreativeML\
9
+ \ OpenRAIL-M license further specifying rights and usage.\nThe CreativeML OpenRAIL\
10
+ \ License specifies: \n\n1. You can't use the model to deliberately produce nor\
11
+ \ share illegal or harmful outputs or content \n2. CompVis claims no rights on the\
12
+ \ outputs you generate, you are free to use them and are accountable for their use\
13
+ \ which must not go against the provisions set in the license\n3. You may re-distribute\
14
+ \ the weights and use the model commercially and/or as a service. If you do, please\
15
+ \ be aware you have to include the same use restrictions as the ones in the license\
16
+ \ and share a copy of the CreativeML OpenRAIL-M to all your users (please read the\
17
+ \ license entirely and carefully)\nPlease read the full license carefully here:\
18
+ \ https://huggingface.co/spaces/CompVis/stable-diffusion-license"
19
+ ---
20
+ # Overview
21
+
22
+ This is simply johnslegers/epic-diffusion with the safety checker disabled.
23
+
24
+ **DO NOT** attempt to use this model to generate harmful or illegal content.
25
+
26
+
27
+ [![Example][1]][1]
28
+
29
+ ## Why Epic Diffusion
30
+
31
+ Epîc Diffusion is a general purpose model based on Stable Diffusion 1.x intended to replace the official SD releases
32
+ as your default model. It is focused on providing high quality output in a wide range of different styles, with support
33
+ for NFSW content.
34
+
35
+ Epîc Diffusion 1.0 is a heavily calibrated merge of SD 1.4, SD 1.5, Analog Diffusion, Wavy Diffusion,
36
+ Openjourney Diffusion, Samdoesarts Ultramerge, postapocalypse, Elldreth's Dream, Inkpunk Diffusion,
37
+ Arcane Diffusion & Van Gogh Diffusion blended and reblended multiple times until I got the quality & consistency
38
+ I was looking for...
39
+
40
+ Epic Diffusion is also [available on CivitAI](https://civitai.com/models/3855/epic-diffusion).
41
+
42
+ ## License
43
+
44
+ This model is open access and available to all, with a CreativeML OpenRAIL-M
45
+ license further specifying rights and usage.
46
+
47
+ The CreativeML OpenRAIL License specifies:
48
+
49
+ 1. You can't use the model to deliberately produce nor share illegal or
50
+ harmful outputs or content
51
+
52
+ 2. CompVis claims no rights on the outputs you generate, you are free to use
53
+ them and are accountable for their use which must not go against the
54
+ provisions set in the license
55
+
56
+ 3. You may re-distribute the weights and use the model commercially and/or as
57
+ a service. If you do, please be aware you have to include the same use
58
+ restrictions as the ones in the license and share a copy of the CreativeML
59
+ OpenRAIL-M to all your users (please read the license entirely and carefully)
60
+
61
+ <a href="https://www.buymeacoffee.com/johnslegers" target="_blank">
62
+ <img src="https://cdn.buymeacoffee.com/buttons/v2/default-yellow.png" alt="Buy Me A Coffee" style="height: 45px !important;width: 162px !important;" >
63
+ </a>
64
+
65
+ ## Example prompts
66
+
67
+ <table>
68
+ <tr style="border: 1px solid;background:#e5e7eb">
69
+ <th style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
70
+ Prompt
71
+ </th>
72
+ <th style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
73
+ Parameters
74
+ </th>
75
+ <th style="vertical-align:top;padding:.5714286em!important;border: 1px solid;min-width:270px">
76
+ Output
77
+ </th>
78
+ </tr>
79
+ <tr>
80
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
81
+ scarlett johansson, in the style of Wes Anderson, highly detailed, unreal engine, octane render, 8k
82
+ </td>
83
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
84
+ <b>Steps:</b><br>20<br>
85
+ <b>Sampler:</b><br>Euler a<br>
86
+ <b>CFG scale:</b><br>7<br>
87
+ <b>Seed:</b><br>2263657329<br>
88
+ <b>Size:</b><br>512x512
89
+ </td>
90
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
91
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/0oZij.png">
92
+ </td>
93
+ </tr>
94
+ <tr>
95
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
96
+ sansa angeline jolie gessica chastain mummy, intricate, elegant, highly detailed, digital painting, artstation, concept art, smooth, sharp focus, illustration, art by artgerm and greg rutkowski and alphonse mucha and william - adolphe bouguereau
97
+ </td>
98
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
99
+ <b>Steps:</b><br>20<br>
100
+ <b>Sampler:</b><br>Euler a<br>
101
+ <b>CFG scale:</b><br>7<br>
102
+ <b>Seed:</b><br>1310341382<br>
103
+ <b>Size:</b><br>512x512
104
+ </td>
105
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
106
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/mnnBR.png">
107
+ </td>
108
+ </tr>
109
+ <tr>
110
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
111
+ Pokimane, Feminine, Mercy, Perfect Sexy Symmetrical Face, Detailed Pupils, Pensive Smirk, Look at Viewer, Leaf Armor, Ilya Kuvshinov, Gil Elvgren, Mucha. Intricate, Octane Render, 4KUHD, Centered, Oil Painting, Bokeh, Rim Lighting.
112
+ </td>
113
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
114
+ <b>Steps:</b><br>20<br>
115
+ <b>Sampler:</b><br>Euler a<br>
116
+ <b>CFG scale:</b><br>7<br>
117
+ <b>Seed:</b><br>4142902194<br>
118
+ <b>Size:</b><br>512x512
119
+ </td>
120
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
121
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/v9NoC.png">
122
+ </td>
123
+ </tr>
124
+ <tr>
125
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
126
+ Mature babe,artgerm Style, gerald brom, atey ghailan, mike mignola, short cut off shirt knot, wide hips, showing off, exposing herself vulnerable, blushing, exited, confident, demanding, joyful, trending on artstation, double split complementary colors, intricate details, highly detailed,
127
+ </td>
128
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
129
+ <b>Steps:</b><br>20<br>
130
+ <b>Sampler:</b><br>Euler a<br>
131
+ <b>CFG scale:</b><br>7<br>
132
+ <b>Seed:</b><br>3954688283<br>
133
+ <b>Size:</b><br>512x512
134
+ </td>
135
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
136
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/vl0bc.png">
137
+ </td>
138
+ </tr>
139
+ <tr>
140
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
141
+ planet base, windows, night, ground level, no man's sky, digital art, highly detailed, intricate, sharp focus, Trending on Artstation HQ, deviantart, unreal engine 5, 4K UHD image
142
+ </td>
143
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
144
+ <b>Steps:</b><br>20<br>
145
+ <b>Sampler:</b><br>Euler a<br>
146
+ <b>CFG scale:</b><br>7<br>
147
+ <b>Seed:</b><br>895811336<br>
148
+ <b>Size:</b><br>512x512
149
+ </td>
150
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
151
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/D2GNK.png">
152
+ </td>
153
+ </tr>
154
+ <tr>
155
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
156
+ berchtesgaden, hyperdetailed, detailed faces, artgerm, wolfenstein, portal 2, Leartes Studios, assassin's creed, alphonse mucha, bouguereau, edmund blair leighton, greg kadel, dynamic lighting, delicate, unreal engine, octane render, 8k
157
+ </td>
158
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
159
+ <b>Steps:</b><br>20<br>
160
+ <b>Sampler:</b><br>Euler a<br>
161
+ <b>CFG scale:</b><br>7<br>
162
+ <b>Seed:</b><br>1172925287<br>
163
+ <b>Size:</b><br>512x512
164
+ </td>
165
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
166
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/m7Xkb.png">
167
+ </td>
168
+ </tr>
169
+ <tr>
170
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
171
+ princess, detailed portrait, hyperdetailed, detailed faces, irakli nadar, magali villeneuve, Assassin's Creed, Tim Hildebrandt, Ilya Kuvshinov, artgem, greg kadel, dynamic lighting, delicate, unreal engine, octane render, 8k
172
+ </td>
173
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
174
+ <b>Steps:</b><br>20<br>
175
+ <b>Sampler:</b><br>Euler a<br>
176
+ <b>CFG scale:</b><br>7<br>
177
+ <b>Seed:</b><br>2096567313<br>
178
+ <b>Size:</b><br>512x512
179
+ </td>
180
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
181
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/LwPPa.png">
182
+ </td>
183
+ </tr>
184
+ <tr>
185
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
186
+ a Photorealistic dramatic hyperrealistic bright blue eyes, African American elegant girl, black hair, white veil,by WLOP,Artgerm,Greg Rutkowski,Alphonse Mucha, Beautiful dynamic dramatic bright sunset lighting,shadows,cinematic atmosphere,Artstation,concept design art,Octane render,8k
187
+ </td>
188
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
189
+ <b>Steps:</b><br>20<br>
190
+ <b>Sampler:</b><br>Euler a<br>
191
+ <b>CFG scale:</b><br>7<br>
192
+ <b>Seed:</b><br>2999946689<br>
193
+ <b>Size:</b><br>512x512
194
+ </td>
195
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
196
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/1nH9c.png">
197
+ </td>
198
+ </tr>
199
+ <tr>
200
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
201
+ cutest girl in the world outside, (detailed portrait), in the style of fernanda suarez and simon stalenhag and Ilya Kuvshinov and Wlop and Artgerm and Chie Yoshii and Greg Rutkowski and Waking Life, trending on artstation, featured on pixiv, dynamic lighting, highly detailed, ambient lighting, octane render, 8k
202
+ </td>
203
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
204
+ <b>Steps:</b><br>20<br>
205
+ <b>Sampler:</b><br>Euler a<br>
206
+ <b>CFG scale:</b><br>7<br>
207
+ <b>Seed:</b><br>2249388004<br>
208
+ <b>Size:</b><br>512x512
209
+ </td>
210
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
211
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/uNux1.png">
212
+ </td>
213
+ </tr>
214
+ <tr>
215
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
216
+ military academy, (detailed portrait), steampunk, in the style of arcane and fernanda suarez and dishonored and bioshock and simon stalenhag and Ilya Kuvshinov and Wlop and Artgerm, trending on artstation, featured on pixiv, dynamic lighting, highly detailed, ambient lighting, octane render, 8k
217
+ </td>
218
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
219
+ <b>Steps:</b><br>20<br>
220
+ <b>Sampler:</b><br>Euler a<br>
221
+ <b>CFG scale:</b><br>7<br>
222
+ <b>Seed:</b><br>3877530043<br>
223
+ <b>Size:</b><br>512x512
224
+ </td>
225
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
226
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/sFXCi.png">
227
+ </td>
228
+ </tr>
229
+ <tr>
230
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
231
+ beautiful female assassin wearing cyberpunk clothing, respirator, cybernetic respirator, (detailed portrait), cell shaded, 4 k, vivid colours, photorealistic concept art by wlop, ilya kuvshinov, artgerm, krenz cushart, greg rutkowski, pixiv. cinematic dramatic atmosphere, sharp focus, volumetric lighting, cinematic lighting, studio quality
232
+ </td>
233
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
234
+ <b>Steps:</b><br>20<br>
235
+ <b>Sampler:</b><br>Euler a<br>
236
+ <b>CFG scale:</b><br>7<br>
237
+ <b>Seed:</b><br>3388890157<br>
238
+ <b>Size:</b><br>512x512
239
+ </td>
240
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
241
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/14iZS.png">
242
+ </td>
243
+ </tr>
244
+ <tr>
245
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
246
+ cemetary, pen and ink, in the style of gustave dore highly detailed, octane render, 8k, trending on artstation, sharp focus, studio photo, intricate details, highly detailed, by greg rutkowski
247
+ </td>
248
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
249
+ <b>Steps:</b><br>20<br>
250
+ <b>Sampler:</b><br>Euler a<br>
251
+ <b>CFG scale:</b><br>7<br>
252
+ <b>Seed:</b><br>568457114<br>
253
+ <b>Size:</b><br>512x512
254
+ </td>
255
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
256
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/D1hsN.png">
257
+ </td>
258
+ </tr>
259
+ <tr>
260
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
261
+ dubai, hyperdetailed, detailed faces, artgem, irakli nadar, mass effect, Tim Hildebrandt, Ilya Kuvshinov, liam wong, greg rutkowski, greg kadel, dynamic lighting, delicate, unreal engine, octane render, 8k, centered, symmetry, painted, intricate, volumetric lighting, beautiful, rich deep colors masterpiece, sharp focus, ultra detailed, in the style of dan mumford and marc simonetti, astrophotography
262
+ </td>
263
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
264
+ <b>Steps:</b><br>20<br>
265
+ <b>Sampler:</b><br>DPM++ SDE<br>
266
+ <b>CFG scale:</b><br>7<br>
267
+ <b>Seed:</b><br>4262868463<br>
268
+ <b>Size:</b><br>512x512
269
+ </td>
270
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
271
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/4uPzr.png">
272
+ </td>
273
+ </tr>
274
+ <tr>
275
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
276
+ Little cute forest fluffy chibi cuteness overload, sunny magical background, ultra precious details, intricate details, volumetric lighting, photo realistic, lifelike, photography, digital art, 8k, trending on artstation, sharp focus, studio photo, intricate details, highly detailed, by greg rutkowski, sharp focus, emitting diodes, smoke, artillery, sparks, racks, system unit, motherboard, by pascal blanche rutkowski repin artstation hyperrealism painting concept art of detailed character design matte painting, 4 k resolution blade runner
277
+ </td>
278
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
279
+ <b>Steps:</b><br>20<br>
280
+ <b>Sampler:</b><br>DPM++ SDE Karras<br>
281
+ <b>CFG scale:</b><br>7<br>
282
+ <b>Seed:</b><br>3849507891<br>
283
+ <b>Size:</b><br>512x512
284
+ </td>
285
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
286
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/4yTQP.png">
287
+ </td>
288
+ </tr>
289
+ <tr>
290
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
291
+ 15 year old schoolgirl with short straight hair, blue eyes, cute, friendly, round face, cottagecore, intricate, enlightened, highly detailed, digital painting, artstation, concept art, smooth, sharp focus, illustration, art by artgerm and greg rutkowski and alphonse mucha
292
+ </td>
293
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
294
+ <b>Steps:</b><br>20<br>
295
+ <b>Sampler:</b><br>Euler a<br>
296
+ <b>CFG scale:</b><br>7<br>
297
+ <b>Seed:</b><br>2276800560<br>
298
+ <b>Size:</b><br>512x512
299
+ </td>
300
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
301
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/gqynB.png">
302
+ </td>
303
+ </tr>
304
+ <tr>
305
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
306
+ extreme wide shot a futuristic containment building in a rainforest valley with a city in the distance, national geographic, hyper realistic, 4 k, harsh light
307
+ </td>
308
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
309
+ <b>Steps:</b><br>20<br>
310
+ <b>Sampler:</b><br>Euler a<br>
311
+ <b>CFG scale:</b><br>7<br>
312
+ <b>Seed:</b><br>3260458902<br>
313
+ <b>Size:</b><br>512x512
314
+ </td>
315
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
316
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/8qH9Y.png">
317
+ </td>
318
+ </tr>
319
+ <tr>
320
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
321
+ portrait of a middle - eastern female cleric with straight black hair wearing blue and yellow vestments casting fireball, fantasy, highly detailed, digital painting, artstation, concept art, character art, art by greg rutkowski and tyler jacobson and alphonse mucha
322
+ </td>
323
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
324
+ <b>Steps:</b><br>20<br>
325
+ <b>Sampler:</b><br>Euler a<br>
326
+ <b>CFG scale:</b><br>7<br>
327
+ <b>Seed:</b><br>1379894453<br>
328
+ <b>Size:</b><br>512x512
329
+ </td>
330
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
331
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/BP98Y.png">
332
+ </td>
333
+ </tr>
334
+ <tr>
335
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
336
+ aSnowshoe Siamese Cat as the doomslayer, realistic scifi cyberpunk power armor robot, closeup portrait art by donato giancola and greg rutkowski, vintage retro scifi, realistic face, digital art, trending on artstation, symmetry
337
+ </td>
338
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
339
+ <b>Steps:</b><br>20<br>
340
+ <b>Sampler:</b><br>Euler a<br>
341
+ <b>CFG scale:</b><br>7<br>
342
+ <b>Seed:</b><br>2122325442<br>
343
+ <b>Size:</b><br>512x512
344
+ </td>
345
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
346
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/GYdOS.png">
347
+ </td>
348
+ </tr>
349
+ <tr>
350
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
351
+ Beautiful boy by René Magritte
352
+ </td>
353
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
354
+ <b>Steps:</b><br>20<br>
355
+ <b>Sampler:</b><br>Euler a<br>
356
+ <b>CFG scale:</b><br>7<br>
357
+ <b>Seed:</b><br>1753689226<br>
358
+ <b>Size:</b><br>512x512
359
+ </td>
360
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
361
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/vP9sv.png">
362
+ </td>
363
+ </tr>
364
+ <tr>
365
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
366
+ portrait of a dark god, copper wires, visible scars and nerves, intricate, headshot, highly detailed, digital painting, artstation, concept art, sharp focus, cinematic lighting, illustration, art by artgerm and greg rutkowski, alphonse mocha, cgsociety, Olivia
367
+ </td>
368
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
369
+ <b>Steps:</b><br>20<br>
370
+ <b>Sampler:</b><br>Euler a<br>
371
+ <b>CFG scale:</b><br>7<br>
372
+ <b>Seed:</b><br>3355776798<br>
373
+ <b>Size:</b><br>512x512
374
+ </td>
375
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
376
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/A94Gg.png">
377
+ </td>
378
+ </tr>
379
+ <tr>
380
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
381
+ knight warrior helmet skyrim mask elder scrolls v nordic armor bethesda adam adamowicz illustration character design concept, unreal 5, daz, hyperrealistic, octane render, cosplay, rpg portrait, dynamic lighting, intricate detail, harvest fall vibrancy, cinematic volume inner glowing aura global illumination ray tracing hdr
382
+ </td>
383
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
384
+ <b>Steps:</b><br>20<br>
385
+ <b>Sampler:</b><br>Euler a<br>
386
+ <b>CFG scale:</b><br>7<br>
387
+ <b>Seed:</b><br>1938574287<br>
388
+ <b>Size:</b><br>512x512
389
+ </td>
390
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
391
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/efGrz.png">
392
+ </td>
393
+ </tr>
394
+ <tr>
395
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
396
+ berserker portrait, d&d style, fantasy, photorealistic, highly detailed, artstation, smooth, sharp focus, art by michael whelan, artgerm, greg rutkowski and alphonse mucha
397
+ </td>
398
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
399
+ <b>Steps:</b><br>20<br>
400
+ <b>Sampler:</b><br>DPM++ SDE Karras<br>
401
+ <b>CFG scale:</b><br>7<br>
402
+ <b>Seed:</b><br>156077154<br>
403
+ <b>Size:</b><br>512x512
404
+ </td>
405
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
406
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/Wbjgp.png">
407
+ </td>
408
+ </tr>
409
+ <tr>
410
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
411
+ symmetry product render poster vivid colors classical proportion car, glowing fog intricate, elegant, highly detailed, digital painting, art station, concept art, smooth, sharp focus, illustration,
412
+ </td>
413
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
414
+ <b>Steps:</b><br>20<br>
415
+ <b>Sampler:</b><br>DPM++ SDE Karras<br>
416
+ <b>CFG scale:</b><br>7<br>
417
+ <b>Seed:</b><br>4294525772<br>
418
+ <b>Size:</b><br>512x512
419
+ </td>
420
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
421
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/sMMpR.png">
422
+ </td>
423
+ </tr>
424
+ <tr>
425
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
426
+ Futuristic Vintage Medium Shot 1920's Poster with Cyberpunk, ovni, tron biker with helmet bike, black in color, with a cyberpunk city background, futuristic lighting, cinematic lighting, cozy lighting, 8k, cinematic poster vintage 1800s
427
+ </td>
428
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
429
+ <b>Steps:</b><br>20<br>
430
+ <b>Sampler:</b><br>Euler a<br>
431
+ <b>CFG scale:</b><br>7<br>
432
+ <b>Seed:</b><br>1229558409<br>
433
+ <b>Size:</b><br>512x512
434
+ </td>
435
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
436
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/0Gojz.png">
437
+ </td>
438
+ </tr>
439
+ <tr>
440
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
441
+ beautiful, young woman, cybernetic, cyberpunk, detailed gorgeous face, flowing hair, vaporwave aesthetic, synthwave , digital painting, artstation, concept art, smooth, sharp focus, illustration, art by artgerm and greg rutkowski and alphonse mucha
442
+ </td>
443
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
444
+ <b>Steps:</b><br>20<br>
445
+ <b>Sampler:</b><br>Euler a<br>
446
+ <b>CFG scale:</b><br>7<br>
447
+ <b>Seed:</b><br>264509871<br>
448
+ <b>Size:</b><br>512x512
449
+ </td>
450
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
451
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/zFdjj.png">
452
+ </td>
453
+ </tr>
454
+ <tr>
455
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
456
+ strong warrior princess| centered| key visual| intricate| highly detailed| breathtaking beauty| precise lineart| vibrant| comprehensive cinematic| Carne Griffiths| Conrad Roset
457
+ </td>
458
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
459
+ <b>Steps:</b><br>20<br>
460
+ <b>Sampler:</b><br>Euler a<br>
461
+ <b>CFG scale:</b><br>7<br>
462
+ <b>Seed:</b><br>16<br>
463
+ <b>Size:</b><br>512x512
464
+ </td>
465
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
466
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/aGuIL.png">
467
+ </td>
468
+ </tr>
469
+ <tr>
470
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
471
+ portrait of a rugged 19th century man with mutton chops in a jacket, victorian, concept art, detailed face, fantasy, close up face, highly detailed, cinematic lighting, digital art painting by greg rutkowski
472
+ </td>
473
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
474
+ <b>Steps:</b><br>20<br>
475
+ <b>Sampler:</b><br>Euler a<br>
476
+ <b>CFG scale:</b><br>7<br>
477
+ <b>Seed:</b><br>16<br>
478
+ <b>Size:</b><br>512x512
479
+ </td>
480
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
481
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/6sKW6.png">
482
+ </td>
483
+ </tr>
484
+ <tr>
485
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
486
+ side profile of cyberpunk body with cyborg skull | cyberpunk | styled in Art Nouveau | insanely detailed | embellishments | high definition | concept art | digital art | vibrant
487
+ </td>
488
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
489
+ <b>Steps:</b><br>20<br>
490
+ <b>Sampler:</b><br>Euler a<br>
491
+ <b>CFG scale:</b><br>7<br>
492
+ <b>Seed:</b><br>16<br>
493
+ <b>Size:</b><br>512x512
494
+ </td>
495
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
496
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/N7kSu.png">
497
+ </td>
498
+ </tr>
499
+ <tr>
500
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
501
+ a cute little matte low poly isometric cherry blossom forest island, pink waterfalls, mist, lat lighting, soft shadows, trending on artstation, 3d render, monument valley, fez video game,
502
+ </td>
503
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
504
+ <b>Steps:</b><br>20<br>
505
+ <b>Sampler:</b><br>Euler a<br>
506
+ <b>CFG scale:</b><br>7<br>
507
+ <b>Seed:</b><br>16<br>
508
+ <b>Size:</b><br>512x512
509
+ </td>
510
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
511
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/fVj9N.png">
512
+ </td>
513
+ </tr>
514
+ <tr>
515
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
516
+ high resolution concept art of an apartment living room overlooking a large futuristic city with floor to ceiling windows and mid century modern furniture cinematic lighting cgsociety
517
+ </td>
518
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
519
+ <b>Steps:</b><br>20<br>
520
+ <b>Sampler:</b><br>Euler a<br>
521
+ <b>CFG scale:</b><br>7<br>
522
+ <b>Seed:</b><br>850995814<br>
523
+ <b>Size:</b><br>512x512
524
+ </td>
525
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
526
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/jkpgU.png">
527
+ </td>
528
+ </tr>
529
+ <tr>
530
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
531
+ hyperrealistic full length portrait of gorgeous watson from apex legends | blonde | detailed gorgeous face!! | full body!! | armor | intricate | elegant | realistic | hyperrealistic | cinematic | character design | concept art | highly detailed | illustration | digital art | digital painting | depth of field | illustrated by tim brown lee
532
+ </td>
533
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
534
+ <b>Steps:</b><br>20<br>
535
+ <b>Sampler:</b><br>Euler a<br>
536
+ <b>CFG scale:</b><br>7<br>
537
+ <b>Seed:</b><br>3002798343<br>
538
+ <b>Size:</b><br>512x512
539
+ </td>
540
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
541
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/hMsH2.png">
542
+ </td>
543
+ </tr>
544
+ <tr>
545
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
546
+ Chibi spiderman, high redolution, 3D rendering, octane rendering, modern Disney style
547
+ </td>
548
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
549
+ <b>Steps:</b><br>20<br>
550
+ <b>Sampler:</b><br>Euler a<br>
551
+ <b>CFG scale:</b><br>7<br>
552
+ <b>Seed:</b><br>3232863832<br>
553
+ <b>Size:</b><br>512x512
554
+ </td>
555
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
556
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/zl18l.png">
557
+ </td>
558
+ </tr>
559
+ <tr>
560
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
561
+ photo of the most beautiful artwork in the world featuring soft lustrous, industrial mechanic real world, fantastic location, working environment, rugged harsh situation worker, full body 8k unity render, action shot, skin pores, detailed intricate iris, very dark lighting, heavy shadows, detailed, detailed face, (vibrant, photo realistic, realistic, dramatic, dark, sharp focus, 8k), (weathered greasy dirty damaged old worn technician worker outfit:1.1), (intricate:1.1), (highly detailed:1.1), digital painting, octane render, artstation, concept art, smooth, sharp focus, illustration, art by artgerm, (loish:0.23), wlop ilya kuvshinov., (global illumination, studio light, volumetric light)<br><br>
562
+ <b>Negative prompt:</b> Asian, black and white, close up, cartoon, 3d, denim, (disfigured), (deformed), (poorly drawn), (extra limbs), blurry, boring, sketch, lackluster, signature, letters, watermark, low res , horrific , mutated , artifacts , bad art , gross , b&w , poor quality , low quality , cropped
563
+ </td>
564
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
565
+ <b>Steps:</b><br>30<br>
566
+ <b>Sampler:</b><br>DPM++ SDE Karras<br>
567
+ <b>CFG scale:</b><br>10<br>
568
+ <b>Seed:</b><br>169686802<br>
569
+ <b>Size:</b><br>512x640
570
+ </td>
571
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
572
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/dPnAA.png">
573
+ </td>
574
+ </tr>
575
+ <tr>
576
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
577
+ photo of the most beautiful artwork in the world featuring soft lustrous, industrial mechanic real world, fantastic location, working environment, rugged harsh situation worker, full body 8k unity render, action shot, skin pores, detailed intricate iris, very dark lighting, heavy shadows, detailed, detailed face, (vibrant, photo realistic, realistic, dramatic, dark, sharp focus, 8k), (weathered greasy dirty damaged old worn technician worker outfit:1.1), (intricate:1.1), (highly detailed:1.1), digital painting, octane render, artstation, concept art, smooth, sharp focus, illustration, art by artgerm, (loish:0.23), wlop ilya kuvshinov., (global illumination, studio light, volumetric light)<br><br>
578
+ <b>Negative prompt:</b> Asian, black and white, close up, cartoon, 3d, denim, (disfigured), (deformed), (poorly drawn), (extra limbs), blurry, boring, sketch, lackluster, signature, letters, watermark, low res , horrific , mutated , artifacts , bad art , gross , b&w , poor quality , low quality , cropped
579
+ </td>
580
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
581
+ <b>Steps:</b><br>30<br>
582
+ <b>Sampler:</b><br>DPM++ SDE Karras<br>
583
+ <b>CFG scale:</b><br>10<br>
584
+ <b>Seed:</b><br>169686796<br>
585
+ <b>Size:</b><br>512x640<br>
586
+ <b>Denoising strength:</b><br>0.7<br>
587
+ <b>Hires upscale:</b><br>2<br>
588
+ <b>Hires upscaler:</b><br>Latent
589
+ </td>
590
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
591
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.imgur.com/ktLu2Tl.png">
592
+ </td>
593
+ </tr>
594
+ <tr>
595
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
596
+ dark and gloomy full body 8k unity render, female teen cyborg, Blue yonder hair, wearing broken battle armor, at cluttered and messy shack , action shot, tattered torn shirt, porcelain cracked skin, skin pores, detailed intricate iris, very dark lighting, heavy shadows, detailed, detailed face, (vibrant, photo realistic, realistic, dramatic, dark, sharp focus, 8k)<br><br>
597
+ <b>Negative prompt:</b> nude, Asian, black and white, close up, cartoon, 3d, denim, (disfigured), (deformed), (poorly drawn), (extra limbs), blurry, boring, sketch, lackluster, signature, letters, watermark, low res , horrific , mutated , artifacts , bad art , gross , b&w , poor quality , low quality , cropped
598
+ </td>
599
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
600
+ <b>Steps:</b><br>26<br>
601
+ <b>Sampler:</b><br>DPM++ SDE Karras<br>
602
+ <b>CFG scale:</b><br>7.5<br>
603
+ <b>Seed:</b><br>2388736888<br>
604
+ <b>Size:</b><br>768x1024
605
+ </td>
606
+ <td style="vertical-align:top;padding:.5714286em!important;border: 1px solid">
607
+ <img style="vertical-align:top;margin:0;padding:0" src="https://i.stack.imgur.com/GnUuV.jpg">
608
+ </td>
609
+ </tr>
610
+ </table>
611
+
612
+ [1]: https://i.stack.imgur.com/wkK2b.png
epic-diffusion.ckpt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5994ff3042e41a061650567242f79bd82dc051cf54c646e0980289b30bb6893c
3
+ size 2132865129
epic-diffusion.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:28b74d4f6871b7b2693c623fd78d1d8e9cc5ee9d92d13d9934125ec8a871a5ed
3
+ size 2132625431
feature_extractor/preprocessor_config.json ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "crop_size": {
3
+ "height": 224,
4
+ "width": 224
5
+ },
6
+ "do_center_crop": true,
7
+ "do_convert_rgb": true,
8
+ "do_normalize": true,
9
+ "do_rescale": true,
10
+ "do_resize": true,
11
+ "feature_extractor_type": "CLIPFeatureExtractor",
12
+ "image_mean": [
13
+ 0.48145466,
14
+ 0.4578275,
15
+ 0.40821073
16
+ ],
17
+ "image_processor_type": "CLIPImageProcessor",
18
+ "image_std": [
19
+ 0.26862954,
20
+ 0.26130258,
21
+ 0.27577711
22
+ ],
23
+ "resample": 3,
24
+ "rescale_factor": 0.00392156862745098,
25
+ "size": {
26
+ "shortest_edge": 224
27
+ }
28
+ }
model_index.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"_class_name": "StableDiffusionPipeline", "_diffusers_version": "0.11.0.dev0", "feature_extractor": ["transformers", "CLIPImageProcessor"], "requires_safety_checker": false, "safety_checker": [null, null], "scheduler": ["diffusers", "DPMSolverMultistepScheduler"], "text_encoder": ["transformers", "CLIPTextModel"], "tokenizer": ["transformers", "CLIPTokenizer"], "unet": ["diffusers", "UNet2DConditionModel"], "vae": ["diffusers", "AutoencoderKL"]}
safety_checker/config.json ADDED
@@ -0,0 +1,181 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_commit_hash": "cb41f3a270d63d454d385fc2e4f571c487c253c5",
3
+ "_name_or_path": "CompVis/stable-diffusion-safety-checker",
4
+ "architectures": [
5
+ "StableDiffusionSafetyChecker"
6
+ ],
7
+ "initializer_factor": 1.0,
8
+ "logit_scale_init_value": 2.6592,
9
+ "model_type": "clip",
10
+ "projection_dim": 768,
11
+ "text_config": {
12
+ "_name_or_path": "",
13
+ "add_cross_attention": false,
14
+ "architectures": null,
15
+ "attention_dropout": 0.0,
16
+ "bad_words_ids": null,
17
+ "begin_suppress_tokens": null,
18
+ "bos_token_id": 0,
19
+ "chunk_size_feed_forward": 0,
20
+ "cross_attention_hidden_size": null,
21
+ "decoder_start_token_id": null,
22
+ "diversity_penalty": 0.0,
23
+ "do_sample": false,
24
+ "dropout": 0.0,
25
+ "early_stopping": false,
26
+ "encoder_no_repeat_ngram_size": 0,
27
+ "eos_token_id": 2,
28
+ "exponential_decay_length_penalty": null,
29
+ "finetuning_task": null,
30
+ "forced_bos_token_id": null,
31
+ "forced_eos_token_id": null,
32
+ "hidden_act": "quick_gelu",
33
+ "hidden_size": 768,
34
+ "id2label": {
35
+ "0": "LABEL_0",
36
+ "1": "LABEL_1"
37
+ },
38
+ "initializer_factor": 1.0,
39
+ "initializer_range": 0.02,
40
+ "intermediate_size": 3072,
41
+ "is_decoder": false,
42
+ "is_encoder_decoder": false,
43
+ "label2id": {
44
+ "LABEL_0": 0,
45
+ "LABEL_1": 1
46
+ },
47
+ "layer_norm_eps": 1e-05,
48
+ "length_penalty": 1.0,
49
+ "max_length": 20,
50
+ "max_position_embeddings": 77,
51
+ "min_length": 0,
52
+ "model_type": "clip_text_model",
53
+ "no_repeat_ngram_size": 0,
54
+ "num_attention_heads": 12,
55
+ "num_beam_groups": 1,
56
+ "num_beams": 1,
57
+ "num_hidden_layers": 12,
58
+ "num_return_sequences": 1,
59
+ "output_attentions": false,
60
+ "output_hidden_states": false,
61
+ "output_scores": false,
62
+ "pad_token_id": 1,
63
+ "prefix": null,
64
+ "problem_type": null,
65
+ "projection_dim": 512,
66
+ "pruned_heads": {},
67
+ "remove_invalid_values": false,
68
+ "repetition_penalty": 1.0,
69
+ "return_dict": true,
70
+ "return_dict_in_generate": false,
71
+ "sep_token_id": null,
72
+ "suppress_tokens": null,
73
+ "task_specific_params": null,
74
+ "temperature": 1.0,
75
+ "tf_legacy_loss": false,
76
+ "tie_encoder_decoder": false,
77
+ "tie_word_embeddings": true,
78
+ "tokenizer_class": null,
79
+ "top_k": 50,
80
+ "top_p": 1.0,
81
+ "torch_dtype": null,
82
+ "torchscript": false,
83
+ "transformers_version": "4.26.0.dev0",
84
+ "typical_p": 1.0,
85
+ "use_bfloat16": false,
86
+ "vocab_size": 49408
87
+ },
88
+ "text_config_dict": {
89
+ "hidden_size": 768,
90
+ "intermediate_size": 3072,
91
+ "num_attention_heads": 12,
92
+ "num_hidden_layers": 12
93
+ },
94
+ "torch_dtype": "float32",
95
+ "transformers_version": null,
96
+ "vision_config": {
97
+ "_name_or_path": "",
98
+ "add_cross_attention": false,
99
+ "architectures": null,
100
+ "attention_dropout": 0.0,
101
+ "bad_words_ids": null,
102
+ "begin_suppress_tokens": null,
103
+ "bos_token_id": null,
104
+ "chunk_size_feed_forward": 0,
105
+ "cross_attention_hidden_size": null,
106
+ "decoder_start_token_id": null,
107
+ "diversity_penalty": 0.0,
108
+ "do_sample": false,
109
+ "dropout": 0.0,
110
+ "early_stopping": false,
111
+ "encoder_no_repeat_ngram_size": 0,
112
+ "eos_token_id": null,
113
+ "exponential_decay_length_penalty": null,
114
+ "finetuning_task": null,
115
+ "forced_bos_token_id": null,
116
+ "forced_eos_token_id": null,
117
+ "hidden_act": "quick_gelu",
118
+ "hidden_size": 1024,
119
+ "id2label": {
120
+ "0": "LABEL_0",
121
+ "1": "LABEL_1"
122
+ },
123
+ "image_size": 224,
124
+ "initializer_factor": 1.0,
125
+ "initializer_range": 0.02,
126
+ "intermediate_size": 4096,
127
+ "is_decoder": false,
128
+ "is_encoder_decoder": false,
129
+ "label2id": {
130
+ "LABEL_0": 0,
131
+ "LABEL_1": 1
132
+ },
133
+ "layer_norm_eps": 1e-05,
134
+ "length_penalty": 1.0,
135
+ "max_length": 20,
136
+ "min_length": 0,
137
+ "model_type": "clip_vision_model",
138
+ "no_repeat_ngram_size": 0,
139
+ "num_attention_heads": 16,
140
+ "num_beam_groups": 1,
141
+ "num_beams": 1,
142
+ "num_channels": 3,
143
+ "num_hidden_layers": 24,
144
+ "num_return_sequences": 1,
145
+ "output_attentions": false,
146
+ "output_hidden_states": false,
147
+ "output_scores": false,
148
+ "pad_token_id": null,
149
+ "patch_size": 14,
150
+ "prefix": null,
151
+ "problem_type": null,
152
+ "projection_dim": 512,
153
+ "pruned_heads": {},
154
+ "remove_invalid_values": false,
155
+ "repetition_penalty": 1.0,
156
+ "return_dict": true,
157
+ "return_dict_in_generate": false,
158
+ "sep_token_id": null,
159
+ "suppress_tokens": null,
160
+ "task_specific_params": null,
161
+ "temperature": 1.0,
162
+ "tf_legacy_loss": false,
163
+ "tie_encoder_decoder": false,
164
+ "tie_word_embeddings": true,
165
+ "tokenizer_class": null,
166
+ "top_k": 50,
167
+ "top_p": 1.0,
168
+ "torch_dtype": null,
169
+ "torchscript": false,
170
+ "transformers_version": "4.26.0.dev0",
171
+ "typical_p": 1.0,
172
+ "use_bfloat16": false
173
+ },
174
+ "vision_config_dict": {
175
+ "hidden_size": 1024,
176
+ "intermediate_size": 4096,
177
+ "num_attention_heads": 16,
178
+ "num_hidden_layers": 24,
179
+ "patch_size": 14
180
+ }
181
+ }
safety_checker/pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:16d28f2b37109f222cdc33620fdd262102ac32112be0352a7f77e9614b35a394
3
+ size 1216064769
scheduler/scheduler_config.json ADDED
@@ -0,0 +1,21 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_class_name": "DPMSolverMultistepScheduler",
3
+ "_diffusers_version": "0.10.0",
4
+ "algorithm_type": "dpmsolver++",
5
+ "beta_end": 0.012,
6
+ "beta_schedule": "scaled_linear",
7
+ "beta_start": 0.00085,
8
+ "clip_sample": false,
9
+ "dynamic_thresholding_ratio": 0.995,
10
+ "lower_order_final": true,
11
+ "num_train_timesteps": 1000,
12
+ "prediction_type": "epsilon",
13
+ "sample_max_value": 1.0,
14
+ "set_alpha_to_one": false,
15
+ "skip_prk_steps": true,
16
+ "solver_order": 2,
17
+ "solver_type": "midpoint",
18
+ "steps_offset": 1,
19
+ "thresholding": false,
20
+ "trained_betas": null
21
+ }
text_encoder/config.json ADDED
@@ -0,0 +1,25 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "openai/clip-vit-large-patch14",
3
+ "architectures": [
4
+ "CLIPTextModel"
5
+ ],
6
+ "attention_dropout": 0.0,
7
+ "bos_token_id": 0,
8
+ "dropout": 0.0,
9
+ "eos_token_id": 2,
10
+ "hidden_act": "quick_gelu",
11
+ "hidden_size": 768,
12
+ "initializer_factor": 1.0,
13
+ "initializer_range": 0.02,
14
+ "intermediate_size": 3072,
15
+ "layer_norm_eps": 1e-05,
16
+ "max_position_embeddings": 77,
17
+ "model_type": "clip_text_model",
18
+ "num_attention_heads": 12,
19
+ "num_hidden_layers": 12,
20
+ "pad_token_id": 1,
21
+ "projection_dim": 768,
22
+ "torch_dtype": "float32",
23
+ "transformers_version": "4.26.0.dev0",
24
+ "vocab_size": 49408
25
+ }
text_encoder/pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f3ab84100c78aa0ea459336509d08ca3ecb2df5325b1aae35189d080326f5ab9
3
+ size 492307041
tokenizer/merges.txt ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer/special_tokens_map.json ADDED
@@ -0,0 +1,24 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token": {
3
+ "content": "<|startoftext|>",
4
+ "lstrip": false,
5
+ "normalized": true,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "eos_token": {
10
+ "content": "<|endoftext|>",
11
+ "lstrip": false,
12
+ "normalized": true,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": "<|endoftext|>",
17
+ "unk_token": {
18
+ "content": "<|endoftext|>",
19
+ "lstrip": false,
20
+ "normalized": true,
21
+ "rstrip": false,
22
+ "single_word": false
23
+ }
24
+ }
tokenizer/tokenizer_config.json ADDED
@@ -0,0 +1,34 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "add_prefix_space": false,
3
+ "bos_token": {
4
+ "__type": "AddedToken",
5
+ "content": "<|startoftext|>",
6
+ "lstrip": false,
7
+ "normalized": true,
8
+ "rstrip": false,
9
+ "single_word": false
10
+ },
11
+ "do_lower_case": true,
12
+ "eos_token": {
13
+ "__type": "AddedToken",
14
+ "content": "<|endoftext|>",
15
+ "lstrip": false,
16
+ "normalized": true,
17
+ "rstrip": false,
18
+ "single_word": false
19
+ },
20
+ "errors": "replace",
21
+ "model_max_length": 77,
22
+ "name_or_path": "openai/clip-vit-large-patch14",
23
+ "pad_token": "<|endoftext|>",
24
+ "special_tokens_map_file": "./special_tokens_map.json",
25
+ "tokenizer_class": "CLIPTokenizer",
26
+ "unk_token": {
27
+ "__type": "AddedToken",
28
+ "content": "<|endoftext|>",
29
+ "lstrip": false,
30
+ "normalized": true,
31
+ "rstrip": false,
32
+ "single_word": false
33
+ }
34
+ }
tokenizer/vocab.json ADDED
The diff for this file is too large to render. See raw diff
 
unet/config.json ADDED
@@ -0,0 +1,40 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_class_name": "UNet2DConditionModel",
3
+ "_diffusers_version": "0.11.0.dev0",
4
+ "act_fn": "silu",
5
+ "attention_head_dim": 8,
6
+ "block_out_channels": [
7
+ 320,
8
+ 640,
9
+ 1280,
10
+ 1280
11
+ ],
12
+ "center_input_sample": false,
13
+ "cross_attention_dim": 768,
14
+ "down_block_types": [
15
+ "CrossAttnDownBlock2D",
16
+ "CrossAttnDownBlock2D",
17
+ "CrossAttnDownBlock2D",
18
+ "DownBlock2D"
19
+ ],
20
+ "downsample_padding": 1,
21
+ "dual_cross_attention": false,
22
+ "flip_sin_to_cos": true,
23
+ "freq_shift": 0,
24
+ "in_channels": 4,
25
+ "layers_per_block": 2,
26
+ "mid_block_scale_factor": 1,
27
+ "norm_eps": 1e-05,
28
+ "norm_num_groups": 32,
29
+ "num_class_embeds": null,
30
+ "only_cross_attention": false,
31
+ "out_channels": 4,
32
+ "sample_size": 64,
33
+ "up_block_types": [
34
+ "UpBlock2D",
35
+ "CrossAttnUpBlock2D",
36
+ "CrossAttnUpBlock2D",
37
+ "CrossAttnUpBlock2D"
38
+ ],
39
+ "use_linear_projection": false
40
+ }
unet/diffusion_pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fb65803ef8507baa7b4dcbfc398032d4b95ee1d7454fc6317fa4b7b25f7f3029
3
+ size 3438366373
v1-inference.yaml ADDED
@@ -0,0 +1,70 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ model:
2
+ base_learning_rate: 1.0e-04
3
+ target: ldm.models.diffusion.ddpm.LatentDiffusion
4
+ params:
5
+ linear_start: 0.00085
6
+ linear_end: 0.0120
7
+ num_timesteps_cond: 1
8
+ log_every_t: 200
9
+ timesteps: 1000
10
+ first_stage_key: "jpg"
11
+ cond_stage_key: "txt"
12
+ image_size: 64
13
+ channels: 4
14
+ cond_stage_trainable: false # Note: different from the one we trained before
15
+ conditioning_key: crossattn
16
+ monitor: val/loss_simple_ema
17
+ scale_factor: 0.18215
18
+ use_ema: False
19
+
20
+ scheduler_config: # 10000 warmup steps
21
+ target: ldm.lr_scheduler.LambdaLinearScheduler
22
+ params:
23
+ warm_up_steps: [ 10000 ]
24
+ cycle_lengths: [ 10000000000000 ] # incredibly large number to prevent corner cases
25
+ f_start: [ 1.e-6 ]
26
+ f_max: [ 1. ]
27
+ f_min: [ 1. ]
28
+
29
+ unet_config:
30
+ target: ldm.modules.diffusionmodules.openaimodel.UNetModel
31
+ params:
32
+ image_size: 32 # unused
33
+ in_channels: 4
34
+ out_channels: 4
35
+ model_channels: 320
36
+ attention_resolutions: [ 4, 2, 1 ]
37
+ num_res_blocks: 2
38
+ channel_mult: [ 1, 2, 4, 4 ]
39
+ num_heads: 8
40
+ use_spatial_transformer: True
41
+ transformer_depth: 1
42
+ context_dim: 768
43
+ use_checkpoint: True
44
+ legacy: False
45
+
46
+ first_stage_config:
47
+ target: ldm.models.autoencoder.AutoencoderKL
48
+ params:
49
+ embed_dim: 4
50
+ monitor: val/rec_loss
51
+ ddconfig:
52
+ double_z: true
53
+ z_channels: 4
54
+ resolution: 256
55
+ in_channels: 3
56
+ out_ch: 3
57
+ ch: 128
58
+ ch_mult:
59
+ - 1
60
+ - 2
61
+ - 4
62
+ - 4
63
+ num_res_blocks: 2
64
+ attn_resolutions: []
65
+ dropout: 0.0
66
+ lossconfig:
67
+ target: torch.nn.Identity
68
+
69
+ cond_stage_config:
70
+ target: ldm.modules.encoders.modules.FrozenCLIPEmbedder
vae/config.json ADDED
@@ -0,0 +1,29 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_class_name": "AutoencoderKL",
3
+ "_diffusers_version": "0.4.2",
4
+ "act_fn": "silu",
5
+ "block_out_channels": [
6
+ 128,
7
+ 256,
8
+ 512,
9
+ 512
10
+ ],
11
+ "down_block_types": [
12
+ "DownEncoderBlock2D",
13
+ "DownEncoderBlock2D",
14
+ "DownEncoderBlock2D",
15
+ "DownEncoderBlock2D"
16
+ ],
17
+ "in_channels": 3,
18
+ "latent_channels": 4,
19
+ "layers_per_block": 2,
20
+ "norm_num_groups": 32,
21
+ "out_channels": 3,
22
+ "sample_size": 256,
23
+ "up_block_types": [
24
+ "UpDecoderBlock2D",
25
+ "UpDecoderBlock2D",
26
+ "UpDecoderBlock2D",
27
+ "UpDecoderBlock2D"
28
+ ]
29
+ }
vae/diffusion_pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7c98ebcd7ca5cb69d47b2ae287feba0695689fbf2c8fead2fab05fd3e0c28303
3
+ size 334707217
vae/diffusion_pytorch_model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:32db726da04f06c1b6b14c0043ce115cc87a501482945c5add89a40d838fcb46
3
+ size 334643276