alea31415
/

umamusume-full

Text-to-Image

stable-diffusion

anime

aiart

Model card Files Files and versions Community

alea31415 commited on Feb 25, 2023

Commit

d088e80

•

1 Parent(s): 5c19cd9

Update README.md

Browse files

Files changed (1) hide show

README.md +43 -2

README.md CHANGED Viewed

@@ -7,7 +7,7 @@ tags:
 pipeline_tag: text-to-image
 ---
-**This model intends to be the ultimate uma-musume model where I try to train as many things about uma-musume as possible into it.**
 Ok it's not totally true. Only characters and outfits are trained, ant not locations/objects/running-style(?) or whatever.
@@ -16,6 +16,25 @@ Ok it's not totally true. Only characters and outfits are trained, ant not locat
 Comming soon... (Still training)
 ## Concepts
 #### List of characters
@@ -154,4 +173,26 @@ Around 60K images containing
 - 21377 fan arts
 - 270 support cards
 - 7822 resused images: From the above three 4019 images are selected to form a few-shot classifier training set, areas centered at face are cropped out which give 3803 images
-- 19279 regularization images

 pipeline_tag: text-to-image
 ---
+**This model intends to be the ultimate uma-musume model where I try to train as many things about uma-musume as possible into it.**
 Ok it's not totally true. Only characters and outfits are trained, ant not locations/objects/running-style(?) or whatever.
 Comming soon... (Still training)
+## Usage
+You don't need to use the model as is. Treat it as lora and do either
+- **Your favorite model + w * (umamusume model - ACertainty/Nai)**
+- or **Umamusume model + w * (your favorite model - ACertainty/Nai)**
+#### Embeddings
+For character specific outfit you'd better download embeddings from (coming soon...)
+#### How to prompt
+Here are two example captions that are used for training
+> AgnesDigital; CurrenChan, tracen school uniform; fanart; 2girls, horse girl, multiple girls, character doll, phone, beads, bangs, bow, blush, animal ears, school uniform, holding phone, two side up, heart in mouth, one eye closed, white background, sweat, tracen school uniform, selfie, red bow, shirt, sailor collar, open mouth, simple background, heart, horse ears, holding, purple shirt, purple skirt, skirt, long sleeves
+> CurrenChan; SmartFalcon, SmartFalcon-racing-suit; fanart; 2girls, horse girl, multiple girls, dress, diamond (shape), party, nail polish, red bow, see-through, teeth, frilled dress, upper teeth only, balloon, blurry, frilled collar, animal ears, timestamp, ring, black bow, pink dress, border, gem, short sleeves, puffy sleeves, heart hands, pink nails, strap, bow, lace-trimmed sleeves, black headband, chromatic aberration, white border, black dress, writing on wall, headband, frills, horse tail, chain, horse ears, puffy short sleeves, bracelet, tail, bangs, open mouth, collar, black nails, multicolored dress, multicolored clothes, depth of field, heart
 ## Concepts
 #### List of characters
 - 21377 fan arts
 - 270 support cards
 - 7822 resused images: From the above three 4019 images are selected to form a few-shot classifier training set, areas centered at face are cropped out which give 3803 images
+- 19279 regularization images
+## Training
+The model is trained in two phases (resolution 512, clip skip 1) on top of [ACertainty](https://huggingface.co/JosephusCheung/ACertainty)
+**The first phase is trained with [EveryDream2](https://github.com/victorchall/EveryDream2trainer)**
+- Specific weighting scheme with maximum repeat 50
+- consine scheduler, 2.5e-6 learning rate
+- conditional dropout 0.08
+Total steps x batch is around 1.1M
+- batch 8 for 61495 steps
+- batch 6 for ~3500 steps but this gets stopped due to some technical issues
+- batch 4 for 150621 steps
+**The second phase is trained with [naifu trainer](https://github.com/AdjointOperator/naifu-diffusion)**
+The goal is to train embeddings for character-specific outfits.
+Both unets and embeddings are trained.
+This phase does not use the anime screenshots nor the regularization images.