Repurpose as simple text-to-image app

#66
by adamelliotfields - opened

After dogfooding this for a while:

  • I still love the speed (loading and inference) of SD 1.5; combined with good prompts, TI embeddings, and super resolution it's not trash
  • IP-Adapter and ControlNet aren't great on SD 1.5; they're more proof of concept, good for experimentation (I never use them)
  • Image-to-image is great for style transfer
  • Fast negative TI embeddings are convenient
  • I use the same styles and settings with the same models; should just create simple presets
  • Batch mode is useful but shouldn't use sequential seeds
  • I either use seed 42 or random
  • I don't use LoRAs because I'd rather just pay 4 cents to use a commercial model that looks better
  • I always use PNG format
  • I always use Karras sigmas
  • I never use custom dimensions
  • I never use clip skip, FreeU, or TAESD; DeepCache was for localhost only
  • I don't like the random prompts from Stable Studio; I just use "cat" with a style
  • I'd like to have SD 1.5 and XL models in one app
  • I need a dedicated upscale app with different SR models (TBD)

To that end:

  • Remove settings, image-to-image, and ControlNet tabs; remove LoRA dropdowns and all the other unused settings
  • Remove negative prompt; styles will have negatives and fast negative embedding
  • Model selection replaced with "Quality" selection: SD 1.5 (good) or XL (best)
  • Inference steps replaced with "Speed" selection: fast (30), balanced (40), slow (50)
  • Style dropdown now pairs model, scheduler, CFG
  • Single "reproducible" checkbox to use seed 42; otherwise random
  • Keep gallery; batching is incremental if reproducible
  • Replace dice button (random prompt) with a cat button that picks a random style with the prompt "cat"
  • Will create focused image-to-image and upscaling apps
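
A rough sketch of how the simplified controls above could map to generation settings. Only the step counts (30/40/50) and seed 42 come from the list; the `Style` fields, the placeholder model names, the example preset, the seed upper bound, and the reading of "incremental" as 42, 43, 44, ... are all assumptions for illustration, not the app's actual code.

```python
import random
from dataclasses import dataclass

# Step counts from the "Speed" bullet above.
SPEED_STEPS = {"fast": 30, "balanced": 40, "slow": 50}

BASE_SEED = 42          # seed used when "reproducible" is checked
MAX_SEED = 2**32 - 1    # assumed upper bound for random seeds


@dataclass(frozen=True)
class Style:
    """Hypothetical preset pairing a prompt template with model, scheduler, and CFG."""
    name: str
    positive: str           # template containing "{prompt}"
    negative: str           # negatives live in the style, not in a separate field
    models: dict[str, str]  # "Quality" label -> placeholder model name
    scheduler: str
    cfg: float


# Made-up example preset; the model names are placeholders, not real repo IDs.
EXAMPLE_STYLE = Style(
    name="Watercolor",
    positive="{prompt}, watercolor painting",
    negative="photo, blurry",
    models={"good": "sd15-placeholder", "best": "sdxl-placeholder"},
    scheduler="karras-placeholder",
    cfg=6.0,
)


def batch_seeds(batch_size, reproducible, rng=None):
    """Incremental seeds from 42 if reproducible, otherwise independent random seeds."""
    if reproducible:
        return [BASE_SEED + i for i in range(batch_size)]
    rng = rng or random.Random()
    return [rng.randint(0, MAX_SEED) for _ in range(batch_size)]


def build_request(prompt, style, quality, speed, batch_size=1, reproducible=False):
    """Resolve the UI choices into one settings dict for a batch."""
    return {
        "model": style.models[quality],
        "steps": SPEED_STEPS[speed],
        "scheduler": style.scheduler,
        "guidance_scale": style.cfg,
        "prompt": style.positive.format(prompt=prompt),
        "negative_prompt": style.negative,
        "seeds": batch_seeds(batch_size, reproducible),
    }
```

With this shape, the cat button would amount to calling `build_request("cat", random.choice(styles), ...)` over whatever list of presets exists.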

Edit: leaning towards keeping this and XL as-is; new simple app will just be named "sd".
