--- base_model: PixArt-alpha/PixArt-Sigma-XL-2-1024-MS library_name: diffusers license: creativeml-openrail-m tags: - stable-diffusion - stable-diffusion-diffusers - text-to-image - diffusers - full - pixart - pixart sigma inference: true widget: - text: A blonde sexy girl, wearing glasses at latex shirt and a blue beanie with a tattoo, blue and white, highly detailed, sublime, extremely beautiful, sharp focus, refined, cinematic, intricate, elegant, dynamic, rich deep colors, bright color, shining light, attractive, cute, pretty, background full, epic composition, dramatic atmosphere, radiant, professional, stunning parameters: negative_prompt: blurry, cropped, ugly output: url: ./assets/1.png - text: a wizard with a glowing staff and a glowing hat, colorful magic, dramatic atmosphere, sharp focus, highly detailed, cinematic, original composition, fine detail, intricate, elegant, creative, color spread, shiny, amazing, symmetry, illuminated, inspired, pretty, attractive, artistic, dynamic background, relaxed, professional, extremely inspirational, beautiful, determined, cute, adorable, best parameters: negative_prompt: blurry, cropped, ugly output: url: ./assets/2.png - text: girl in modern car, intricate, elegant, highly detailed, extremely complimentary colors, beautiful, glowing aesthetic, pretty, dramatic light, sharp focus, perfect composition, clear artistic color, calm professional background, precise, joyful, emotional, unique, cute, best, gorgeous, great delicate, expressive, thought, iconic, fine, awesome, creative, winning, charming, enhanced parameters: negative_prompt: blurry, cropped, ugly output: url: ./assets/3.png - text: A girl stands amidst scattered glass shards, surrounded by a beautifully crafted and expansive world. The scene is depicted from a dynamic angle, emphasizing her determined expression. The background features vast landscapes with floating crystals and soft, glowing lights that create a mystical and grand atmosphere. parameters: negative_prompt: blurry, cropped, ugly output: url: ./assets/ComfyUI_PixArt_00040_.png - text: A girl stands amidst scattered glass shards, surrounded by a beautifully crafted and expansive world. The scene is depicted from a dynamic angle, emphasizing her determined expression. The background features vast landscapes with floating crystals and soft, glowing lights that create a mystical and grand atmosphere. parameters: negative_prompt: blurry, cropped, ugly output: url: ./assets/ComfyUI_PixArt_00036_.png - text: A close-up shot of a beautiful girl in a serene world. She has white hair and is blindfolded, with a calm expression. Her hands are pressed together in a prayer pose, with fingers interlaced and palms touching. The background is softly blurred, enhancing her ethereal presence. parameters: negative_prompt: blurry, cropped, ugly output: url: ./assets/ComfyUI_PixArt_00041_.png --- # SigmaJourney: PixartSigma + MidJourney v6 ## Inference ### ComfyUI - Download model file `transformer/diffusion_pytorch_model.safetensors` and put into `ComfyUI/models/checkpoints` - Use ExtraModels node: https://github.com/city96/ComfyUI_ExtraModels?tab=readme-ov-file#pixart ![image/png](https://cdn-uploads.huggingface.co/production/uploads/643c7e91b409fef15e0bd11b/MJfTShin1fYOOCo4mTv2-.png) ```python import torch from diffusers import DiffusionPipeline, EulerAncestralDiscreteScheduler from diffusers.models import PixArtTransformer2DModel model_id = "TensorFamily/SigmaJourney" negative_prompt = "malformed, disgusting, overexposed, washed-out" pipeline = DiffusionPipeline.from_pretrained("PixArt-alpha/PixArt-Sigma-XL-2-1024-MS", torch_dtype=torch.float16) pipeline.transformer = PixArtTransformer2DModel.from_pretrained(model_id, subfolder="transformer", torch_dtype=torch.float16) pipeline.scheduler = EulerAncestralDiscreteScheduler.from_config(pipeline.scheduler.config) pipeline.to('cuda' if torch.cuda.is_available() else 'cpu') prompt = "On the left, there is a red cube. On the right, there is a blue sphere. On top of the red cube is a dog. On top of the blue sphere is a cat" image = pipeline( prompt=prompt, negative_prompt='blurry, cropped, ugly', num_inference_steps=30, generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(1641421826), width=1024, height=1024, guidance_scale=5.5, ).images[0] image.save("output.png", format="JPEG") ```