---
base_model: PixArt-alpha/PixArt-Sigma-XL-2-1024-MS
library_name: diffusers
license: creativeml-openrail-m
tags:
- stable-diffusion
- stable-diffusion-diffusers
- text-to-image
- diffusers
- full
- pixart
- pixart sigma
inference: true
widget:
- text: A blonde sexy girl, wearing glasses at latex shirt and a blue beanie with
    a tattoo, blue and white, highly detailed, sublime, extremely beautiful, sharp
    focus, refined, cinematic, intricate, elegant, dynamic, rich deep colors, bright
    color, shining light, attractive, cute, pretty, background full, epic composition,
    dramatic atmosphere, radiant, professional, stunning
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: ./assets/1.png
- text: a wizard with a glowing staff and a glowing hat, colorful magic, dramatic
    atmosphere, sharp focus, highly detailed, cinematic, original composition, fine
    detail, intricate, elegant, creative, color spread, shiny, amazing, symmetry,
    illuminated, inspired, pretty, attractive, artistic, dynamic background, relaxed,
    professional, extremely inspirational, beautiful, determined, cute, adorable,
    best
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: ./assets/2.png
- text: girl in modern car, intricate, elegant, highly detailed, extremely complimentary
    colors, beautiful, glowing aesthetic, pretty, dramatic light, sharp focus, perfect
    composition, clear artistic color, calm professional background, precise, joyful,
    emotional, unique, cute, best, gorgeous, great delicate, expressive, thought,
    iconic, fine, awesome, creative, winning, charming, enhanced
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: ./assets/3.png
- text: A girl stands amidst scattered glass shards, surrounded by a beautifully crafted
    and expansive world. The scene is depicted from a dynamic angle, emphasizing her
    determined expression. The background features vast landscapes with floating crystals
    and soft, glowing lights that create a mystical and grand atmosphere.
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: ./assets/ComfyUI_PixArt_00040_.png
- text: A girl stands amidst scattered glass shards, surrounded by a beautifully crafted
    and expansive world. The scene is depicted from a dynamic angle, emphasizing her
    determined expression. The background features vast landscapes with floating crystals
    and soft, glowing lights that create a mystical and grand atmosphere.
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: ./assets/ComfyUI_PixArt_00036_.png
- text: A close-up shot of a beautiful girl in a serene world. She has white hair
    and is blindfolded, with a calm expression. Her hands are pressed together in
    a prayer pose, with fingers interlaced and palms touching. The background is softly
    blurred, enhancing her ethereal presence.
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: ./assets/ComfyUI_PixArt_00041_.png
---

# SigmaJourney: PixartSigma + MidJourney v6


<Gallery />


## Inference

### ComfyUI
- Download model file `transformer/diffusion_pytorch_model.safetensors` and put into `ComfyUI/models/checkpoints`
- Use ExtraModels node: https://github.com/city96/ComfyUI_ExtraModels?tab=readme-ov-file#pixart

![image/png](https://cdn-uploads.huggingface.co/production/uploads/643c7e91b409fef15e0bd11b/MJfTShin1fYOOCo4mTv2-.png)

```python
import torch
from diffusers import DiffusionPipeline, EulerAncestralDiscreteScheduler
from diffusers.models import PixArtTransformer2DModel


model_id = "TensorFamily/SigmaJourney"
negative_prompt = "malformed, disgusting, overexposed, washed-out"

pipeline = DiffusionPipeline.from_pretrained("PixArt-alpha/PixArt-Sigma-XL-2-1024-MS", torch_dtype=torch.float16)
pipeline.transformer = PixArtTransformer2DModel.from_pretrained(model_id, subfolder="transformer", torch_dtype=torch.float16)
pipeline.scheduler = EulerAncestralDiscreteScheduler.from_config(pipeline.scheduler.config)
pipeline.to('cuda' if torch.cuda.is_available() else 'cpu')

prompt = "On the left, there is a red cube. On the right, there is a blue sphere. On top of the red cube is a dog. On top of the blue sphere is a cat"
image = pipeline(
    prompt=prompt,
    negative_prompt='blurry, cropped, ugly',
    num_inference_steps=30,
    generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(1641421826),
    width=1024,
    height=1024,
    guidance_scale=5.5,
).images[0]
image.save("output.png", format="JPEG")
```