Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models
Abstract
This is a technical report on the 360-degree panoramic image generation task based on diffusion models. Unlike ordinary 2D images, 360-degree panoramic images capture the entire 360^circtimes 180^circ field of view. So the rightmost and the leftmost sides of the 360 panoramic image should be continued, which is the main challenge in this field. However, the current diffusion pipeline is not appropriate for generating such a seamless 360-degree panoramic image. To this end, we propose a circular blending strategy on both the denoising and VAE decoding stages to maintain the geometry continuity. Based on this, we present two models for Text-to-360-panoramas and Single-Image-to-360-panoramas tasks. The code has been released as an open-source project at https://github.com/ArcherFMY/SD-T2I-360PanoImage{https://github.com/ArcherFMY/SD-T2I-360PanoImage} and https://www.modelscope.cn/models/damo/cv_diffusion_text-to-360panorama-image_generation/summary{ModelScope}
Community
Isn't it the same as "circular_padding” argument in “StableDiffusionPanoramaPipeline” https://github.com/huggingface/diffusers/pull/4025 ?
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Customizing 360-Degree Panoramas through Text-to-Image Diffusion Models (2023)
- DreamSpace: Dreaming Your Room Space with Text-Driven Panoramic Texture Propagation (2023)
- Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model (2023)
- Paragraph-to-Image Generation with Information-Enriched Diffusion Model (2023)
- Text-Guided Texturing by Synchronized Multi-View Diffusion (2023)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper