LDM3D-VR: Latent Diffusion Model for 3D VR
Abstract
Latent diffusion models have proven to be state-of-the-art for generating and manipulating visual outputs. However, to the best of our knowledge, the joint generation of RGB and depth maps remains limited. We introduce LDM3D-VR, a suite of diffusion models targeting virtual reality development that includes LDM3D-pano and LDM3D-SR. These models enable the generation of panoramic RGBD from textual prompts and the upscaling of low-resolution inputs to high-resolution RGBD, respectively. Our models are fine-tuned from existing pretrained models on datasets containing panoramic/high-resolution RGB images, depth maps, and captions. Both models are evaluated against existing related methods.
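As a concrete illustration of the text-to-panoramic-RGBD use case, here is a minimal sketch assuming the LDM3D-pano checkpoint is published on the Hugging Face Hub as Intel/ldm3d-pano and exposed through diffusers' StableDiffusionLDM3DPipeline; the checkpoint name, generation parameters, and output fields are assumptions to verify against the current model card.

```python
# Minimal sketch: text -> panoramic RGB + depth with LDM3D-pano.
# Assumes the "Intel/ldm3d-pano" checkpoint works with diffusers'
# StableDiffusionLDM3DPipeline; verify both against the model card.
import torch
from diffusers import StableDiffusionLDM3DPipeline

pipe = StableDiffusionLDM3DPipeline.from_pretrained(
    "Intel/ldm3d-pano", torch_dtype=torch.float16
).to("cuda")

prompt = "360 view of a forest clearing at sunrise"
# Equirectangular panoramas use a 2:1 width-to-height ratio.
output = pipe(prompt, width=1024, height=512)
rgb_image = output.rgb[0]      # PIL image: RGB panorama
depth_image = output.depth[0]  # PIL image: corresponding depth map
rgb_image.save("pano_rgb.jpg")
depth_image.save("pano_depth.png")
```

In this sketch, the 1024×512 resolution matches the 2:1 equirectangular format that panoramic viewers expect; per the abstract, a higher-resolution RGBD result could then be obtained by passing the low-resolution output through LDM3D-SR.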
Community
This is an automated message from the Librarian Bot. The following similar papers were recommended by the Semantic Scholar API:
- JointNet: Extending Text-to-Image Diffusion for Dense Distribution Modeling (2023)
- DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model (2023)
- Light Field Diffusion for Single-View Novel View Synthesis (2023)
- HumanNorm: Learning Normal Diffusion Model for High-quality and Realistic 3D Human Generation (2023)
- Customizing 360-Degree Panoramas through Text-to-Image Diffusion Models (2023)