Generating 360-degree panoramas from narrow field of view (NFoV) images is a promising computer vision task for Virtual Reality (VR) applications. Existing methods mostly assess the generated panoramas with InceptionNet- or CLIP-based metrics, which mainly capture image quality and are not suited to evaluating distortion. In this work, we first propose a distortion-specific CLIP, named Distort-CLIP, to accurately evaluate panorama distortion, and with it we discover the "visual cheating" phenomenon in previous works (i.e., improving visual results by sacrificing distortion accuracy). This phenomenon arises because prior methods employ a single network to learn panorama distortion and content completion, two distinct tasks, at once, which leads the model to prioritize optimizing the latter. To address this, we propose PanoDecouple, a decoupled diffusion model framework that decouples panorama generation into distortion guidance and content completion, aiming to generate panoramas with both accurate distortion and visual appeal. Specifically, we design a DistortNet for distortion guidance by imposing panorama-specific distortion priors and a modified condition registration mechanism, and a ContentNet for content completion by imposing perspective image information. Additionally, a distortion correction loss function built on Distort-CLIP is introduced to constrain the distortion explicitly. Extensive experiments validate that PanoDecouple surpasses existing methods in both distortion and visual metrics.
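To make the distortion correction loss concrete, below is a minimal PyTorch sketch of one plausible form, assuming the loss encourages the generated panorama's Distort-CLIP image embedding to match a fixed embedding of the panoramic-distortion class. The interface `distort_clip.encode_image`, the tensor names, and the cosine-similarity form are illustrative assumptions, not the released implementation.

```python
import torch
import torch.nn.functional as F

def distortion_correction_loss(generated_pano, distort_clip, target_embed):
    """Hedged sketch: pull the generated panorama's Distort-CLIP embedding
    toward a fixed panoramic-distortion class embedding.

    generated_pano: (B, 3, H, W) panoramas decoded from the diffusion model.
    distort_clip:   frozen Distort-CLIP image encoder (illustrative interface).
    target_embed:   (D,) embedding representing correct panoramic distortion.
    """
    with torch.no_grad():
        target = F.normalize(target_embed, dim=-1)                    # fixed target
    img_feat = F.normalize(distort_clip.encode_image(generated_pano), dim=-1)
    # Maximize cosine similarity to the correct distortion class;
    # gradients flow through the frozen encoder to the generator only.
    return (1.0 - img_feat @ target).mean()
```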
Image quality and distortion accuracy of existing methods and ours, measured by FID and our Distort-FID, respectively. We project two regions of the panorama (marked in the corresponding colors) into perspective images to show the distortion accuracy of existing methods (i.e., an undistorted, natural layout in the perspective image indicates a good result). Recent methods improve image quality while significantly degrading distortion. We name this the "visual cheating" phenomenon.
The training pipeline of our Distort-CLIP. The image features of the three distortion types are compared via cosine similarity with each other and with the text features of the three distortion types. "-" means the corresponding element does not participate in the computation because it is meaningless. Blue boxes indicate that the target similarity of the corresponding elements is 1, otherwise 0.
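The similarity targets in this figure can be summarized by the sketch below. It assumes pre-extracted image and text features for the three distortion types; the MSE form of the objective, the batching, and the variable names are placeholders for illustration rather than the released training code.

```python
import torch
import torch.nn.functional as F

def distort_clip_loss(img_feats, txt_feats, labels):
    """Hedged sketch of the similarity targets in the Distort-CLIP figure.

    img_feats: (N, D) image features, one distortion label per image.
    txt_feats: (3, D) text features, one per distortion type.
    labels:    (N,) distortion type index in {0, 1, 2} for each image.
    """
    img = F.normalize(img_feats, dim=-1)
    txt = F.normalize(txt_feats, dim=-1)

    # Image-image similarities: target 1 for the same distortion type, else 0.
    ii_sim = img @ img.t()                                            # (N, N)
    ii_tgt = (labels[:, None] == labels[None, :]).float()
    # "-" entries: a feature's similarity with itself is skipped as meaningless.
    ii_mask = ~torch.eye(len(labels), dtype=torch.bool, device=labels.device)
    loss_ii = F.mse_loss(ii_sim[ii_mask], ii_tgt[ii_mask])

    # Image-text similarities: target 1 for the matching distortion type, else 0.
    it_sim = img @ txt.t()                                            # (N, 3)
    it_tgt = F.one_hot(labels, num_classes=3).float()
    loss_it = F.mse_loss(it_sim, it_tgt)

    return loss_ii + loss_it
```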
The pipeline of the proposed PanoDecouple, a decoupled diffusion model. The DistortNet focuses on distortion guidance via the proposed distortion map. To make full use of the position-encoding-like distortion map, we modify the condition registration mechanism of ControlNet from the first block only to all blocks. The ContentNet is devoted to content completion by taking the partial panorama image and perspective information as input. The U-Net remains frozen, coordinating the fusion of information from the content completion and distortion guidance branches while fully leveraging its powerful pre-trained knowledge. Note that we omit the text input of the DistortNet and U-Net for simplicity, while the text input of the ContentNet is replaced by a perspective image embedding.
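To illustrate the modified condition registration, here is a schematic PyTorch sketch of a ControlNet-style branch in which the distortion-map condition is injected at every block rather than only the first; the module structure, zero-initialized convolutions, and names are illustrative assumptions, not the released implementation.

```python
import torch
import torch.nn as nn

class AllBlockConditionRegistration(nn.Module):
    """Hedged sketch: ControlNet-like branch where the distortion-map
    condition is registered at every block instead of only the first."""

    def __init__(self, blocks, cond_channels):
        super().__init__()
        self.blocks = nn.ModuleList(blocks)          # trainable copies of U-Net encoder blocks
        # One zero-initialized 1x1 conv per block, so training starts as identity.
        self.zero_convs = nn.ModuleList(
            nn.Conv2d(c, c, kernel_size=1) for c in cond_channels
        )
        for conv in self.zero_convs:
            nn.init.zeros_(conv.weight)
            nn.init.zeros_(conv.bias)

    def forward(self, x, cond_feats):
        """x: latent features; cond_feats: per-block features derived from the
        position-encoding-like distortion map, matched to each block's resolution."""
        residuals = []
        for block, zero_conv, cond in zip(self.blocks, self.zero_convs, cond_feats):
            x = block(x + zero_conv(cond))           # register the condition at every block
            residuals.append(x)                      # passed to the frozen U-Net for fusion
        return residuals
```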
Comparison with SOTA methods. † means re-implemented in our setting for a fair comparison. Note that the bottom region of Laval consists entirely of black edges; we crop 20% of it when testing image quality and keep the full image when testing distortion, as distortion evaluation requires the complete panorama. (·) denotes the crop setting of PanoDiff (cropping 20% of the top and bottom regions). The best and second-best results are in bold and underlined, respectively.
Visual results with raw image input. Note that the images we use are for academic purposes only. If any copyright infringement occurs, we will promptly remove them.
If you find our work useful, please consider citing our paper:
@InProceedings{zheng2025panorama,
title={Panorama Generation From NFoV Image Done Right},
author={Zheng, Dian and Zhang, Cheng and Wu, Xiao-Ming and Li, Cao and Lv, Chengfei and Hu, Jian-Fang and Zheng, Wei-Shi},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year={2025}
}