<p align="center"> <img src="images/logo.png" width="400"> </p>

Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment

Tianhe Wu<sup>1,*</sup>, Shuwei Shi<sup>1,2,*</sup>, Haoming Cai<sup>3</sup>, Mingdeng Cao<sup>2</sup>, Jing Xiao<sup>4</sup>, Yinqiang Zheng<sup>2</sup>, Yujiu Yang<sup>1</sup>

<sup>1</sup>Shenzhen International Graduate School, Tsinghua University<br> <sup>2</sup>The University of Tokyo<br> <sup>3</sup>University of Maryland, College Park<br> <sup>4</sup>Pingan Group

Tsinghua University Intelligent Interaction Group

:rocket: :rocket: :rocket: Updates:


This repository is the official PyTorch implementation of Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment. :fire::fire::fire: We have processed the MOS files of all the 2D VR IQA datasets; they are available in the data folder. To train your own model, download the dataset and modify the files in the models and data folders.

Omnidirectional images at different distortion levels are shown below, with the rank of each score in parentheses. You can also download Insta360 Player for an immersive experience.

| Ground Truth | Distortion 1 | Distortion 2 | Distortion 3 | Distortion 4 |
| :---: | :---: | :---: | :---: | :---: |
| <img width="200" src="images/GT_1.png"> | <img width="200" src="images/dis1_1.png"> | <img width="200" src="images/dis2_1.png"> | <img width="200" src="images/dis3_1.png"> | <img width="200" src="images/dis4_1.png"> |
| MOS (GT) | 3.45 (1) | 2.95 (2) | 1.6 (3) | 1.1 (4) |
| Ours (Assessor360) | 0.5933 (1) | 0.5213 (2) | 0.1220 (3) | 0.0120 (4) |
| <img width="200" src="images/GT_2.png"> | <img width="200" src="images/dis1_2.png"> | <img width="200" src="images/dis2_2.png"> | <img width="200" src="images/dis3_2.png"> | <img width="200" src="images/dis4_2.png"> |
| MOS (GT) | 4.85 (1) | 3.25 (2) | 2.4 (3) | 1.3 (4) |
| Ours (Assessor360) | 0.9566 (1) | 0.7263 (2) | 0.3495 (3) | 0.0748 (4) |
| <img width="200" src="images/GT_3.png"> | <img width="200" src="images/dis1_3.png"> | <img width="200" src="images/dis2_3.png"> | <img width="200" src="images/dis3_3.png"> | <img width="200" src="images/dis4_3.png"> |
| MOS (GT) | 4.35 (1) | 3.35 (2) | 2.6 (3) | 1.3 (4) |
| Ours (Assessor360) | 0.8194 (1) | 0.5481 (2) | 0.3583 (3) | 0.1165 (4) |

Blind Omnidirectional Image Quality Assessment (BOIQA) aims to objectively assess the human perceptual quality of omnidirectional images (ODIs) without relying on pristine-quality image information. It is becoming more important as virtual reality (VR) technology advances. However, the quality assessment of ODIs is severely hampered by the fact that the existing BOIQA pipeline does not model the observer's browsing process. To tackle this issue, we propose a novel multi-sequence network for BOIQA called Assessor360, which is derived from the realistic multi-assessor ODI quality assessment procedure. Specifically, we propose a generalized Recursive Probability Sampling (RPS) method for the BOIQA task, combining content and detail information to generate multiple pseudo viewport sequences from a given starting point. Additionally, we design a Multi-scale Feature Aggregation (MFA) module with a Distortion-aware Block (DAB) to fuse the distorted and semantic features of each viewport. We also devise a Temporal Modeling Module (TMM) to learn the viewport transition in the temporal domain. Extensive experimental results demonstrate that Assessor360 outperforms state-of-the-art methods on multiple OIQA datasets.
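To make the idea of generating multiple pseudo viewport sequences from a given starting point more concrete, here is a toy sketch. It is not the RPS method from the paper or the code in this repository: the function names, the 8-neighbour move set, the 30-degree step, and the equator-biased score are all assumptions chosen for illustration. It only shows the general pattern of recursively sampling the next viewport from a probability distribution over candidates.

```python
import math
import random


def equator_bias(viewport):
    """Hypothetical score: favor viewports near the equator (not from the paper)."""
    _, lat = viewport
    return math.cos(math.radians(lat)) + 1e-6  # small offset keeps weights positive


def sample_viewport_sequences(start, score_fn, num_sequences=3, length=5,
                              step_deg=30.0, seed=0):
    """Sample several viewport sequences (lon, lat in degrees) from one start point."""
    rng = random.Random(seed)
    # Assumed move set: the 8 neighbouring directions on a lon/lat grid.
    offsets = [(dx, dy) for dx in (-1, 0, 1) for dy in (-1, 0, 1) if (dx, dy) != (0, 0)]
    sequences = []
    for _ in range(num_sequences):
        lon, lat = start
        sequence = [(lon, lat)]
        for _ in range(length - 1):
            candidates = []
            for dx, dy in offsets:
                new_lon = (lon + dx * step_deg + 180.0) % 360.0 - 180.0  # wrap longitude
                new_lat = max(-90.0, min(90.0, lat + dy * step_deg))     # clamp latitude
                if (new_lon, new_lat) != (lon, lat):  # skip moves clamped back onto the current viewport
                    candidates.append((new_lon, new_lat))
            # Sample the next viewport with probability proportional to its score.
            weights = [score_fn(c) for c in candidates]
            lon, lat = rng.choices(candidates, weights=weights, k=1)[0]
            sequence.append((lon, lat))
        sequences.append(sequence)
    return sequences


sequences = sample_viewport_sequences((0.0, 0.0), equator_bias)
for seq in sequences:
    print(seq)
```

Each sampled sequence begins at the same starting point but can follow a different path, which loosely mirrors several assessors browsing the same image. In the paper, the transition probabilities come from content and detail information rather than the fixed latitude prior assumed here.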


:book:Network Architecture

[Figure: Assessor360 network architecture (image.png)]

<a name="visual_results"></a>:eyes:Dataset

We evaluate our model on four 2D VR IQA datasets: CVIQD, OIQA, MVAQD, and IQA-ODI.

To obtain the OIQA and MVAQD datasets, email their authors.

:1st_place_medal: Checkpoints for Assessor360

Open the linked website to download the pretrained model checkpoints.

| Training Set | Testing Set | Checkpoints |
| :---: | :---: | :---: |
| CVIQD dataset (16 reference images, 429 distorted images for training) | CVIQD dataset (99 distorted images for testing) | download<br />SRCC: 0.9801, PLCC: 0.9836 |
| OIQA dataset (16 reference images, 260 distorted images for training) | OIQA dataset (60 distorted images for testing) | download<br />SRCC: 0.9661, PLCC: 0.9757 |
| IQA-ODI dataset (120 reference images, 768 distorted images for training) | IQA-ODI dataset (192 distorted images for testing) | download<br />SRCC: 0.9476, PLCC: 0.9544 |
| MVAQD dataset (15 reference images, 240 distorted images for training) | MVAQD dataset (60 distorted images for testing) | download<br />SRCC: 0.9615, PLCC: 0.9687 |
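The checkpoints above are reported with SRCC (Spearman rank-order correlation) and PLCC (Pearson linear correlation) between predicted scores and MOS. The following is a minimal standard-library sketch of the textbook definitions of both metrics. The helper names are illustrative and are not taken from this repository, and the repository's own evaluation may differ (for example, some IQA protocols fit a nonlinear mapping before computing PLCC). The sample values are the first MOS and Assessor360 rows from the comparison table in this README.

```python
import math


def pearson(x, y):
    """Pearson linear correlation coefficient (PLCC) of two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks in ascending order; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average rank of the tied group
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank-order correlation (SRCC): Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# First example from the comparison table: MOS vs. Assessor360 predictions.
mos = [3.45, 2.95, 1.6, 1.1]
pred = [0.5933, 0.5213, 0.1220, 0.0120]
print("SRCC:", spearman(mos, pred))
print("PLCC:", pearson(mos, pred))
```

Because the predicted scores order the four distorted images exactly as the MOS does, the SRCC of this small example is 1, even though the two score scales differ.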

:hammer_and_wrench: Usage

Training Assessor360

model_name and type_name determine the file path used to save the checkpoint and log files.

python train_oiqa.py

Predicting one ODI quality score

python inference_one_image.py 

Acknowledgment

We would like to thank Xiangjie Sui for his great and patient guidance on this work, Xinzhe Ni for designing the model, and DiffBIR for its published README.md file.

Related Work

NTIRE2022 IQA No-Reference Competition

[CVPRW 2022] MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment. (1st place in NR track)


Citation

@article{wu2023assessor360,
  title={Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment},
  author={Wu, Tianhe and Shi, Shuwei and Cai, Haoming and Cao, Mingdeng and Xiao, Jing and Zheng, Yinqiang and Yang, Yujiu},
  journal={arXiv preprint arXiv:2305.10983},
  year={2023}
}