Stereoscopic Video Quality Assessment Using Modified Parallax Attention Module

Hassan Imani; Selim Zaim; Md Baharul Islam; Masum Shah Junayed

doi:10.1007/978-3-030-90421-0_4

Back

Book chapter

Stereoscopic Video Quality Assessment Using Modified Parallax Attention Module

Hassan Imani, Selim Zaim, Md Baharul Islam and Masum Shah Junayed

DIGITIZING PRODUCTION SYSTEMS, ISPR2021, pp.39-50

Lecture Notes in Mechanical Engineering, Springer Nature

01-01-2022

DOI: https://doi.org/10.1007/978-3-030-90421-0_4

Abstract

Computer Science, Artificial Intelligence

Engineering, Industrial

Engineering, Mechanical

Science & Technology

Computer Science

Engineering

Technology

Deep learning techniques are utilized for most computer vision tasks. Especially, Convolutional Neural Networks (CNNs) have shown great performance in detection and classification tasks. Recently, in the field of Stereoscopic Video Quality Assessment (SVQA), 3D CNNs are used to extract spatial and temporal features from stereoscopic videos, but the importance of the disparity information which is very important did not consider well. Most of the recently proposed deep learning-based methods mostly used cost volume methods to produce the stereo correspondence for large disparities. Because the disparities can differ considerably for stereo cameras with different configurations, recently the Parallax Attention Mechanism (PAM) is proposed that captures the stereo correspondence disregarding the disparity changes. In this paper, we propose a new SVQA model using a base 3D CNN-based network, and a modified PAM-based left and right feature fusion model. Firstly, we use 3D CNNs and residual blocks to extract features from the left and right views of a stereo video patch. Then, we modify the PAM model to fuse the left and right features with considering the disparity information, and using some fully connected layers, we calculate the quality score of a stereoscopic video. We divided the input videos into cube patches for data augmentation and remove some cubes that confuse our model from the training dataset. Two standard stereoscopic video quality assessment benchmarks of LFOVIAS3DPh2 and NAMA3DS1-COSPAD1 are used to train and test our model. Experimental results indicate that our proposed model is very competitive with the state-of-the-art methods in the NAMA3DS1-COSPAD1 dataset, and it is the state-of-the-art method in the LFOVIAS3DPh2 dataset.

Files and links (1)

url

Link to publisher.View

Metrics

18 Record Views

3 Times Cited - Web of Science

Details

Title: Stereoscopic Video Quality Assessment Using Modified Parallax Attention Module
Creators: Hassan Imani - Bahçeşehir University
Selim Zaim - Bahcesehir Univ, Fac Engn & Nat Sci, Istanbul, Turkey
Md Baharul Islam - Florida Gulf Coast University, Department of Computing and Software Engineering
Masum Shah Junayed - Bahçeşehir University
Publication Details: DIGITIZING PRODUCTION SYSTEMS, ISPR2021, pp.39-50
Series: Lecture Notes in Mechanical Engineering
Publisher: Springer Nature
Number of pages: 12
Grant note: 118C301 / Scientific and Technological Research Council of Turkey (TUBITAK); Turkiye Bilimsel ve Teknolojik Arastirma Kurumu (TUBITAK)
Identifiers: 99384070140506570
Academic Unit: Department of Computing and Software Engineering
Language: English
Resource Type: Book chapter

Stereoscopic Video Quality Assessment Using Modified Parallax Attention Module

Abstract

Files and links (1)

Related links

Metrics

Details