Abstract
With the proliferation of display devices with heterogeneous aspect ratios, video retargeting has received considerable research attention. Inconsistent retargeting can significantly degrade a video's spatial and temporal quality, particularly in extreme retargeting cases. Since no well-annotated datasets exist for video retargeting, deep learning-based techniques are rarely utilized. This paper proposes a method that learns to retarget videos by detecting salient areas and shifting them to appropriate locations. First, we segment the salient objects using a unified Transformer model. Then, using convolutional layers and a 1D-convolution-based shifting strategy, we shift and warp the objects to a suitable size and location in the frame. We also apply a frame interpolation technique to preserve temporal information. To train the network, we feed the retargeted frames to a variational auto-encoder that maps them back to the input frames. In addition, we design perceptual and wavelet-based loss functions to train our model. Thus, the network is trained in an unsupervised manner. Extensive qualitative and quantitative experiments and ablation studies on the DAVIS dataset show the superiority of the proposed method over existing state-of-the-art methods.