Logo image
AN EFFICIENT END-TO-END IMAGE COMPRESSION TRANSFORMER
Conference proceeding

AN EFFICIENT END-TO-END IMAGE COMPRESSION TRANSFORMER

Afsana Ahsan Jeny, Masum Shah Junayed and Md Baharul Islam
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, pp.1786-1790
IEEE International Conference on Image Processing ICIP
International Conference on Image Processing (Bordeaux, France, 2022–2022)
01-01-2022

Abstract

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic Science & Technology Computer Science Engineering Technology
Image and video compression received significant research attention and expanded their applications. Existing entropy estimation-based methods combine with hyperprior and local context, limiting their efficacy. This paper introduces an efficient end-to-end transformer-based image compression model, which generates a global receptive field to tackle the long-range correlation issues. A hyper encoder-decoder-based transformer block employs a multi-head spatial reduction self-attention (MHSRSA) layer to minimize the computational cost of the self-attention layer and enable rapid learning of multi-scale and high-resolution features. A Casual Global Anticipation Module (CGAM) is designed to construct highly informative adjacent contexts utilizing channel-wise linkages and identify global reference points in the latent space for end-to-end rate-distortion optimization (RDO). Experimental results demonstrate the effectiveness and competitive performance of the KODAK dataset.
url
Link to published article.View

Related links

Metrics

Details

Logo image