WNet: A dual‐encoded multi‐human parsing network

Md Imran Hosen; Tarkan Aydin; Md Baharul Islam

doi:10.1049/ipr2.13176

Journal article

Open access

Peer reviewed

IET image processing, Vol.18(12)

07-10-2024

DOI: https://doi.org/10.1049/ipr2.13176

Keywords: computer vision; image processing; image segmentation

Abstract

In recent years, multi‐human parsing has become a focal point of research, yet prevailing methods often rely on intermediate stages and lack pixel‐level analysis. Moreover, their high computational demands limit real‐world efficiency. To address these challenges and enable real‐time performance, a low‐latency end‐to‐end network, WNet, is proposed. The approach combines a vision transformer and a convolutional neural network in a dual‐encoded network, featuring a lightweight Transformer‐based vision encoder and a convolution encoder based on Darknet. This combination captures both long‐range dependencies and spatial relationships. A fuse block merges the features from the two encoders, and residual connections in the decoder improve information flow. Experimental validation on the Crowd Instance‐level Human Parsing (CIHP) and Look into Person (LIP) datasets demonstrates WNet's effectiveness, achieving multi‐human parsing at 26.7 frames per second. Ablation studies further support WNet's efficiency and accuracy in complex multi‐human parsing tasks.
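As a reading aid, the reported throughput of 26.7 frames per second can be converted into an average per‐frame time budget. The short sketch below does only that arithmetic. The function name `frame_latency_ms` is hypothetical, and the conversion assumes frames are processed one at a time, which the abstract does not state.

```python
def frame_latency_ms(fps: float) -> float:
    """Average time per frame in milliseconds for a given throughput.

    Assumption (not stated in the abstract): frames are processed
    sequentially, so latency is simply the inverse of throughput.
    """
    if fps <= 0:
        raise ValueError("fps must be positive")
    return 1000.0 / fps


reported_fps = 26.7  # throughput quoted in the abstract
latency = frame_latency_ms(reported_fps)
print(f"{latency:.1f} ms per frame")
```

Under that assumption, the figure corresponds to roughly 37.5 ms per frame. If the measurement used batching or pipelining, the true per‐frame latency could differ.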

Files and links (1)

https://doi.org/10.1049/ipr2.13176

Published (Version of record), Open

Metrics

20 Record Views

4 Times Cited - Web of Science

Details

Title: WNet: A dual‐encoded multi‐human parsing network
Creators: Md Imran Hosen (Manarat International University); Tarkan Aydin (Bahçeşehir University); Md Baharul Islam (Bahçeşehir University)
Publication Details: IET image processing, Vol.18(12)
Publisher: Wiley, Hoboken
Number of pages: 13
Identifiers: 99384064843106570
Academic Unit: Department of Computing and Software Engineering
Language: English
Resource Type: Journal article
