Pneumonia Detection Based on X-Ray and Blood Tests with Multimodal and Federated Learning

Alexandre R de Mello; Leticia A Cechinel; Leandro P da Silva; Claudio F. G. Santos; Patrick M de Faria; Marcio Biczyk; Leandro N de Castro; Vinicius M P Guirado; Rafael Marin Machado de Souza

doi:10.1007/978-3-032-23176-5_21

Book chapter

Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, pp.296-311

Lecture Notes in Computer Science, Springer Nature Switzerland

05-01-2026

Abstract

Keywords: Artificial Intelligence; Federated Learning; Multimodal AI

This paper presents a pneumonia detection approach based on Multimodal Federated Learning (MMFL) that combines real-world chest X-ray images and blood test results. The work compares two multimodal architectures, late fusion and intermediate (hybrid) fusion, for integrating image and tabular data to improve pneumonia identification. The anonymized dataset contains 2,343 entries from 2,201 patients after data cleaning. For image classification, Vision Transformer (ViT), EfficientNetV2, and Xception were evaluated; for tabular data, XGBoost was employed. For the federated setting, the study proposes a federated late fusion multimodal model in which each client trains a ViT model on images and an XGBoost model on tabular data, and the client models are subsequently aggregated on the server. Federated models were trained across multiple institutions, each with its own data partition. The centrally trained late fusion model using ViT and XGBoost achieved the best overall performance, with an accuracy of 95.40% and an AUC of 98.79%. The federated multimodal model proved to be a viable alternative when data is decentralized, reaching an accuracy of 90.33% and an AUC of 96.67% with two clients in the federation.
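
As an illustration of the late fusion idea mentioned in the abstract, the sketch below combines per-sample pneumonia probabilities from an image model and a tabular model with a weighted average. The abstract does not state which fusion rule the authors used, so the averaging rule, the function names `late_fusion` and `to_labels`, the weight `w_image`, and the 0.5 threshold are all assumptions made for this example, and the probabilities are made-up placeholder values rather than results from the paper.

```python
from typing import List, Sequence


def late_fusion(
    p_image: Sequence[float],
    p_tabular: Sequence[float],
    w_image: float = 0.5,
) -> List[float]:
    """Fuse two models' probabilities by weighted averaging (assumed rule).

    p_image:   per-sample probabilities from an image classifier (e.g. a ViT).
    p_tabular: per-sample probabilities from a tabular classifier (e.g. XGBoost).
    w_image:   hypothetical weight given to the image model, in [0, 1].
    """
    if len(p_image) != len(p_tabular):
        raise ValueError("both models must score the same samples")
    if not 0.0 <= w_image <= 1.0:
        raise ValueError("w_image must lie in [0, 1]")
    w_tabular = 1.0 - w_image
    return [w_image * pi + w_tabular * pt for pi, pt in zip(p_image, p_tabular)]


def to_labels(probs: Sequence[float], threshold: float = 0.5) -> List[int]:
    """Turn fused probabilities into 0/1 predictions at an assumed threshold."""
    return [int(p >= threshold) for p in probs]


# Placeholder scores for three hypothetical patients (not data from the paper).
image_scores = [0.9, 0.2, 0.6]
tabular_scores = [0.7, 0.4, 0.2]

fused = late_fusion(image_scores, tabular_scores)
labels = to_labels(fused)
```

In this scheme each modality keeps its own independently trained model and only the outputs are merged, which is what distinguishes late fusion from intermediate fusion, where learned features are merged before the final classifier.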

Details

Title: Pneumonia Detection Based on X-Ray and Blood Tests with Multimodal and Federated Learning
Creators: Alexandre R de Mello - Reference Center Foundation for Innovative Technologies
    Leticia A Cechinel - Reference Center Foundation for Innovative Technologies
    Leandro P da Silva - Reference Center Foundation for Innovative Technologies
    Claudio F. G. Santos
    Patrick M de Faria - Siemens (Brazil)
    Marcio Biczyk - Hospital São Paulo
    Leandro N de Castro - Universidade Estadual de Campinas (UNICAMP)
    Vinicius M P Guirado - Hospital São Paulo
    Rafael Marin Machado de Souza - Universidade Estadual de Campinas (UNICAMP)
Contributors: Deisy Chaves (Editor)
    Manuel Forero Vargas (Editor)
    Oswaldo Rojas Camacho (Editor)
Publication Details: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, pp.296-311
Series: Lecture Notes in Computer Science
Publisher: Springer Nature Switzerland; Cham
Identifiers: 99385964514906570
Language: English
Resource Type: Book chapter
