Comparative analysis of CNN, vision transformer, and hybrid architectures for white blood cancer classification

dc.contributor.authorManali, Dogu
dc.contributor.authorAkel, Mahmoud
dc.contributor.authorDemirel, Hasan
dc.date.accessioned2026-02-06T18:35:42Z
dc.date.issued2025
dc.departmentDoğu Akdeniz Üniversitesi
dc.description.abstractThis study presents a comparative analysis of 13 artificial intelligence-based classification architectures for white blood cell classification using microscopic images. The models include four convolutional neural networks, five vision transformers, and four hybrid convolutional neural network-transformer architectures. All architectures were trained and tested on a publicly available Kaggle dataset under similar experimental settings to ensure fair comparison. Among all the models, MobileViT-XS, which has hybrid architecture, achieved the highest F1-score of 98.76%, indicating exceptional classification performance across all white blood cell categories. CNN-based DenseNet121 followed closely with an F1 score of 98.65%, though it required significantly more training time. In contrast, Vision Transformers such as ViT-Base underperformed, with an F1-score of only 87.36%, despite higher parameter complexity. These results underscore that vision transformers often require architectural optimization to perform well in medical imaging tasks. Overall, the results demonstrate that hybrid architecture variant deliver more accurate predictions while requiring less computational power. Their lightweight architecture make promising future candidate for deployment in clinical and mobile healthcare settings.
dc.identifier.doi10.1007/s11760-025-05041-3
dc.identifier.issn1863-1703
dc.identifier.issn1863-1711
dc.identifier.issue18
dc.identifier.scopus2-s2.0-105025816173
dc.identifier.scopusqualityQ2
dc.identifier.urihttps://doi.org/10.1007/s11760-025-05041-3
dc.identifier.urihttps://hdl.handle.net/11129/12048
dc.identifier.volume19
dc.identifier.wosWOS:001647682300001
dc.identifier.wosqualityQ3
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherSpringer London Ltd
dc.relation.ispartofSignal Image and Video Processing
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.snmzKA_WoS_20260204
dc.subjectConvolutional neural networks
dc.subjectDeeplearning
dc.subjectHybrid model
dc.subjectVision transformer
dc.subjectWhiteblood cells
dc.titleComparative analysis of CNN, vision transformer, and hybrid architectures for white blood cancer classification
dc.typeArticle

Files