Patch Token Fusion in Vision Transformers for Brain Cancer Classification

dc.contributor.authorManali, Dogu
dc.contributor.authorDemirel, Hasan
dc.date.accessioned2026-02-06T18:17:14Z
dc.date.issued2025
dc.departmentDoğu Akdeniz Üniversitesi
dc.description33rd Conference on Signal Processing and Communications Applications-SIU-Annual -- JUN 25-28, 2025 -- Istanbul, TURKIYE
dc.description.abstractAccurate and robust image classification plays a critical role in advancing medical diagnostics, particularly in detecting complex conditions such as brain cancer. This study investigates the integration of multiple Vision Transformer (ViT) models for patch-token-based image classification, aiming to enhance diagnostic accuracy. By leveraging three pre-trained ViT architectures (TinyViT, SmallViT, and BaseViT), features from each model are dynamically extracted, aligned, and combined into a unified representation for classification. The proposed approach demonstrated significant improvements in accuracy, AUC, and F1-score when evaluated across various model combinations and configurations. The highest performance was observed with specific combinations, achieving an accuracy of 95.96%, AUC of 99.58%, and F1-score of 95.95% for the ViT-Tiny-based classifier.
dc.description.sponsorshipInstitute of Electrical and Electronics Engineers Inc
dc.identifier.doi10.1109/SIU66497.2025.11112405
dc.identifier.isbn979-8-3315-6656-2
dc.identifier.isbn979-8-3315-6655-5
dc.identifier.issn2165-0608
dc.identifier.scopus2-s2.0-105015459149
dc.identifier.scopusqualityN/A
dc.identifier.urihttps://doi.org/10.1109/SIU66497.2025.11112405
dc.identifier.urihttps://hdl.handle.net/11129/8862
dc.identifier.wosWOS:001575462500347
dc.identifier.wosqualityN/A
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isotr
dc.publisherIEEE
dc.relation.ispartof2025 33Rd Signal Processing and Communications Applications Conference, Siu
dc.relation.publicationcategoryKonferans Öğesi - Uluslararası - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.snmzKA_WoS_20260204
dc.subjectBrain cancer
dc.subjectMulti
dc.subjectmodel fusion
dc.subjectPatch token
dc.subjectVision Transformer
dc.titlePatch Token Fusion in Vision Transformers for Brain Cancer Classification
dc.typeConference Object

Files