Patch Token Fusion in Vision Transformers for Brain Cancer Classification
| dc.contributor.author | Manali, Dogu | |
| dc.contributor.author | Demirel, Hasan | |
| dc.date.accessioned | 2026-02-06T18:17:14Z | |
| dc.date.issued | 2025 | |
| dc.department | Doğu Akdeniz Üniversitesi | |
| dc.description | 33rd Conference on Signal Processing and Communications Applications-SIU-Annual -- JUN 25-28, 2025 -- Istanbul, TURKIYE | |
| dc.description.abstract | Accurate and robust image classification plays a critical role in advancing medical diagnostics, particularly in detecting complex conditions such as brain cancer. This study investigates the integration of multiple Vision Transformer (ViT) models for patch-token-based image classification, aiming to enhance diagnostic accuracy. By leveraging three pre-trained ViT architectures (TinyViT, SmallViT, and BaseViT), features from each model are dynamically extracted, aligned, and combined into a unified representation for classification. The proposed approach demonstrated significant improvements in accuracy, AUC, and F1-score when evaluated across various model combinations and configurations. The highest performance was observed with specific combinations, achieving an accuracy of 95.96%, AUC of 99.58%, and F1-score of 95.95% for the ViT-Tiny-based classifier. | |
| dc.description.sponsorship | Institute of Electrical and Electronics Engineers Inc | |
| dc.identifier.doi | 10.1109/SIU66497.2025.11112405 | |
| dc.identifier.isbn | 979-8-3315-6656-2 | |
| dc.identifier.isbn | 979-8-3315-6655-5 | |
| dc.identifier.issn | 2165-0608 | |
| dc.identifier.scopus | 2-s2.0-105015459149 | |
| dc.identifier.scopusquality | N/A | |
| dc.identifier.uri | https://doi.org/10.1109/SIU66497.2025.11112405 | |
| dc.identifier.uri | https://hdl.handle.net/11129/8862 | |
| dc.identifier.wos | WOS:001575462500347 | |
| dc.identifier.wosquality | N/A | |
| dc.indekslendigikaynak | Web of Science | |
| dc.indekslendigikaynak | Scopus | |
| dc.language.iso | tr | |
| dc.publisher | IEEE | |
| dc.relation.ispartof | 2025 33Rd Signal Processing and Communications Applications Conference, Siu | |
| dc.relation.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | |
| dc.rights | info:eu-repo/semantics/closedAccess | |
| dc.snmz | KA_WoS_20260204 | |
| dc.subject | Brain cancer | |
| dc.subject | Multi | |
| dc.subject | model fusion | |
| dc.subject | Patch token | |
| dc.subject | Vision Transformer | |
| dc.title | Patch Token Fusion in Vision Transformers for Brain Cancer Classification | |
| dc.type | Conference Object |










