Building discriminative features of scene recognition using multi-stages of inception-ResNet-v2
| dc.contributor.author | Khan, Altaf | |
| dc.contributor.author | Chefranov, Alexander | |
| dc.contributor.author | Demirel, Hasan | |
| dc.date.accessioned | 2026-02-06T18:34:18Z | |
| dc.date.issued | 2023 | |
| dc.department | Doğu Akdeniz Üniversitesi | |
| dc.description.abstract | Scene recognition is a challenging problem due to intra-class variations and inter-class similarities. Traditional methods and convolutional neural networks (CNN) represent the global spatial structure, which is suitable for general scene classification and object recognition, but show poor presentation for particular indoor or outdoor medium-scale scene datasets. In this manuscript, we study the local and global structures of image scene, and then combine both types of information for indoor and outdoor scenes to improve the scene recognition accuracy. Local region structure indicates sub-part of the scene, such as sky or ground, etc., and global structure indicates whole scene structure, such as sky-background-ground outdoor scene type. For this purpose, the multi-layer convolutional features of inception and residual-based architecture are used at intermediate and higher layers to preserve both local and global structures of image scene. Each layer used for feature extraction, is connected with the global average pooling to obtain a discriminative representation of the image scenes. In this way, local structure is explored at the intermediate convolutional layers, and global spatial structure is obtained from the higher layers. The proposed method is evaluated on 8-scene, 15-scene, UMC-21, MIT67, and 12-scene challenging datasets achieving 98.51%, 96.49%, 99.05%, 80.31%, and 84.88%, respectively, significantly outperforming state-of-the-art approaches. | |
| dc.identifier.doi | 10.1007/s10489-023-04460-4 | |
| dc.identifier.endpage | 18449 | |
| dc.identifier.issn | 0924-669X | |
| dc.identifier.issn | 1573-7497 | |
| dc.identifier.issue | 15 | |
| dc.identifier.scopus | 2-s2.0-85147039804 | |
| dc.identifier.scopusquality | Q1 | |
| dc.identifier.startpage | 18431 | |
| dc.identifier.uri | https://doi.org/10.1007/s10489-023-04460-4 | |
| dc.identifier.uri | https://hdl.handle.net/11129/11739 | |
| dc.identifier.volume | 53 | |
| dc.identifier.wos | WOS:000921805500003 | |
| dc.identifier.wosquality | Q2 | |
| dc.indekslendigikaynak | Web of Science | |
| dc.indekslendigikaynak | Scopus | |
| dc.language.iso | en | |
| dc.publisher | Springer | |
| dc.relation.ispartof | Applied Intelligence | |
| dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | |
| dc.rights | info:eu-repo/semantics/closedAccess | |
| dc.snmz | KA_WoS_20260204 | |
| dc.subject | Scene recognition | |
| dc.subject | 3D Scene geometry | |
| dc.subject | Deep feature extraction | |
| dc.subject | Local and global scene structure | |
| dc.subject | Score level fusion | |
| dc.title | Building discriminative features of scene recognition using multi-stages of inception-ResNet-v2 | |
| dc.type | Article |










