Exploring multicepstral features in a new classical machine learning-based framework for replay attack detection
| dc.contributor.author | Contreras, Rodrigo Colnago | |
| dc.contributor.author | Viana, Monique Simplicio | |
| dc.contributor.author | Fonseca, Everthon Silva | |
| dc.contributor.author | Bongarti, Marcelo Adriano dos Santos | |
| dc.contributor.author | Toygar, Onsen | |
| dc.contributor.author | Guido, Rodrigo Capobianco | |
| dc.date.accessioned | 2026-02-06T18:37:32Z | |
| dc.date.issued | 2025 | |
| dc.department | Doğu Akdeniz Üniversitesi | |
| dc.description.abstract | The integration of Internet of Things (IoT) technologies has accelerated the adoption of recognition and authentication systems, offering seamless access across devices from smart homes to workplace systems. Among biometric traits, voice stands out due to its simplicity, cleanliness, low capture cost, uniqueness, and the extensive computational resources supporting it in the scientific literature. Recently, however, spoofing risks have emerged as a serious challenge to the security of voice-based systems. To counteract these threats without additional hardware, techniques analyzing inherent voice signal features have been developed. This paper introduces a new soft computing framework based on classical machine learning classifiers such as Support Vector Machine (SVM), Random Forest (RF), and Logistic Regression (LR), comprising Gaussian-noise-based data augmentation, extraction and fusion of multiple cepstral and non-cepstral features, and dimensionality reduction through Singular Value Decomposition (SVD). In particular, we explore eight distinct cepstral extraction techniques, exemplified by popular approaches such as MFCC and CQCC, and sixteen additional non-cepstral metrics such as Zero Crossing Rate (ZCR) and Harmonic-to-Noise Ratio (HNR). Additionally, we generalize cepstral pattern representation by proposing cepstral multiprojection, a novel strategy designed to systematically reduce the dimensionality and redundancy of multicepstral matrices, thereby enhancing discriminative power and computational efficiency. Evaluated with the ASVSpoof 2017 v2.0 competition benchmark, our approach achieved competitive results, reaching 5.14% equal error rate (EER) on the Dev set and 10.58% on the Eval set, | |
| dc.identifier.doi | 10.1016/j.compeleceng.2025.110570 | |
| dc.identifier.issn | 0045-7906 | |
| dc.identifier.issn | 1879-0755 | |
| dc.identifier.orcid | 0000-0003-4003-7791 | |
| dc.identifier.orcid | 0000-0002-2960-8293 | |
| dc.identifier.orcid | 0000-0002-9027-7702 | |
| dc.identifier.orcid | 0000-0002-0924-8024 | |
| dc.identifier.orcid | 0000-0001-6202-0806 | |
| dc.identifier.orcid | 0000-0001-7402-9058 | |
| dc.identifier.scopus | 2-s2.0-105011523726 | |
| dc.identifier.scopusquality | Q1 | |
| dc.identifier.uri | https://doi.org/10.1016/j.compeleceng.2025.110570 | |
| dc.identifier.uri | https://hdl.handle.net/11129/12509 | |
| dc.identifier.volume | 127 | |
| dc.identifier.wos | WOS:001541574200001 | |
| dc.identifier.wosquality | Q1 | |
| dc.indekslendigikaynak | Web of Science | |
| dc.indekslendigikaynak | Scopus | |
| dc.language.iso | en | |
| dc.publisher | Pergamon-Elsevier Science Ltd | |
| dc.relation.ispartof | Computers & Electrical Engineering | |
| dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | |
| dc.rights | info:eu-repo/semantics/closedAccess | |
| dc.snmz | KA_WoS_20260204 | |
| dc.subject | Voice Liveness Detection | |
| dc.subject | Spoofing detection | |
| dc.subject | Pattern recognition | |
| dc.subject | Cepstral analysis | |
| dc.subject | Machine learning | |
| dc.title | Exploring multicepstral features in a new classical machine learning-based framework for replay attack detection | |
| dc.type | Article |










