Exploring multicepstral features in a new classical machine learning-based framework for replay attack detection

Contreras, Rodrigo Colnago; Viana, Monique Simplicio; Fonseca, Everthon Silva; Bongarti, Marcelo Adriano dos Santos; Toygar, Onsen; Guido, Rodrigo Capobianco

doi:10.1016/j.compeleceng.2025.110570

dc.contributor.author	Contreras, Rodrigo Colnago
dc.contributor.author	Viana, Monique Simplicio
dc.contributor.author	Fonseca, Everthon Silva
dc.contributor.author	Bongarti, Marcelo Adriano dos Santos
dc.contributor.author	Toygar, Onsen
dc.contributor.author	Guido, Rodrigo Capobianco
dc.date.accessioned	2026-02-06T18:37:32Z
dc.date.issued	2025
dc.department	Doğu Akdeniz Üniversitesi
dc.description.abstract	The integration of Internet of Things (IoT) technologies has accelerated the adoption of recognition and authentication systems, offering seamless access across devices from smart homes to workplace systems. Among biometric traits, voice stands out due to its simplicity, cleanliness, low capture cost, uniqueness, and the extensive computational resources supporting it in the scientific literature. Recently, however, spoofing risks have emerged as a serious challenge to the security of voice-based systems. To counteract these threats without additional hardware, techniques analyzing inherent voice signal features have been developed. This paper introduces a new soft computing framework based on classical machine learning classifiers such as Support Vector Machine (SVM), Random Forest (RF), and Logistic Regression (LR), comprising Gaussian-noise-based data augmentation, extraction and fusion of multiple cepstral and non-cepstral features, and dimensionality reduction through Singular Value Decomposition (SVD). In particular, we explore eight distinct cepstral extraction techniques, exemplified by popular approaches such as MFCC and CQCC, and sixteen additional non-cepstral metrics such as Zero Crossing Rate (ZCR) and Harmonic-to-Noise Ratio (HNR). Additionally, we generalize cepstral pattern representation by proposing cepstral multiprojection, a novel strategy designed to systematically reduce the dimensionality and redundancy of multicepstral matrices, thereby enhancing discriminative power and computational efficiency. Evaluated with the ASVSpoof 2017 v2.0 competition benchmark, our approach achieved competitive results, reaching 5.14% equal error rate (EER) on the Dev set and 10.58% on the Eval set,
dc.identifier.doi	10.1016/j.compeleceng.2025.110570
dc.identifier.issn	0045-7906
dc.identifier.issn	1879-0755
dc.identifier.orcid	0000-0003-4003-7791
dc.identifier.orcid	0000-0002-2960-8293
dc.identifier.orcid	0000-0002-9027-7702
dc.identifier.orcid	0000-0002-0924-8024
dc.identifier.orcid	0000-0001-6202-0806
dc.identifier.orcid	0000-0001-7402-9058
dc.identifier.scopus	2-s2.0-105011523726
dc.identifier.scopusquality	Q1
dc.identifier.uri	https://doi.org/10.1016/j.compeleceng.2025.110570
dc.identifier.uri	https://hdl.handle.net/11129/12509
dc.identifier.volume	127
dc.identifier.wos	WOS:001541574200001
dc.identifier.wosquality	Q1
dc.indekslendigikaynak	Web of Science
dc.indekslendigikaynak	Scopus
dc.language.iso	en
dc.publisher	Pergamon-Elsevier Science Ltd
dc.relation.ispartof	Computers & Electrical Engineering
dc.relation.publicationcategory	Article - International Refereed Journal - Institutional Faculty Member
dc.rights	info:eu-repo/semantics/closedAccess
dc.snmz	KA_WoS_20260204
dc.subject	Voice Liveness Detection
dc.subject	Spoofing detection
dc.subject	Pattern recognition
dc.subject	Cepstral analysis
dc.subject	Machine learning
dc.title	Exploring multicepstral features in a new classical machine learning-based framework for replay attack detection
dc.type	Article
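
The abstract names three generic building blocks that can be illustrated independently of the paper: Gaussian-noise data augmentation, SVD-based dimensionality reduction, and the equal error rate (EER) used for evaluation. The following is only a minimal sketch under stated assumptions, not the authors' implementation. The function names, the `snr_db` parameter, and the threshold sweep used for the EER are hypothetical choices made here for illustration; the record does not say how the paper configures any of these steps.

```python
import numpy as np


def add_gaussian_noise(signal, snr_db, rng):
    """Return `signal` plus white Gaussian noise at roughly `snr_db` dB SNR.

    Assumption: noise is scaled to a target SNR; the record does not
    state how the paper sets the noise level.
    """
    signal = np.asarray(signal, dtype=float)
    signal_power = np.mean(signal ** 2)
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=signal.shape)
    return signal + noise


def svd_reduce(features, k):
    """Project the rows of `features` onto the top-k right singular vectors.

    Returns the reduced matrix (n_samples x k) and the k components,
    so that reduced @ components is the best rank-k approximation.
    """
    u, s, vt = np.linalg.svd(features, full_matrices=False)
    return u[:, :k] * s[:k], vt[:k]


def equal_error_rate(genuine_scores, spoof_scores):
    """Approximate the EER by sweeping thresholds over the observed scores.

    Convention assumed here: higher score means "more likely genuine".
    FAR = fraction of spoof trials accepted, FRR = fraction of genuine
    trials rejected; the EER is taken where the two are closest.
    """
    genuine = np.asarray(genuine_scores, dtype=float)
    spoof = np.asarray(spoof_scores, dtype=float)
    thresholds = np.unique(np.concatenate([genuine, spoof]))
    far = np.array([(spoof >= t).mean() for t in thresholds])
    frr = np.array([(genuine < t).mean() for t in thresholds])
    i = np.argmin(np.abs(far - frr))
    return (far[i] + frr[i]) / 2.0
```

In a pipeline of the kind the abstract describes, the augmentation would be applied to training waveforms, the reduction to the fused feature matrix, and the EER to the classifier scores on the Dev and Eval sets; the classifiers themselves (SVM, RF, LR) are omitted here.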

Collections

WoS Indexed Publications Collection
Scopus Indexed Publications Collection
