A CUDA based Parallel Implementation of Speaker Verification System

DSpace Home
→
02 Faculty of Engineering
→
Department of Computer Engineering
→
Theses (Master's and Ph.D) – Computer Engineering
→
View Item

dc.contributor.author	Azhari, Mohammad
dc.date.accessioned	2012-11-29T12:09:22Z
dc.date.available	2012-11-29T12:09:22Z
dc.date.issued	2011
dc.identifier.citation	Azhari, Mohammad. (2011). A CUDA based Parallel Implementation of Speaker Verification System. Thesis (M.S.), Eastern Mediterranean University, Institute of Graduate Studies and Research, Dept. of Computer Engineering, Famagusta: North Cyprus.	en_US
dc.identifier.uri	http://hdl.handle.net/11129/56
dc.description	Master of Science in Computer Engineering. Thesis (M.S.)--Eastern Mediterranean University, Faculty of Engineering, Dept. of Computer Engineering, 2011. Supervisor: Assist. Prof. Dr. Cem Ergün.	en_US
dc.description.abstract	ABSTRACT: Speaker Verification (SV) is a type of speaker recognition that validates the identity of a claimed person by his/her voice. Training the models from large speech data requires a significant amount of memory and computational load. In this thesis we present a parallel implementation of speaker verification system based on Gaussian Mixture Modeling – Universal Background Modeling (GMM – UBM) designed for many-core architecture of NVIDIA’s Graphics Processing Units (GPU) using CUDA single instruction multiple threads (SIMT) model. CUDA implementation of these algorithms is designed in such a way that the speed of computation of the algorithm increases with number of GPU cores. In our experiments we have achieved 30 times speedup for k-means clustering and 65 times speedup for Expectation Maximization (EM) for an input of about 350K frames of 16 dimensions and 1024-2048 mixtures on GeForce GTX 570 (NVIDIA Fermi Series) with 480 cores when compared to a single threaded implementation on the traditional CPU. Keywords: Speaker Verification, Gaussian Mixture Models, Parallel Computing, Compute Unified Device Architecture, General-purpose computing on graphics processing units. …………………………………………………………………………………………………………………………………………………………………………………………………………………… ÖZ: Konuşmacı tanıma işlemlerinden olan konuşmacı doğrulama sisteminde iddia edilen konuşmacının sesinin doğruluğu onaylanır. Konuşmacıların modelleri eğitilirken önemli miktarda bellek ve işlem yükü gerektirir. Bu tezde biz konuşmacı dogrulama sistemini Gauss Karışım Modeli- Evrensel Arkaplan Modelleme tekniği (UBM-GMM) kullanılarak eğittik. Eğitim aşmasını hızlandırmak için seçilen paralel uygulama modeli CUDA teknolojili, tek komutlu çok izgeli (SIMT) işlemci sistemini destekleyen ve çoklu çekirdek desteği olan NVIDIA Grafik İşleme Üniteleri (GPU) kullanılarak gerçekleştirilmiştir. CUDA kullanılarak tasarlanan uygulamarın hesaplama hızı, GPU daki çekirdek sayısına bağlı olarak artmaktadır. Deneysel sonuçlara göre, 350K penceresi ve 16 boyutu olan öznitelik vektörleri k-ortalamala kümeleme algoritmasının paralelleştirilmesi ile elde edilen hızlanma faktörü 65 kat, aynı sayıda öznitelik vektörlerinin 2048 karışımlı GMM datasının Enbüyütme Beklentisi Algoritmasına sokulmasıyla elde edilen hızlanma faktörü 65 kat olarak gerçekleşmiştir. Anahtar Kelimeler: Konuşmacı Doğrulama, Gauss Karışım Modelleri, Paralel Hesaplama, Hesap Birleşik Aygıt Mimarisi, Grafik İşleme Ünitelerinde Genel-Amaçlı Hesapla.	en_US
dc.language.iso	en	en_US
dc.publisher	Eastern Mediterranean University (EMU)	en_US
dc.subject	Computer Engineering	en_US
dc.subject	Automatic speech recognition	en_US
dc.subject	Speaker Identification	en_US
dc.subject	Speaker Verification - Gaussian Mixture Models - Parallel Computing	en_US
dc.subject	Compute Unified Device Architecture	en_US
dc.subject	General Purpose Computing on Graphics Processing Units	en_US
dc.title	A CUDA based Parallel Implementation of Speaker Verification System	en_US
dc.type	Thesis	en_US