Markov Model Based Real Time Speaker Recognition using K-Means, Fast Fourier Transform and Mel Frequency Cepstral Coefficients

Home Page
About
Submit A Journal
Submit A Conference
Submit Paper/Book
- Submit a Preprint
- Submit a Book
Publisher/Editor Panel
- Sign In/Sign Up

Celal Bayar Üniversitesi Fen Bilimleri Dergisi
Vol: 15 Issue: 3
Markov Model Based Real Time Speaker Recognition using K-Means, Fast Fourier Transform and Mel Frequ...

Markov Model Based Real Time Speaker Recognition using K-Means, Fast Fourier Transform and Mel Frequency Cepstral Coefficients

Authors : Emin Borandağ

Pages : 287-292

Doi:10.18466/cbayarfbe.556936

View : 27 | Download : 5

Publication Date : 2019-09-30

Article Type : Research

Abstract :In this study, which was carried out using a combination of machine learning and sound processing methods, a speaker recognition system and application were developed using real-time Mel Frequency Cepstral Coefficients (MFCC) features and Markov chain model classifier. A sound sample was taken from each speaker for the training of the system and these sound samples were processed in Fast Fourier Transform and MFCC feature extraction algorithms. The MFCC features were clustered using the k-means clustering algorithm. A Markov chain model was created for each speaker by using the outputs obtained after clustering. By deducting the characteristic features of the voice of the speaker, the person who was talking in the society and how long and at which time intervals they spoke during the conversation was determined in real time with high accuracy.
Keywords : Real time speaker recognition, Mel-Frequency, K-Means, Machine Learning, Markov Chain, Fast Fourier Transform

ORIGINAL ARTICLE URL

VIEW PAPER (PDF)

All Rights Reserved. İzmir Akademi Derneği
CopyRight © 2024