Speech recognition based on convolutional neural networks and MFCC algorithm

Home Page
About
Submit A Journal
Submit A Conference
Submit Paper/Book
- Submit a Preprint
- Submit a Book
Publisher/Editor Panel
- Sign In/Sign Up

Advances in Artificial Intelligence Research
Volume:1 Issue:1
Speech recognition based on convolutional neural networks and MFCC algorithm

Speech recognition based on convolutional neural networks and MFCC algorithm

Authors : Arzo MAHMOOD, Utku KÖSE

Pages : 6-12

View : 27 | Download : 7

Publication Date : 2021-01-15

Article Type : Research Paper

Abstract :In this paper, an automatic speech recognition system based on convolutional neural networks and MFCC has been proposed. We have been investigated some deep models’ architecture with various hyperparameters options such as Dropout rate and Learning rate. The dataset used in this paper was collected from Kaggle TensorFlow Speech Recognition Challenge. Each audio file in the dataset contain one word with one second length the total words in the dataset correspond to 30 categories with one category for background noise. The dataset contains 64,721 files has been separated into 51,088 for the training set, 6,798 for the validation set and 6,835 for the testing set. We have evaluated 3 models with different hyperparameters configuration in order to choose the best model with higher accuracy. The highest accuracy achieved is 88.21%.
Keywords : convolutional neural networks, FFT, MFCC, speech recognition, feature extraction

ORIGINAL ARTICLE URL

VIEW PAPER (PDF)

All Rights Reserved. İzmir Akademi Derneği
CopyRight © 2025