Military Institute of Science and Technology
MIST Digital Archive

Multi Layer Neural Network Based Automatic Word Recognition for Bangla Spoken Language

dc.contributor.authorIstiak Ahmed, Md
dc.contributor.authorRidwan, Iffatur
dc.contributor.authorAhmed Fuad, Tanvir
dc.date.accessioned2015-07-05T04:50:42Z
dc.date.available2015-07-05T04:50:42Z
dc.date.issued2014-12
dc.descriptionen_US
dc.description.abstractAutomatic speech recognition (ASR) known as speech recognition is a computer technology that enables a device to recognize and understand spoken words and sentences, by digitizing the sound and matching its pattern against the stored patterns. In short, it is the conversion of spoken speech to text. Currently available devices are largely speaker-dependent and can recognize discrete speech better than the normal (continuous) speech. Speaker independent system recognizes speech of indefinite multiple people. In our research, we have used a system which is speaker independent and can detect continuous speech. Their major applications are in assistive for helping peopleinworkingaroundtheirdisabilities. OurproposedBanglaspeechsystem,based on MFCC+Neural Network+Triphone is a new approach towards the field of Bangla ASR system. For this thesis work, we have prepared a Bangla speech recognition system of Bangla ASR. Most of the Bangla ASR system uses a small number of speakers, but 30 speakers selected from a wide area of Bangladesh, where Bangla is used as a native language, are involved here. In the experiments, Mel-Frequency Cepstral Coefficients (MFCCs) and the result based on (recognized by) Neural Network are inputted to the Hidden Markov Model (HMM) based classifiers for obtaining speech recognition performance. Other than the traditional MFCC triphone model; a new method that have used Neural Network based triphone model had been experimented to get better ASR performance. We used k-mean clustering for the proposed method. From the experimental results, word correct rate and word accuracy for male and female voices distinctly provide much better result for the proposed model based on Neural Network than MFCC-38 as well as MFCC-39. So, our proposed system is in favor of gender independent fact. For male and female voices collectively, sometimes MFCC-39 based model and sometimes Neural Network based model shows better word accuracy and correct rate.en_US
dc.description.sponsorshipDepartment of Computer Science and Engineering, Military Institute of Science and Technologyen_US
dc.identifier.urihttp://hdl.handle.net/123456789/149
dc.language.isoenen_US
dc.publisherDepartment of Computer Science and Engineering, Military Institute of Science and Technologyen_US
dc.relation.ispartofseriesB.Sc. in Computer Science and Engineering Thesis;
dc.subjectMulti Layer, Neural Network, Automatic, Word, Recognition, Bangla Spoken Languageen_US
dc.titleMulti Layer Neural Network Based Automatic Word Recognition for Bangla Spoken Languageen_US
dc.typeThesisen_US

Files

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: