Multi Layer Neural Network Based Automatic Word Recognition for Bangla Spoken Language

Istiak Ahmed, Md; Ridwan, Iffatur; Ahmed Fuad, Tanvir

MIST Central Library Repository

dc.contributor.author	Istiak Ahmed, Md
dc.contributor.author	Ridwan, Iffatur
dc.contributor.author	Ahmed Fuad, Tanvir
dc.date.accessioned	2015-07-05T04:50:42Z
dc.date.available	2015-07-05T04:50:42Z
dc.date.issued	2014-12
dc.identifier.uri	http://hdl.handle.net/123456789/149
dc.description		en_US
dc.description.abstract	Automatic speech recognition (ASR), also known simply as speech recognition, is a computer technology that enables a device to recognize and understand spoken words and sentences by digitizing the sound and matching its pattern against stored patterns. In short, it is the conversion of speech to text. Currently available devices are largely speaker-dependent and recognize discrete speech better than normal (continuous) speech. A speaker-independent system recognizes the speech of an arbitrary number of different people. In our research, we used a system that is speaker-independent and can detect continuous speech. A major application of such systems is in assistive technology, helping people work around their disabilities. Our proposed Bangla speech system, based on MFCC + Neural Network + Triphone, is a new approach in the field of Bangla ASR. For this thesis, we prepared a Bangla speech recognition system. Most Bangla ASR systems use a small number of speakers, but 30 speakers selected from a wide area of Bangladesh, where Bangla is the native language, are involved here. In the experiments, Mel-Frequency Cepstral Coefficients (MFCCs) and the output of the neural network are fed to Hidden Markov Model (HMM) based classifiers to measure speech recognition performance. In addition to the traditional MFCC triphone model, a new method using a neural-network-based triphone model was tested to obtain better ASR performance. We used k-means clustering for the proposed method. In the experimental results, for male and female voices evaluated separately, the proposed neural-network-based model gives a much better word correct rate and word accuracy than both MFCC-38 and MFCC-39. These results favor the proposed system as a gender-independent recognizer. 
For male and female voices evaluated together, sometimes the MFCC-39 based model and sometimes the neural-network-based model shows the better word accuracy and word correct rate.	en_US
dc.description.sponsorship	Department of Computer Science and Engineering, Military Institute of Science and Technology	en_US
dc.language.iso	en	en_US
dc.publisher	Department of Computer Science and Engineering, Military Institute of Science and Technology	en_US
dc.relation.ispartofseries	B.Sc. in Computer Science and Engineering Thesis;
dc.subject	Multi Layer, Neural Network, Automatic, Word, Recognition, Bangla Spoken Language	en_US
dc.title	Multi Layer Neural Network Based Automatic Word Recognition for Bangla Spoken Language	en_US
dc.type	Thesis	en_US
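
The abstract reports word correct rate and word accuracy but does not define them. The sketch below assumes the definitions commonly used for HMM-based recognizers: with N reference words, S substitutions, D deletions and I insertions, correct rate = (N - S - D) / N and accuracy = (N - S - D - I) / N. The function names and the example sentences are illustrative only and are not taken from the thesis.

```python
from typing import List, Tuple


def align_counts(ref: List[str], hyp: List[str]) -> Tuple[int, int, int, int]:
    """Return (hits, substitutions, deletions, insertions) for a
    minimum-edit-distance alignment of a hypothesis against a reference."""
    rows, cols = len(ref) + 1, len(hyp) + 1
    # Each cell holds (edit cost, hits, subs, dels, ins) for ref[:i] vs hyp[:j].
    table = [[(0, 0, 0, 0, 0)] * cols for _ in range(rows)]
    for i in range(1, rows):
        table[i][0] = (i, 0, 0, i, 0)  # only deletions
    for j in range(1, cols):
        table[0][j] = (j, 0, 0, 0, j)  # only insertions
    for i in range(1, rows):
        for j in range(1, cols):
            c, h, s, d, n = table[i - 1][j - 1]
            if ref[i - 1] == hyp[j - 1]:
                diag = (c, h + 1, s, d, n)      # correct word
            else:
                diag = (c + 1, h, s + 1, d, n)  # substitution
            c, h, s, d, n = table[i - 1][j]
            dele = (c + 1, h, s, d + 1, n)      # reference word missing
            c, h, s, d, n = table[i][j - 1]
            ins = (c + 1, h, s, d, n + 1)       # extra hypothesis word
            table[i][j] = min(diag, dele, ins, key=lambda t: t[0])
    return table[-1][-1][1:]


def word_correct_rate(n: int, s: int, d: int) -> float:
    """Percentage of reference words recognized correctly (ignores insertions)."""
    return 100.0 * (n - s - d) / n


def word_accuracy(n: int, s: int, d: int, i: int) -> float:
    """Like the correct rate, but insertions are also penalized."""
    return 100.0 * (n - s - d - i) / n


# Hypothetical reference and recognizer output, as word lists.
reference = "a b c d".split()
hypothesis = "a x c d e".split()
hits, subs, dels, ins = align_counts(reference, hypothesis)
print(word_correct_rate(len(reference), subs, dels))   # 75.0
print(word_accuracy(len(reference), subs, dels, ins))  # 50.0
```

Because accuracy also subtracts insertions, it can never exceed the correct rate, which is why the two figures are reported side by side.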

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Bachelor's Thesis [49]
