Speech Recognition of Isolated Words using a New Speech Database in Sylheti
Gautam Chakraborty1, Navajit Saikia2
1Gautam Chakraborty, Department of Electronics & Telecommunication Engineering, Assam Engineering College, Guwahati, India.
2Navajit Saikia, Department of Electronics & Telecommunication Engineering, Assam Engineering College, Guwahati, India.
Manuscript received on 13 August 2019. | Revised Manuscript received on 17 August 2019. | Manuscript published on 30 September 2019. | PP: 6259-6268 | Volume-8 Issue-3 September 2019 | Retrieval Number: C5874098319/2019©BEIESP | DOI: 10.35940/ijrte.C5874.098319
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: With advances in artificial intelligence, speech recognition based applications have become increasingly popular in recent years. Researchers in many areas, including linguistics, engineering, and psychology, have been addressing various aspects of speech recognition in natural languages around the globe. Although many interactive speech applications are being developed for “well-resourced” major languages, their use is still limited by the language barrier. Hence, researchers have also been concentrating on designing speech recognition systems for various under-resourced languages. Sylheti is one such under-resourced language, spoken primarily in the Sylhet division of Bangladesh and also in the southern part of Assam, India. This paper makes two contributions: i) it presents a new speech database of isolated words for the Sylheti language, and ii) it presents speech recognition systems that recognize isolated Sylheti words using two variants of neural network classifiers. The performance of these recognition systems is evaluated on the proposed database, and the observations are presented.
Keywords: Automatic Speech Recognition, Mel Frequency Cepstral Coefficient, Sylheti, Under-resourced Language, Feed-Forward Neural Network, Recurrent Neural Network.
Scope of the Article: Pattern Recognition