Sinhala speech recognition using hidden Markov based model and deep neural networks based model for number sequences

Show simple item record

dc.contributor.author Azir, M.Y.M.
dc.contributor.author Lorensuhewa, S.A.S.
dc.contributor.author Kalyani, M.A.L.
dc.date.accessioned 2021-12-20T10:03:03Z
dc.date.available 2021-12-20T10:03:03Z
dc.date.issued 2021-02-17
dc.identifier.issn 1391-8796
dc.identifier.uri http://ir.lib.ruh.ac.lk/xmlui/handle/iruor/4704
dc.description.abstract Speech recognition for Sinhala, which is a less-resourced language, has seen only a few pieces of research for Automatic Speech Recognition. Identifying the performance of the Sinhala speech in the modern approaches and toolkits will be helpful for the future works of Sinhala speech recognition. Although numbers are written mostly similarly in most of the languages, the verbal pronunciation differs according to the language. Through this research, an attempt has been made to recognize the number sequences spoken in Sinhala Language using the Hidden Markov Model based Speech Recognizer and to compare it with the performance of Deep Neural Network Based Speech recognition model which used a Multilayer Perceptron Architecture. Readily available state-of-the-art Automatic Speech Recognition toolkits such as Kaldi ASR toolkit, PyTorch-Kaldi are used to build the Speech Recognition Models. A speech corpus of Sinhala number sequences was also proposed exclusively for this study where the corpus was used to compare the performance of the two models. Finally, a rule-based approach was proposed which will map the spell out of numbers in Sinhala to their numeric forms. The Hidden Markov Model-Based approach produced an average accuracy of 18.04% of Word Error Rate and it was improved with the use of Deep Neural Networks to a Word Error Rate of 4.20%. en_US
dc.language.iso en en_US
dc.publisher Faculty of Science, University of Ruhuna, Matara, Sri Lanka en_US
dc.subject Speech Recognition en_US
dc.subject Hidden Markov Model en_US
dc.subject Deep Neural Network en_US
dc.subject Kaldi en_US
dc.subject PyTorch-Kaldi en_US
dc.title Sinhala speech recognition using hidden Markov based model and deep neural networks based model for number sequences en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account