Sinhala word suggestion algorithm for ad hoc Romanized Sinhala transliterations using a Trie.

Show simple item record

dc.contributor.author Sumanathilaka, T.G.D.K.
dc.contributor.author Weerasinghe, R.
dc.contributor.author Priyadarshana, H.Y.P.P.
dc.date.accessioned 2023-02-08T04:16:12Z
dc.date.available 2023-02-08T04:16:12Z
dc.date.issued 2023-01-18
dc.identifier.issn 1391-8796
dc.identifier.uri http://ir.lib.ruh.ac.lk/xmlui/handle/iruor/10855
dc.description.abstract With the revolution in social technology, Sinhala and Romanized Sinhala became the main language among the general Sri Lankan community. Informal shorthand typing used with Romanized Sinhala encourages the researchers to dive into the new arena of transliterations. As Sinhala is a lowresource language, the current system uses a rule-based approach for transliteration and suggestion generation on Romanized Sinhala to Sinhala. Therefore, different shorthand typing-based word predictions cannot be achieved. This proposed novel Suggestion transliterator uses an enhanced Trie which is an efficient information retrieval data structure for word prediction. The survey collected was used to identify the different typing patterns and adapted them as rules. Based on the rules, Sinhala dictionary was annotated and used to train the Trie. The trained model was used to identify the possible word prediction. The Romanized Sinhala words predicted by the model are compared with a Romanized Sinhala to Sinhala Knowledge base, which will return the unique Sinhala words as the suggestions. As an example, the shorthand Romanized Sinhala word “Adaraya” can be transliterated and suggested to its Sinhala representation as “ආදරය, ආදාරය”. The model was tested with 200 unique Romanized Sinhala test data. Each Romanized Sinhala sentence was fed to the model and word-level suggestions were compared with the expected output. The model achieved a word-level prediction accuracy of 84%. So, this novel transliterator can gap the ambiguity issue in Romanized Sinhala to Sinhala transliterations which will help future products to enhance the typing experience of their Romanized Sinhala users. en_US
dc.language.iso en en_US
dc.publisher Faculty of Science, University of Ruhuna, Matara, Sri Lanka en_US
dc.subject Romanized Sinhala en_US
dc.subject Rule-based approach en_US
dc.subject Suggestion en_US
dc.subject Trie en_US
dc.subject Transliteration en_US
dc.title Sinhala word suggestion algorithm for ad hoc Romanized Sinhala transliterations using a Trie. en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account