Profanity word detection for Sinhala language using deep learning

Show simple item record

dc.contributor.author Kumara, S.K.H.R.S.
dc.contributor.author Jayaneththi, J.K.D.B.G.
dc.date.accessioned 2023-02-10T09:33:02Z
dc.date.available 2023-02-10T09:33:02Z
dc.date.issued 2023-01-18
dc.identifier.issn 1391-8796
dc.identifier.uri http://ir.lib.ruh.ac.lk/xmlui/handle/iruor/11022
dc.description.abstract In the present world, content censorship is an important concept. When it comes to the Sinhala language, several studies have been conducted on textbased content censorship methods, but not for audio content. The Sri Lankan government prohibits the use of profanity in public media. Therefore, Sri Lankan media companies must check their videos for profanity before telecasting. Till now, this process has been done manually, and it is extremely difficult with long videos and audio clips. This study suggests developing a deep learning model that can automatically find profanity words in Sinhala audio files. The ten profanity words were selected and audio samples from 100 people were gathered. The data was preprocessed, transformed into spectrogram images, and applied to a convolutional neural network (CNN) to develop the profanity filter model. By converting audio files to spectrograms and applying image processing to extract the features from the dataset, the model predicts the profanity words. This paper addresses the procedure of the mentioned process and its capabilities with upcoming updated versions of the final product. en_US
dc.language.iso en en_US
dc.publisher Faculty of Science, University of Ruhuna, Matara, Sri Lanka en_US
dc.subject Profanity detection en_US
dc.subject Sinhala language en_US
dc.subject Deep learning en_US
dc.subject Audio processing en_US
dc.title Profanity word detection for Sinhala language using deep learning en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account