Uncovering the multifaceted nature of fake news: Latent dirichlet allocation topic modelling approach

Show simple item record

dc.contributor.author Abishethvarman, V.
dc.contributor.author Banujan, K.
dc.contributor.author Nirubikaa, R.
dc.contributor.author Kumara, B.T.G.S.
dc.date.accessioned 2024-03-18T06:38:20Z
dc.date.available 2024-03-18T06:38:20Z
dc.date.issued 2024-01-24
dc.identifier.issn 1391-8796
dc.identifier.uri http://ir.lib.ruh.ac.lk/xmlui/handle/iruor/16509
dc.description.abstract Disseminating fake news is a critical challenge in the digital information world. Misleading or fabricated information, masquerading as reliable news, has impacted public opinion and information integrity. Fake news is subjective to its context. This research aims to uncover diverse thematic structures within spanning topics of fake news through Latent Dirichlet Allocation (LDA) and visualizers. The well-established fake news labeled dataset was taken and then we followed the preprocessing techniques such as text normalization, punctuation, stop words, and custom word removal. We extracted features through CountVectorizer and moved for the topic modeling. The LDA model was employed to find ten topics. As a result, topics encompass keywords which are diverse themes, from everyday discourse to electoral dynamics, governance, and global affairs. Some topics underscore the role of news outlets in propagating misinformation, emphasizing the need for source scrutiny. Continued iterations made this more accurate in approach. Additionally, this research delves into political scenarios, gender issues, investigations, law enforcement, and political campaigns, revealing the multifaceted nature of fake news challenges as topics analyzed through the LDA visualizer. Visualizer helps to differentiate the nuanced dimensions of fake news between each topic. Each topic was analyzed through its semantic value in the fake news. The results emphasize the commonly used words in the fake news corpus. That pinpointed the triggering words in fake news, distinguishing them from reliable news through topic modeling and analysis. This methodology can be extended to user profiling based on the extracted keywords, into characteristics of individuals interacting with online content. en_US
dc.language.iso en en_US
dc.publisher Faculty of Science, University of Ruhuna, Matara, Sri Lanka en_US
dc.subject Fake News en_US
dc.subject Latent Dirichlet Allocation (LDA) en_US
dc.subject Topic Modelling en_US
dc.subject LDAvisualizer en_US
dc.title Uncovering the multifaceted nature of fake news: Latent dirichlet allocation topic modelling approach en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account