A classifier ensemble model for music emotion classification

Show simple item record

dc.contributor.author Charles, J.
dc.contributor.author Lekamge, L.S.
dc.date.accessioned 2023-03-20T03:49:05Z
dc.date.available 2023-03-20T03:49:05Z
dc.date.issued 2020-01-22
dc.identifier.issn 1391-8796
dc.identifier.uri http://ir.lib.ruh.ac.lk/xmlui/handle/iruor/11949
dc.description.abstract Computational modelling of music-emotion has attracted an increasing attention especially in today’s digital age. Even though the use of conventional machine learning algorithms for music-emotion classification are frequently reported in literature, the studies frequently use western/western classical music while traditional melodies like Sri Lankan folk music remains less explored. Therefore, we considered a Sri Lankan folk music dataset introduced by the authors in a previous study, comprising of 206 stimuli (30 seconds, 44100Hz; stereo; 32bit; .wav). The stimuli were purposely composed and orchestrated to express happy (54), sad (70), or fear (82) as predominant emotions and the emotion-annotation was by a panel of musicologists. Twenty-two features related to dynamics, rhythm, timbre, pitch, and tonality were extracted using MATLAB MIRToolbox. Five individual classifiers (Logistic Regression (LR), Naive Bayes, Decision Tree, Random Forest (RF), and k-Nearest Neighbor (k-NN)) were applied on the dataset. K-NN outperformed the others yielding an accuracy of 78.44% with 76.19% and 73.42% for RF and LR respectively. Above classifiers were considered in an ensemble model using Max-voting. Results were enhanced with ensemble boosting techniques (AdaBoost, Gradient Boosting, and XGBoost). With optimized features, AdaBoost (RF) yielded the highest accuracy (95.23%) while reducing the classifier training time significantly. The classifier outperformed the state-of-the art results, but a direct comparison was not possible due to the discrepancies brought by individual datasets, listener populations, etc. in each study. Utilizing features such as Mel-frequency cepstral coefficients (MFCCs) and Deltas of MFCCs which are predominantly used in emotion recognition literature are looked forward. en_US
dc.language.iso en en_US
dc.publisher Faculty of Science, University of Ruhuna, Matara, Sri Lanka en_US
dc.subject Ensemble classifier and max-voting and boosting en_US
dc.subject Machine learning en_US
dc.subject Music emotion classification en_US
dc.title A classifier ensemble model for music emotion classification en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account