JOURNAL ARTICLE

Genre Classification using Feature Extraction and Deep Learning Techniques

Abstract

Music genre refers to categorisation of music on the basis of interaction between artists, market forces, and culture. It helps to organize music into collections by indicating similarities between compositions or musicians. Automatic genre classification is non-trivial as it is difficult to distinguish between different genres. Many times the boundaries are not clearly defined and genres are overlapping. In this paper we present a novel approach to classify a list of songs present on Spotify into mainly four genres-Christian, Metal, Country, Rap. Two different kinds of data - lyrics and album artwork are used for the classification process. Two Natural Language Processing techniques namely bag-Of-Words and Term Frequency-Inverse Document Frequency (TFIDF) are used to process the lyrical data. We apply machine learning algorithms like Random Forest, Support Vector Machine (SVM), Naive Bayes, Linear Support Vector Classifier (Linear SVC) and eXtreme Gradient Boosting (XGBoost) on lyrical data and Deep Convolutional Neural Network (CNN) on the album artwork to predict the genre. On application of machine learning algorithms on lyrical data obtained from bag-Of-Words a mean precision of 75.96% and a mean f-score of 75.92% is achieved. On application of the same set of algorithms on lyrical data obtained from TFIDF a mean precision of 76.85% and a mean f-score of 77.38% is achieved. In both cases XGBoost outperforms all the other algorithms giving a maximum precision of 79.30% and 80.16% and a maximum f-score of 79.6% and 84.09% for bag-Of-Words and TFIDF respectively. On application of deep neural network on album artwork, a precision of 82.46% and a f-score of 81.84% is achieved.

Keywords:
tf–idf Artificial intelligence Support vector machine Computer science Naive Bayes classifier Random forest Machine learning Lyrics Convolutional neural network Classifier (UML) Feature extraction Statistical classification Gradient boosting Artificial neural network Natural language processing Pattern recognition (psychology) Term (time)

Metrics

9
Cited By
0.54
FWCI (Field Weighted Citation Impact)
13
Refs
0.66
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music Technology and Sound Studies
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Neuroscience and Music Perception
Life Sciences →  Neuroscience →  Cognitive Neuroscience
© 2026 ScienceGate Book Chapters — All rights reserved.