
Music Genre Classification
Definition
Project Overview:
Automatic music classification/recognition is one such area which is being widely used in many
commercial applications also like Shzam, Google Play, Sony Track ID, etc. All these applications have
one thing in common that they aim to understand the semantic of the music rather than just curating
the metadata out of it. To develop an advanced and intelligent music player there is a large semantic
gap between audio signal processing and listeners” preference. Most of the cloud based music
providers use collaborative filtering and sound Meta data to recommend the next song to the
listeners. But they could not fulfil the gap of listener’s preference i.e genre, mood, lyrics,
instrumentation, rhythm, music records time etc.
Thus, we aim to focus the major area for preference of a music listener is genre of music. Music genre
classification is one such problem in MIR (Music Information Retrieval) which has been addressed by
a lot of signal processing techniques along with standard machine learning algorithms in place. But as
we have seen that the traditional signal processing techniques to extract features don’t add much
advantage when it comes to typical problems of clustering, classification because we might miss many
important features at the time of feature extraction and selection. As Deep Learning has proven its
importance in image processing to understand the features from the image pixels. Similarly, we here
aim to adopt a similar approach for audio processing.
Applications
1. Music Tag Recommender (Genre based tagging)
2. Personalized Song Recommender Systems (eg. spotify)
3. Music Teaching Applications to make the students understand, what genre are all about.
4. Automatic Meta Tagging of songs
5. Segmenting the genres in music players or in local PCs for organized storage.
6. Develop song indexing systems using genre as on the key to the hash or node to the btree.
Problem Statement:
To goal of this project is to develop an algorithm that can take a set of songs and assign genre to the
song. Music can be categorized into different genres, majorly ten genres that we aim to recognize
automatically from the input song.
As we have discrete number of classes that we need to classify thus, our problem is a classification
problem with 10 classes as genres. As a multiclass supervised learning classification problem we could
probably start with the fairly simple machine learning algorithms, as there were a lot of features that
we can’t engineer well without the good understanding on the Audio Signal Processing thus, deep
learning has been chosen to build out the classifier.
After training the classifier we aimed to build a system that can classify the song genre when someone
inputs the raw music file. There are various things that needs to be done to build such a system i.e
converting audio into *.au or *.wav format , resample it, extract the features that can be used to train
the model and then also for predicting the new inputs.