This page provides the experimental dataset for our research on music mood classification based on the fusion of the audio and lyric modalities of music.
The dataset consists of 777 music clips in 4 mood categories (angry, happy, relaxed and sad), of which 400 clips are used as the training set and the other 377 are used for testing. The distribution of clips across the 4 mood categories is shown in the following table:
| Mood Category | Training Samples | Testing Samples |
|---|---|---|
| angry | 100 | 71 |
| happy | 100 | 106 |
| relaxed | 100 | 101 |
| sad | 100 | 99 |
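
As a quick sanity check, the per-category counts in the table can be summed to confirm the 400/377 split. The snippet below is only a sketch: the dictionary name `split_counts` and its keys are chosen for illustration and are not part of the dataset.

```python
# Sample counts copied from the table above.
# The variable and key names are illustrative, not part of the dataset.
split_counts = {
    "angry": {"train": 100, "test": 71},
    "happy": {"train": 100, "test": 106},
    "relaxed": {"train": 100, "test": 101},
    "sad": {"train": 100, "test": 99},
}

train_total = sum(counts["train"] for counts in split_counts.values())
test_total = sum(counts["test"] for counts in split_counts.values())

print(train_total, test_total, train_total + test_total)  # 400 377 777
```

Note that the training set is balanced across categories while the testing set is not, which may matter when reporting per-category accuracy.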
For each music clip in the dataset, a plain text (.txt) file is provided containing every sentence of the lyrics along with its time tag (i.e. the time offset [hour:minute.second] of the sentence relative to the start of the music).
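
A minimal sketch of how such a lyric file might be read is given below. It assumes that each line begins with a bracketed time tag followed directly by the sentence text, and it names the three numeric parts of the tag after the description above. The function names, the `LyricLine` type and the sample lines are hypothetical; the actual files may differ in layout, so the pattern should be checked against a real sample before use.

```python
import re
from typing import List, NamedTuple, Optional

# Assumed layout: "[hour:minute.second]sentence text", one sentence per line.
# The field naming follows the description above and is not verified here.
TIME_TAG = re.compile(r"^\[(\d+):(\d+)\.(\d+)\]\s*(.*)$")


class LyricLine(NamedTuple):
    hour: int
    minute: int
    second: int
    text: str


def parse_lyric_line(line: str) -> Optional[LyricLine]:
    """Return the parsed tag and text, or None if the line has no time tag."""
    match = TIME_TAG.match(line.strip())
    if match is None:
        return None
    hour, minute, second, text = match.groups()
    return LyricLine(int(hour), int(minute), int(second), text)


def parse_lyrics(content: str) -> List[LyricLine]:
    """Parse every tagged line of a lyric file, skipping untagged lines."""
    parsed = (parse_lyric_line(line) for line in content.splitlines())
    return [entry for entry in parsed if entry is not None]


# Invented sample content, for illustration only.
sample = "[00:01.15]first sentence\n[00:01.20]second sentence\nuntagged line"
for entry in parse_lyrics(sample):
    print(entry)
```

Untagged lines are skipped rather than raising an error, so that blank lines or header lines in a file do not interrupt parsing.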
For copyright reasons, the audio data of the music clips cannot be provided here. The audio can, however, usually be found via web search engines and downloaded from the Internet using the information about each music clip given in the text file "info.txt" in each training/testing sample set, which comprises 4 information fields separated by colons: