- About MET-lab
- News
- People
- Research projects
- K-12 education initiatives
- Publications
Research Day Poster April 17, 2007
By: Donald S. Williamson
Advisor: Dr. Youngmoo Kim
Introduction
Features: Mel-Frequency Cepstral Coefficients (MFCCs)

The next two figures display the MFCC-spectrograms for the same songs as above from Bryan Adams and U2. MFCCs make it easier to distinguish between two songs.

MFCCs more accurately reflect the information perceived by our auditory system, using the mel scale. The mel scale is a nonlinear scale with higher resolution at low-frequencies, similar to human perception.
Feature Comparison: Kullback-Leibler (KL) Divergence

Graphical Feature Comparison


The KL divergence value between the two song distributions in the bottom image would be higher because their mean and variance differ significantly.
Similarity Assessment

First, the KL divergence values between each song in the data set were computed. From there the plots were generated by performing Multi-Dimensional Scaling (MDS) on the matrix of KL values.
Similarity Evaluations

This survey was used to evaluate the results of our automatic song-similarity algorithm. Human subjects were also asked to rate the specific similarities and differences between various songs.
Preliminary Results

Ideally a small KL value should result in a high user rating and visa versa. The correlation between our system and the user ratings is -0.35; -1 signifies perfect correlation.
Future Work