Informed Selection of Frames for Music Similarity Computation.

Research output: Chapter in Book/Report/Conference proceedingConference proceedingspeer-review

Abstract

In this paper we present a new method to compute frame based audio similarities, based on nearest neighbour density estimation. We do not recommend it is as a practical method for large collections because of the high runtime. Rather, we use this new method for a detailed analysis to get a deeper insight on how a bag of frames approach (BOF) determines similarities among songs, and in particular, to identify those audio frames that make two songs similar from a machine’s point of view. Our analysis reveals that audio frames of very low energy, which are of course not the most salient with respect to human perception, have a surprisingly big influence on current similarity measures. Based on this observation we propose to remove these low-energy frames before computing song models and show, via classification experiments, that the proposed frame selection strategy improves the audio similarity measure.
Original languageEnglish
Title of host publicationProceedings of the 12th International Conference on Digital Audio Effects (DAFx-09)
Pages146-149
Number of pages4
Publication statusPublished - 2009

Publication series

NameProceedings of the International Conference on Digital Audio Effects, DAFx
ISSN (Print)2413-6700

Fields of science

  • 102 Computer Sciences
  • 102001 Artificial intelligence
  • 102003 Image processing
  • 102015 Information systems
  • 202002 Audiovisual media

Cite this