Learning Binary Codes for Efficient Large-Scale Music Similarity Search

Research output: Chapter in Book/Report/Conference proceedingConference proceedingspeer-review

Abstract

Content-based music similarity estimation provides a way to find songs in the unpopular “long tail” of commercial catalogs. However, state-of-the-art music similarity measures are too slow to apply to large databases, as they are based on finding nearest neighbors among very high-dimensional or non-vector song representations that are difficult to index. In this work, we adopt recent machine learning methods to map such song representations to binary codes. A linear scan over the codes quickly finds a small set of likely neighbors for a query to be refined with the original expensive similarity measure. Although search costs grow linearly with the collection size, we show that for commercialscale databases and two state-of-the-art similarity measures, this outperforms five previous attempts at approximate nearest neighbor search. When required to return 90% of true nearest neighbors, our method is expected to answer 4.2 1-NN queries or 1.3 50-NN queries per second on a collection of 30 million songs using a single CPU core; an up to 260 fold speedup over a full scan of 90% of the database.
Original languageEnglish
Title of host publicationProceedings of the 14th International Society for Music Information Retrieval Conference (ISMIR)
EditorsAlceu de Souza Britto, Fabien Gouyon, Simon Dixon
Pages581-586
Number of pages6
ISBN (Electronic)9780615900650
Publication statusPublished - Nov 2013

Fields of science

  • 202002 Audiovisual media
  • 102 Computer Sciences
  • 102001 Artificial intelligence
  • 102003 Image processing
  • 102015 Information systems

JKU Focus areas

  • Computation in Informatics and Mathematics
  • Engineering and Natural Sciences (in general)

Cite this