Abstract
Content-based music similarity estimation provides a
way to find songs in the unpopular “long tail” of commercial catalogs. However, state-of-the-art music similarity measures are too slow to apply to large databases, as
they are based on finding nearest neighbors among very
high-dimensional or non-vector song representations that
are difficult to index.
In this work, we adopt recent machine learning methods
to map such song representations to binary codes. A linear scan over the codes quickly finds a small set of likely
neighbors for a query to be refined with the original expensive similarity measure. Although search costs grow linearly with the collection size, we show that for commercialscale databases and two state-of-the-art similarity measures,
this outperforms five previous attempts at approximate nearest neighbor search. When required to return 90% of true
nearest neighbors, our method is expected to answer 4.2
1-NN queries or 1.3 50-NN queries per second on a collection of 30 million songs using a single CPU core; an up to
260 fold speedup over a full scan of 90% of the database.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 14th International Society for Music Information Retrieval Conference (ISMIR) |
| Editors | Alceu de Souza Britto, Fabien Gouyon, Simon Dixon |
| Pages | 581-586 |
| Number of pages | 6 |
| ISBN (Electronic) | 9780615900650 |
| Publication status | Published - Nov 2013 |
Fields of science
- 202002 Audiovisual media
- 102 Computer Sciences
- 102001 Artificial intelligence
- 102003 Image processing
- 102015 Information systems
JKU Focus areas
- Computation in Informatics and Mathematics
- Engineering and Natural Sciences (in general)