Abstract
In this paper, we present a similarity measure for music
artists based on search results of Google queries. Co-occurrences
of artist names on web pages are analyzed to measure
how often two artists are mentioned together on the
same web page. We estimate conditional probabilities using
the extracted page count. These conditional probabilities
give a similarity measure which is evaluated using a data
set containing 224 artists from 14 genres. For evaluation,
we use two different methods, intra-/intergroup-similarities
and k-Nearest Neighbors classification. Furthermore, a confidence
filter and combinations of the results gained from
three different query settings are tested. It is shown that
these enhancements can raise the performance of our similarity
measure. Comparing our results to those of similar
approaches show that our approach, though being quite simple,
performs well and can be used as a similarity measure
that incorporates social knowledge.
Original language | English |
---|---|
Title of host publication | Proceedings of the Fourth International Workshop on Content-Based Multimedia Indexing (CBMI'05), Riga, Lativa |
Number of pages | 8 |
Publication status | Published - 2005 |
Fields of science
- 102 Computer Sciences
- 102001 Artificial intelligence
- 102003 Image processing
- 102015 Information systems
- 202002 Audiovisual media