A Professionally Annotated and Enriched Multimodal Data Set on Popular Music

Activity: Talk or presentationContributed talkunknown

Description

This paper presents the MusiClef data set, a multimodal data set of professionally annotated music. It includes editorial meta-data about songs, albums, and artists, as well as MusicBrainz identifiers to facilitate linking to other data sets. In addition, several audio features (generic low-level descriptors and state-of-the-art music features) are provided. Different sets of annotations as well as music context data – collaboratively generated user tags, web pages about artists and albums, and the annotation labels provided by music experts – are included too. Versions of this data set were used in the MusiCLEF 2011 and in the MusiClef 2012 evaluation campaigns for auto-tagging tasks. In this paper, we report on the motivation for the data set, on its composition, on related sets, and on the evaluation campaigns in which versions of the set were already used. These campaigns likewise represent one use case, i.e. music auto-tagging, of the data set. The complete data set is publicly available for download at http://www.cp.jku.at/musiclef.
Period01 Mar 2013
Event title4th ACM Multimedia Systems Conference (MMSys 2013),
Event typeConference
LocationNorwayShow on map

Fields of science

  • 102 Computer Sciences
  • 102001 Artificial intelligence
  • 102003 Image processing

JKU Focus areas

  • Computation in Informatics and Mathematics
  • Engineering and Natural Sciences (in general)