ZERO-SHOT OPEN SET OBJECT DETECTION IN MUSIC ALBUM COVERS

  • Selene Andrade Lopez
  • , Arthur Flexer*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference proceedingspeer-review

Abstract

We applied zero-shot object detection to music album
cover art enabling a quantitative iconographic analysis of
visual Pop music culture. For this we constructed and
partly manually annotated a dataset of decades of USA
Billboard chart music. We first used automatic image cap-
tioning to yield candidate object classes. Next we input
these object classes and the album cover images to a pre-
trained zero-shot object classification model, allowing de-
tection of objects and their classes without any re-training.
We confirmed accuracy of our approach on a manually
labeled sub sample of our dataset. Our results give an
overview of what different objects are depicted on album
covers belonging to different music genres and types of
artists.
Original languageGerman (Austria)
Title of host publicationProceedings of the 22nd Sound and Music Computing Conference (SMC2025)
Pages57-64
Number of pages8
Edition1
ISBN (Electronic)9783200106420
DOIs
Publication statusPublished - 08 Jul 2025

Fields of science

  • 102003 Image processing
  • 202002 Audiovisual media
  • 102001 Artificial intelligence
  • 102015 Information systems
  • 102 Computer Sciences

JKU Focus areas

  • Digital Transformation

Cite this