Abstract
We applied zero-shot object detection to music album
cover art enabling a quantitative iconographic analysis of
visual Pop music culture. For this we constructed and
partly manually annotated a dataset of decades of USA
Billboard chart music. We first used automatic image cap-
tioning to yield candidate object classes. Next we input
these object classes and the album cover images to a pre-
trained zero-shot object classification model, allowing de-
tection of objects and their classes without any re-training.
We confirmed accuracy of our approach on a manually
labeled sub sample of our dataset. Our results give an
overview of what different objects are depicted on album
covers belonging to different music genres and types of
artists.
cover art enabling a quantitative iconographic analysis of
visual Pop music culture. For this we constructed and
partly manually annotated a dataset of decades of USA
Billboard chart music. We first used automatic image cap-
tioning to yield candidate object classes. Next we input
these object classes and the album cover images to a pre-
trained zero-shot object classification model, allowing de-
tection of objects and their classes without any re-training.
We confirmed accuracy of our approach on a manually
labeled sub sample of our dataset. Our results give an
overview of what different objects are depicted on album
covers belonging to different music genres and types of
artists.
| Original language | German (Austria) |
|---|---|
| Title of host publication | Proceedings of the 22nd Sound and Music Computing Conference (SMC2025) |
| Pages | 57-64 |
| Number of pages | 8 |
| Edition | 1 |
| ISBN (Electronic) | 9783200106420 |
| DOIs | |
| Publication status | Published - 08 Jul 2025 |
Fields of science
- 102003 Image processing
- 202002 Audiovisual media
- 102001 Artificial intelligence
- 102015 Information systems
- 102 Computer Sciences
JKU Focus areas
- Digital Transformation