Projects per year
Abstract
Score following is the process of tracking a musical performance
(audio) with respect to a known symbolic representation
(a score). We start this paper by formulating
score following as a multimodal Markov Decision Process,
the mathematical foundation for sequential decision making.
Given this formal definition, we address the score following
task with state-of-the-art deep reinforcement learning
(RL) algorithms such as synchronous advantage actor
critic (A2C). In particular, we design multimodal RL
agents that simultaneously learn to listen to music, read
the scores from images of sheet music, and follow the audio
along in the sheet, in an end-to-end fashion. All this
behavior is learned entirely from scratch, based on a weak
and potentially delayed reward signal that indicates to the
agent how close it is to the correct position in the score.
Besides discussing the theoretical advantages of this learning
paradigm, we show in experiments that it is in fact superior
compared to previously proposed methods for score
following in raw sheet music images.
Original language | English |
---|---|
Title of host publication | Proceedings of 19th International Society for Music Information Retrieval Conference (ISMIR) |
Number of pages | 8 |
Publication status | Published - 2018 |
Fields of science
- 202002 Audiovisual media
- 102 Computer Sciences
- 102001 Artificial intelligence
- 102003 Image processing
- 102015 Information systems
JKU Focus areas
- Computation in Informatics and Mathematics
- Engineering and Natural Sciences (in general)
Projects
- 1 Finished
-
Con Espressione - Getting at the Heart of Things: Towards Expressivity-aware Computer Systems in Music (ERC Advanced Grant)
Widmer, G. (PI)
01.01.2016 → 31.12.2021
Project: Funded research › EU - European Union