TY - UNPB
T1 - Optical Music Recognition of Jazz Lead Sheets
AU - Martinez-Sevilla, Juan Carlos
AU - Foscarin, Francesco
AU - Garcia-Iasci, Patricia
AU - Rizo, David
AU - Calvo-Zaragoza, Jorge
AU - Widmer, Gerhard
PY - 2025/8/31
Y1 - 2025/8/31
N2 - In this paper, we address the challenge of Optical Music Recognition (OMR) for handwritten jazz lead sheets, a widely used musical score type that encodes melody and chords. The task is challenging due to the presence of chords, a score component not handled by existing OMR systems, and the high variability and quality issues associated with handwritten images. Our contribution is two-fold. We present a novel dataset consisting of 293 handwritten jazz lead sheets of 163 unique pieces, amounting to 2021 total staves aligned with Humdrum **kern and MusicXML ground truth scores. We also supply synthetic score images generated from the ground truth. The second contribution is the development of an OMR model for jazz lead sheets. We discuss specific tokenisation choices related to our kind of data, and the advantages of using synthetic scores and pretrained models. We publicly release all code, data, and models.
AB - In this paper, we address the challenge of Optical Music Recognition (OMR) for handwritten jazz lead sheets, a widely used musical score type that encodes melody and chords. The task is challenging due to the presence of chords, a score component not handled by existing OMR systems, and the high variability and quality issues associated with handwritten images. Our contribution is two-fold. We present a novel dataset consisting of 293 handwritten jazz lead sheets of 163 unique pieces, amounting to 2021 total staves aligned with Humdrum **kern and MusicXML ground truth scores. We also supply synthetic score images generated from the ground truth. The second contribution is the development of an OMR model for jazz lead sheets. We discuss specific tokenisation choices related to our kind of data, and the advantages of using synthetic scores and pretrained models. We publicly release all code, data, and models.
U2 - 10.48550/arXiv.2509.05329
DO - 10.48550/arXiv.2509.05329
M3 - Preprint
T3 - arXiv.org
BT - Optical Music Recognition of Jazz Lead Sheets
ER -