Abstract
A considerable number of MIR tasks requires annotations
at the note-level for the purpose of in-depth evaluation. A
common means of obtaining accurately annotated data corpora
is to start with a symbolic representation of a piece and
generate corresponding audio data. This study investigates
the effect of audio quality and source on the performance of
two representative MIR algorithms – Onset Detection and
Audio Alignment. Three kinds of audio material are compared:
piano pieces generated using a freely available software
synthesizer with its default instrument patches; a commercial
high-quality sample library; and audio recordings
made on a real (computer-controlled) grand piano. Also, the
effect of varying richness of artistic changes in tempo and
dynamics or natural asynchronies is examined. We show
that the algorithms’ performance on the different datasets
varies considerably, but synthesized audio, does not necessarily
yield better results.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 12th International Society for Music Information Retrieval Conference (ISMIR 2011) |
| Number of pages | 6 |
| Publication status | Published - 2011 |
Fields of science
- 102 Computer Sciences
- 102001 Artificial intelligence
- 102003 Image processing
JKU Focus areas
- Computation in Informatics and Mathematics
- Engineering and Natural Sciences (in general)
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver