madmom: a new Python Audio and Music Signal Processing Library

Research output: Chapter in Book/Report/Conference proceeding > Conference proceedings > peer-review

Abstract

In this paper, we present madmom, an open-source audio processing and music information retrieval (MIR) library written in Python. madmom features a concise, NumPy-compatible, object-oriented design with simple calling conventions and sensible default values for all parameters, which facilitates fast prototyping of MIR applications. Prototypes can be seamlessly converted into callable processing pipelines through madmom's concept of Processors, callable objects that run transparently on multiple cores. Processors can also be serialised, saved, and re-run to allow results to be easily reproduced anywhere. Apart from low-level audio processing, madmom puts emphasis on musically meaningful high-level features. Many of these incorporate machine learning techniques, and madmom provides a module that implements some methods commonly used in MIR, such as hidden Markov models and neural networks. Additionally, madmom comes with several state-of-the-art MIR algorithms for onset detection, beat, downbeat and meter tracking, tempo estimation, and chord recognition. These can easily be incorporated into bigger MIR systems or run as stand-alone programs.
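
The Processor idea described in the abstract (callable objects that can be chained into a pipeline, saved, and re-run) can be illustrated with a minimal, standard-library-only sketch. This is not madmom's API: the class names `Processor`, `Frame`, `Energy`, and `Sequential`, the `dump`/`load` helpers, and the JSON storage format are all hypothetical and chosen only for illustration. Multi-core execution is left out.

```python
import json


class Processor:
    """Hypothetical base class: a processing step that is also a callable."""

    def process(self, data):
        raise NotImplementedError

    def __call__(self, data):
        # Calling the object simply runs its processing step.
        return self.process(data)


class Frame(Processor):
    """Cut a signal (list of numbers) into overlapping frames."""

    def __init__(self, frame_size=4, hop_size=2):
        self.frame_size = frame_size
        self.hop_size = hop_size

    def process(self, signal):
        last_start = len(signal) - self.frame_size
        return [signal[i:i + self.frame_size]
                for i in range(0, last_start + 1, self.hop_size)]


class Energy(Processor):
    """Compute the sum of squared samples for every frame."""

    def process(self, frames):
        return [sum(x * x for x in frame) for frame in frames]


# Lookup table used to rebuild processors from their stored names (assumption).
REGISTRY = {"Frame": Frame, "Energy": Energy}


class Sequential(Processor):
    """Run several processors one after another."""

    def __init__(self, processors):
        self.processors = list(processors)

    def process(self, data):
        for processor in self.processors:
            data = processor(data)
        return data

    def dump(self):
        # Store each step as [class name, constructor parameters].
        return json.dumps([[type(p).__name__, vars(p)] for p in self.processors])

    @classmethod
    def load(cls, text):
        return cls([REGISTRY[name](**params) for name, params in json.loads(text)])


# Build a pipeline, run it, then restore it from its saved form and re-run it.
pipeline = Sequential([Frame(frame_size=4, hop_size=2), Energy()])
signal = [0, 1, 0, -1, 0, 2, 0, -2]
energies = pipeline(signal)

restored = Sequential.load(pipeline.dump())
assert restored(signal) == energies  # same parameters, same result
```

Because every step shares the same calling convention, a prototype written as nested function calls can be turned into a single reusable object without changing the individual steps, and storing the parameters alongside the step names is what makes a result reproducible elsewhere.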
Original language: German (Austria)
Title of host publication: Proceedings of the 24th ACM International Conference on Multimedia (ACMMM)
Number of pages: 5
Publication status: Published - Jul 2016

Fields of science

  • 202002 Audiovisual media
  • 102 Computer Sciences
  • 102001 Artificial intelligence
  • 102003 Image processing
  • 102015 Information systems

JKU Focus areas

  • Computation in Informatics and Mathematics
  • Engineering and Natural Sciences (in general)
