Abstract
In this paper, we present madmom, an open-source audio processing and music information retrieval (MIR) library written in Python. madmom features a concise, NumPy-compatible, object-oriented design with simple calling conventions and sensible default values for all parameters, which facilitates fast prototyping of MIR applications. Prototypes can be seamlessly converted into callable processing pipelines through madmom's concept of Processors, callable objects that run transparently on multiple cores. Processors can also be serialised, saved, and re-run to allow results to be easily reproduced anywhere.
Apart from low-level audio processing, madmom puts emphasis on musically meaningful high-level features. Many of these incorporate machine learning techniques, and madmom provides a module that implements some methods commonly used in MIR, such as hidden Markov models and neural networks. Additionally, madmom comes with several state-of-the-art MIR algorithms for onset detection, beat, downbeat and meter tracking, tempo estimation, and chord recognition. These can easily be incorporated into bigger MIR systems or run as stand-alone programs.
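The idea of chaining callable objects into a pipeline that can be saved and re-run can be illustrated with a minimal, standard-library-only sketch. It does not use madmom and is not its API: the classes `Scale`, `FrameEnergy`, and `Pipeline`, the `REGISTRY` lookup, and the JSON-based `dumps`/`loads` methods are all hypothetical names chosen for illustration, and multi-core execution is left out.

```python
import json


class Scale:
    """Hypothetical processor: multiply every sample by a fixed gain."""

    def __init__(self, gain=1.0):
        self.gain = gain

    def __call__(self, samples):
        return [s * self.gain for s in samples]


class FrameEnergy:
    """Hypothetical processor: sum of squares per non-overlapping frame."""

    def __init__(self, frame_size=4):
        self.frame_size = frame_size

    def __call__(self, samples):
        n = self.frame_size
        # Ignore a trailing partial frame, if any.
        usable = len(samples) - len(samples) % n
        return [sum(s * s for s in samples[i:i + n]) for i in range(0, usable, n)]


# Maps class names to classes so a saved pipeline can be rebuilt (assumption:
# every processor stores only JSON-friendly constructor parameters).
REGISTRY = {cls.__name__: cls for cls in (Scale, FrameEnergy)}


class Pipeline:
    """Hypothetical pipeline: calls each processor on the previous output."""

    def __init__(self, *processors):
        self.processors = list(processors)

    def __call__(self, data):
        for processor in self.processors:
            data = processor(data)
        return data

    def dumps(self):
        # Store each processor as [class name, parameters].
        return json.dumps([[type(p).__name__, vars(p)] for p in self.processors])

    @classmethod
    def loads(cls, text):
        return cls(*(REGISTRY[name](**params) for name, params in json.loads(text)))


pipeline = Pipeline(Scale(gain=0.5), FrameEnergy(frame_size=4))
signal = list(range(8))
energy = pipeline(signal)

# Serialise, restore, and re-run: the restored pipeline reproduces the result.
restored = Pipeline.loads(pipeline.dumps())
assert restored(signal) == energy
```

Because each stage is just a callable with its parameters held as attributes, a prototype written as separate function calls can be wrapped into one reusable object, and the stored parameter list is enough to reproduce a run elsewhere.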
| Original language | German (Austria) |
|---|---|
| Title of host publication | Proceedings of the 24th ACM International Conference on Multimedia (ACMMM) |
| Number of pages | 5 |
| Publication status | Published - Jul 2016 |
Fields of science
- 202002 Audiovisual media
- 102 Computer Sciences
- 102001 Artificial intelligence
- 102003 Image processing
- 102015 Information systems
JKU Focus areas
- Computation in Informatics and Mathematics
- Engineering and Natural Sciences (in general)