Projects per year
Abstract
This report describes the CP-JKU team’s submissions for Task 1
(Acoustic Scene Classification, ASC) of the DCASE-2017 challenge,
and discusses some observations we made about the data and
the classification setup. Our approach is based on the methodology
that achieved ranks 1 and 2 in the 2016 ASC challenge: a fusion of
i-vector modelling using MFCC features derived from left and right
audio channels, and deep convolutional neural networks (CNNs)
trained on raw spectrograms. The data provided for the 2017 ASC
task presented some new challenges – in particular, audio stimuli
of very short duration. These will be discussed in detail, and our
measures for addressing them will be described. The result of our
experiments is a classification system that achieves classification
accuracies of around 90% on the provided development data, as estimated
via the prescribed four-fold cross-validation scheme. On
the unseen evaluation data, our best performing method achieved
73.8% and 5th place in the team ranking.
Original language | English |
---|---|
Title of host publication | Proceedings of DCASE 2017 |
Number of pages | 5 |
Publication status | Published - 2017 |
Fields of science
- 202002 Audiovisual media
- 102 Computer Sciences
- 102001 Artificial intelligence
- 102003 Image processing
- 102015 Information systems
JKU Focus areas
- Computation in Informatics and Mathematics
- Engineering and Natural Sciences (in general)
Projects
- 1 Finished
-
Strategic FExFE Project on Deep Learning
Dorfer, M. (Researcher), Eghbal-Zadeh, H. (Researcher) & Widmer, G. (PI)
01.05.2015 → 31.12.2018
Project: Funded research › FFG - Austrian Research Promotion Agency