Abstract
This technical report describes the CP-JKU team’s submission for Task 1–Subtask A (Acoustic Scene Classification with Multiple Devices) and Subtask B (Low-Complexity Acoustic Scene Classification) of the DCASE-2020 challenge [1]. For Subtask 1. A, we provide our Receptive Field (RF) regularized CNN model as a baseline, and additionally explore the use of two different domain adaptation objectives in the form of the Maximum Mean Discrepancy (MMD) and the Sliced Wasserstein Distance (SWD). For Subtask 1. B, we investigate different parameter reduction methods such as Pruning, while maintaining the receptive field of the networks. Additionally, we incorporate a decomposed convolutional layer that reduces the number of non-zero parameters in our models while only slightly decreasing the accuracy, compared to the full-parameter baseline.
Original language | English |
---|---|
Number of pages | 4 |
Publication status | Published - 2020 |
Fields of science
- 202002 Audiovisual media
- 102 Computer Sciences
- 102001 Artificial intelligence
- 102003 Image processing
- 102015 Information systems
JKU Focus areas
- Digital Transformation