Acoustic Scene Classification with Mismatched Recording Devices

Paul Primus, David Eitelsebner

Research output: Chapter in Book/Report/Conference proceedingConference proceedings

Abstract

This technical report describes CP-JKU Student team’s approach for Task 1 - Subtask B of the DCASE 2019 challenge. In this context, we propose two loss functions for domain adaptation to learn invariant representations given time-aligned recordings. We show that these methods improve the classification performance on our cross-validation, as well as performance on the Kaggle leader board, up to three percentage points compared to our baseline model. Our best scoring submission is an ensemble of eight classifiers.
Original languageEnglish
Title of host publicationDCASE 2019 Technical Report
Number of pages4
Publication statusPublished - 2020

Fields of science

  • 202002 Audiovisual media
  • 102 Computer Sciences
  • 102001 Artificial intelligence
  • 102003 Image processing
  • 102015 Information systems

JKU Focus areas

  • Digital Transformation

Cite this