Skip to main navigation Skip to search Skip to main content

Over-Parameterization and Generalization in Audio Classification

Research output: Chapter in Book/Report/Conference proceedingConference proceedingspeer-review

Abstract

Convolutional Neural Networks (CNNs) have been dominating classification tasks in various domains, such as machine vision, machine listening, and natural language processing. In machine listening, while generally exhibiting very good generalization capabilities, CNNs are sensitive to the specific audio recording device used, which has been recognized as a substantial problem in the acoustic scene classification (DCASE) community. In this study, we investigate the relationship between over-parameterization of acoustic scene classification models, and their resulting generalization abilities. Specifically, we test scaling CNNs in width and depth, under different conditions. Our results indicate that increasing width improves generalization to unseen devices, even without an increase in the number of parameters.
Original languageEnglish
Title of host publicationICML 2021 Workshop on Overparameterization: Pitfalls & Opportunities
Number of pages8
Publication statusPublished - 2021

Fields of science

  • 202002 Audiovisual media
  • 102 Computer Sciences
  • 102001 Artificial intelligence
  • 102003 Image processing
  • 102015 Information systems

JKU Focus areas

  • Digital Transformation

Cite this