Learning General Audio Representations With Large-Scale Training of Patchout Audio Transformers

Khaled Koutini, Shahed Masoudian, Florian Schmid, Hamid Eghbal-Zadeh, Jan Schlüter, Gerhard Widmer

Research output: Contribution to journalArticlepeer-review

Original languageEnglish
Number of pages10
JournalProceedings of HEAR: HolisticEvaluation of Audio Representations, volume 166 ofProceedings of Machin
Publication statusPublished - 2023

Fields of science

  • 202002 Audiovisual media
  • 102 Computer Sciences
  • 102001 Artificial intelligence
  • 102003 Image processing
  • 102015 Information systems

JKU Focus areas

  • Digital Transformation

Cite this