Coughing-based Recognition of Covid-19 with Spatial Attentive ConvLSTM Recurrent Neural Networks

  • T. Yan
  • , H. Meng
  • , Emilia Parada-Cabaleiro
  • , S. Liu
  • , M. Song
  • , B. Schuller

Research output: Chapter in Book/Report/Conference proceedingConference proceedingspeer-review

Abstract

abstract, if available: The rapid emergence of COVID-19 has become a major public health threat around the world. Although early detection is crucial to reduce its spread, the existing diagnostic methods are still insufficient in bringing the pandemic under control. Thus, more sophisticated systems, able to easily identify the infection from a larger variety of symptoms, such as cough, are urgently needed. Deep learning models can indeed convey numerous- signal features relevant to fight against the disease; yet, the performance of state-of-the-art approaches is still severely restricted by the feature information loss typically due to the high number of layers. To mitigate this phenomenon, identifying the most relevant feature areas by drawing into attention mechanisms becomes essential. In this paper, we introduce Spatial Attentive ConvLSTM-RNN (SACRNN), a novel algorithm that is using Convolutional Long-Short Term Memory Recurrent Neural Networks with embedded attention that has the ability to identify the most valuable features. The promising results achieved by the fusion between the proposed model and a conventional Attentive Convolutional Recurrent Neural Network, on the automatic recognition of COVID-19 coughing (73.2% of Unweighted Average Recall) show the great potential of the presented approach in developing efficient solutions to defeat the pandemic.
Original languageEnglish
Title of host publicationProceedings of the Annual Conference of the International Speech Communication Association (Interspeech)
Pages3681-3685
Number of pages5
ISBN (Electronic)9781713836902
DOIs
Publication statusPublished - 2021

Publication series

NameProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume5
ISSN (Print)2308-457X
ISSN (Electronic)2958-1796

Fields of science

  • 202002 Audiovisual media
  • 102 Computer Sciences
  • 102001 Artificial intelligence
  • 102003 Image processing
  • 102015 Information systems

JKU Focus areas

  • Digital Transformation

Cite this