Cross-domain informativeness classification for disaster situations

Research output: Chapter in Book/Report/Conference proceedingConference proceedingspeer-review

Abstract

Social Media services gain increasing importance as a new data source for achieving Situation Awareness in disaster management. One crucial prerequisite is to automatically filter social media messages towards informativeness commonly realized by supervised machine learning. Since disaster situations are different, most classification approaches focus on informativeness classification of similar disasters. Thus their use is limited to particular disaster types, for instance earthquakes or floods, lacking general applicability. At the same time, how to get accurate informativeness classification for new disaster events is not yet totally understood due to variations in training data, features, classification algorithms and their settings. To address these issues, our contribution is threefold: First, a systematic and in-depth analysis of an existing twitter crisis data set is provided along four different dimensions in order to gain a comprehensive understanding of those characteristics indicating informative Tweets in disaster situations. On basis of these insights, a cross domain classifier is engineered, which is applicable not only across different disaster events but also across disaster events of different types. Finally, systematic classification experiments are conducted, demonstrating that our classification approach is more accurate than other disaster type specific ones.
Original languageEnglish
Title of host publicationMEDES '18 Proceedings of the 10th International Conference on Management of Digital EcoSystems
PublisherACM
Pages183-190
Number of pages8
ISBN (Print)978-1-4503-5622-0
Publication statusPublished - Sept 2018

Fields of science

  • 202007 Computer integrated manufacturing (CIM)
  • 102001 Artificial intelligence
  • 102006 Computer supported cooperative work (CSCW)
  • 102010 Database systems
  • 102014 Information design
  • 102015 Information systems
  • 102016 IT security
  • 102022 Software development
  • 102025 Distributed systems
  • 102033 Data mining
  • 502007 E-commerce
  • 505002 Data protection
  • 506002 E-government
  • 509018 Knowledge management

JKU Focus areas

  • Computation in Informatics and Mathematics

Cite this