Abstract
Social Media services gain increasing importance as a new data source for achieving Situation Awareness in disaster management. One crucial prerequisite is to automatically filter social media messages towards informativeness commonly realized by supervised machine learning. Since disaster situations are different, most classification approaches focus on informativeness classification of similar disasters. Thus their use is limited to particular disaster types, for instance earthquakes or floods, lacking general applicability. At the same time, how to get accurate informativeness classification for new disaster events is not yet totally understood due to variations in training data, features, classification algorithms and their settings. To address these issues, our contribution is threefold: First, a systematic and in-depth analysis of an existing twitter crisis data set is provided along four different dimensions in order to gain a comprehensive understanding of those characteristics indicating informative Tweets in disaster situations. On basis of these insights, a cross domain classifier is engineered, which is applicable not only across different disaster events but also across disaster events of different types. Finally, systematic classification experiments are conducted, demonstrating that our classification approach is more accurate than other disaster type specific ones.
Original language | English |
---|---|
Title of host publication | MEDES '18 Proceedings of the 10th International Conference on Management of Digital EcoSystems |
Publisher | ACM |
Pages | 183-190 |
Number of pages | 8 |
ISBN (Print) | 978-1-4503-5622-0 |
Publication status | Published - Sept 2018 |
Fields of science
- 202007 Computer integrated manufacturing (CIM)
- 102001 Artificial intelligence
- 102006 Computer supported cooperative work (CSCW)
- 102010 Database systems
- 102014 Information design
- 102015 Information systems
- 102016 IT security
- 102022 Software development
- 102025 Distributed systems
- 102033 Data mining
- 502007 E-commerce
- 505002 Data protection
- 506002 E-government
- 509018 Knowledge management
JKU Focus areas
- Computation in Informatics and Mathematics