Abstract
Monitoring and predicting quality properties of complex systems relies on collecting and analyzing huge amounts of data at run time. Machine learning is frequently adopted to analyze time series and event data, often coming from multiple systems. In such a context, extracting and preprocessing data is an essential but also highly tedious task. In this paper, we thus present an offline preprocessing framework that can handle multivariate time series and event data in a multisystem environment that also takes the system's topology into account. After a discussion of the key requirements, we present the architecture and implementation of our highly configurable and easy-to-use framework. We demonstrate how the framework allows to extract data and to yield output files for machine learning via configuration settings. In a two-step evaluation, we investigate the framework's usefulness and scalability. We demonstrate the usefulness in an event prediction case study of real-world multi-system time series data. Our results show the significant impact of different data preprocessing settings on machine learning. Our experiments further demonstrate that processing performance scales linearly with respect to the number of systems and time series.
| Originalsprache | Englisch |
|---|---|
| Titel | Proceedings of the 19th IEEE International Symposium on High Assurance Systems Engineering (HASE'19) |
| Herausgeber*innen | Congfeng Jiang, Vu Nguyen, Dongjin Yu |
| Verlag | IEEE |
| Seiten | 115-122 |
| Seitenumfang | 8 |
| ISBN (elektronisch) | 9781538685402 |
| DOIs | |
| Publikationsstatus | Veröffentlicht - 22 März 2019 |
Publikationsreihe
| Name | Proceedings of IEEE International Symposium on High Assurance Systems Engineering |
|---|---|
| Band | 2019-January |
| ISSN (Print) | 1530-2059 |
Wissenschaftszweige
- 102 Informatik
- 102022 Softwareentwicklung
- 102025 Verteilte Systeme
JKU-Schwerpunkte
- Digital Transformation
Projekte
- 2 Abgeschlossen
-
Application Performance Management (M03)
Bitto, V. (Forscher*in), Chalupar, P. (Forscher*in), Gnedt, D. (Forscher*in), Hofer, P. (Forscher*in), Kahlhofer, M. (Forscher*in), Lengauer, P. (Forscher*in), Makor, L. (Forscher*in), Schörgenhumer, A. (Forscher*in), Weninger, M. (Forscher*in) & Grünbacher, P. (Projektleiter*in)
01.02.2013 → 31.08.2020
Projekt: Geförderte Forschung › Andere Geldgeber
-
Christian Doppler Labor für Monitoring and Evolution of Very-Large-Scale Software Systems
Grünbacher, P. (Projektleiter*in)
01.02.2013 → 31.08.2020
Projekt: Geförderte Forschung › CDG - Christian Doppler Forschungsgesellschaft
Dieses zitieren
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver