A Framework for Preprocessing Multivariate, Topology-Aware Time Series and Event Data in a Multi-System Environment

Research output: Chapter in Book/Report/Conference proceedingConference proceedingspeer-review

Abstract

Monitoring and predicting quality properties of complex systems relies on collecting and analyzing huge amounts of data at run time. Machine learning is frequently adopted to analyze time series and event data, often coming from multiple systems. In such a context, extracting and preprocessing data is an essential but also highly tedious task. In this paper, we thus present an offline preprocessing framework that can handle multivariate time series and event data in a multisystem environment that also takes the system's topology into account. After a discussion of the key requirements, we present the architecture and implementation of our highly configurable and easy-to-use framework. We demonstrate how the framework allows to extract data and to yield output files for machine learning via configuration settings. In a two-step evaluation, we investigate the framework's usefulness and scalability. We demonstrate the usefulness in an event prediction case study of real-world multi-system time series data. Our results show the significant impact of different data preprocessing settings on machine learning. Our experiments further demonstrate that processing performance scales linearly with respect to the number of systems and time series.
Original languageEnglish
Title of host publicationProceedings of the 19th IEEE International Symposium on High Assurance Systems Engineering (HASE'19)
EditorsCongfeng Jiang, Vu Nguyen, Dongjin Yu
PublisherIEEE
Pages115-122
Number of pages8
ISBN (Electronic)9781538685402
DOIs
Publication statusPublished - Mar 2019

Fields of science

  • 102 Computer Sciences
  • 102022 Software development
  • 102025 Distributed systems

JKU Focus areas

  • Digital Transformation

Cite this