A Formal Approach for Failure Detection in Large-Scale Distributed Systems Using Abstract State Machines

Andreea Buga, Sorana-Tania Nemes

Research output: Chapter in Book/Report/Conference proceeding › Conference proceedings › peer-review

Abstract

Large-scale distributed systems have been widely adopted in various domains due to their ability to compose services and resources tailored to user requirements. Such systems are characterized by high complexity and heterogeneity. Maintaining a high level of availability and normal execution of the components requires precise monitoring and robust adaptation. Monitors capture relevant metrics and transform them into meaningful knowledge, which is then used to justify adaptation actions. This paper proposes an Abstract State Machine model for defining monitoring processes that address failures and unavailability of system nodes. The specification is simulated and validated with the aid of the ASMETA toolset. The solution is complemented with a small ontology reflecting the structure of the system. We emphasize the role of formal models in achieving the proposed requirements.
Original language: English
Title of host publication: Database and Expert Systems Applications - 28th International Conference, DEXA 2017, Lyon, France, August 28-31, 2017, Proceedings, Part I
Editors: Springer
Publisher: Springer Verlag
Pages: 505-513
Number of pages: 9
Volume: 10438
ISBN (Print): 978-3-319-64467-7
Publication status: Published - 2017

Publication series

Name: Lecture Notes in Computer Science (LNCS)

Fields of science

  • 102011 Formal languages
  • 102014 Information design
  • 102015 Information systems
  • 102022 Software development
  • 102025 Distributed systems
  • 509018 Knowledge management

JKU Focus areas

  • Computation in Informatics and Mathematics
