Skip to main navigation Skip to search Skip to main content

Supporting High-Level to Low-Level Requirements Coverage Reviewing with Large Language Models

  • Anamaria-Roberta Preda
  • , Atif Mashkoor
  • , Christoph Mayr-Dorn
  • , Alexander Egyed

Research output: Chapter in Book/Report/Conference proceedingConference proceedingspeer-review

Abstract

Refining high-level requirements into low-level requirements is a common task, especially in safety-critical systems engineering. The objective is to describe every important aspect of the high-level requirement in a low-level requirement, ensuring a complete and correct implementation of the system’s features. To this end, standards and regulations for safety-critical systems require reviewing the coverage of high-level requirements by all its low-level requirements to ensure no missing aspects. Supporting automatic requirements coverage reviewing is difficult as high-level and low-level requirements reside at different levels of abstraction, are natural language heavy, and often use different vocabulary. Unfortunately, this problem has received noticeably little attention from the research community. With the rise of Large Language Models (LLMs) that have been trained on a huge corpus of text and hence might ``understand'' the context of high-level and low-level requirements, we would expect to be able to address this problem. This paper presents the first study to explore the performance of LLMs to check requirements coverage. For evaluation, we selected requirements from five publicly available data sets and evaluated whether GPT-3.5 and GPT-4 can detect whether the traced low-level requirements cover a high-level requirement. While GPT-3.5 with a zero-shot plus explanation prompting strategy correctly classifies covered high-level requirements across four projects, it correctly identifies incomplete coverage due to a single removed low-level requirements with 99.7% recall across the complete evaluation data set.
Original languageEnglish
Title of host publicationMining Software Repositories (MSR) conference, Lisbon, Portugal
Number of pages12
Publication statusPublished - Apr 2024

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 4 - Quality Education
    SDG 4 Quality Education

Fields of science

  • 102 Computer Sciences
  • 102022 Software development
  • 101019 Stochastics
  • 102003 Image processing
  • 103029 Statistical physics
  • 101018 Statistics
  • 101017 Game theory
  • 102001 Artificial intelligence
  • 202017 Embedded systems
  • 101016 Optimisation
  • 101015 Operations research
  • 101014 Numerical mathematics
  • 101029 Mathematical statistics
  • 101028 Mathematical modelling
  • 101026 Time series analysis
  • 101024 Probability theory
  • 102032 Computational intelligence
  • 102004 Bioinformatics
  • 102013 Human-computer interaction
  • 101027 Dynamical systems
  • 305907 Medical statistics
  • 101004 Biomathematics
  • 305905 Medical informatics
  • 101031 Approximation theory
  • 102033 Data mining
  • 305901 Computer-aided diagnosis and therapy
  • 102019 Machine learning
  • 106007 Biostatistics
  • 102018 Artificial neural networks
  • 106005 Bioinformatics
  • 202037 Signal processing
  • 202036 Sensor systems
  • 202035 Robotics

JKU Focus areas

  • Digital Transformation
  • Sustainable Development: Responsible Technologies and Management

Cite this