Evaluating LLMs on Formal Models Coursework

Activity: Talk or presentationContributed talkscience-to-science

Description

The use of Large Language Models is on the rise in educational
settings such as creating teaching material, helping with
assignments or solving exercises. At the same time these new
possibilities bring new challenges in regards to the understanding
of the capabilities of LLMs. To offer a glimpse into the current
capabilities of LLMs in regards to solving coursework, we developed
a set of questions from our Bachelor's course Formal Models, which
allows us to rate and compare the generated answers of LLMs. We
developed an evaluation template to rate the responses, which allows
us to not only compare the end result of given answers, but also
intermediate steps and understanding of given tasks. The evaluations
are presented on an interactive website, which allows us to offer
new insights and enables a faster response to newly released models.
Period02 Aug 2025
Event titleThEdu'25
Event typeWorkshop
LocationStuttgart, GermanyShow on map
Degree of RecognitionInternational

Fields of science

  • 101013 Mathematical logic
  • 102031 Theoretical computer science
  • 603109 Logic
  • 102011 Formal languages
  • 102022 Software development
  • 102001 Artificial intelligence
  • 102030 Semantic technologies
  • 102 Computer Sciences

JKU Focus areas

  • Digital Transformation