Detecting semantic alignments between textual specifications and domain models
Loading...
Identifiers
Publication date
Reading date
Collaborators
Advisors
Tutors
Editors
Journal Title
Journal ISSN
Volume Title
Publisher
Elsevier
Share
Center
Department/Institute
Abstract
Context: Having domain models derived from textual specifications has proven to be very useful in the early
phases of software engineering. However, creating correct domain models and establishing clear links with the
textual specification is a challenging task, especially for novice modelers.
Objective: We propose an approach for determining the alignment between a partial domain model and a
textual specification.
Methods: To this aim, we use Natural Language Processing techniques to pre-process the text, generate an
artificial natural language specification for each model element, and then use an LLM to compare the generated
description with matched sentences from the original specification. Ultimately, our algorithm classifies each
model element as either aligned (i.e., correct), misaligned (i.e., incorrect), or unclassified (i.e., insufficient
evidence). Furthermore, it outputs the related sentences from the textual specification that provide the evidence
for the determined class.
Results: We have evaluated our approach on a set of examples from the literature containing diverse domains,
each consisting of a textual specification and a reference domain model, as well as on models containing
modeling errors that were systematically derived from the correct models through mutation. Our results
show that we are able to identify alignments and misalignments with a precision close to 1 and a recall
of approximately 78%, with execution times ranging from 18 s to 1 min per model element.
Conclusion: Since our algorithm almost never classifies model elements incorrectly, and is able to classify
over 3/4 of the model elements, it could be integrated into a modeling tool to provide positive feedback or
generate warnings, or employed for offline validation and quality assessment.
Description
Bibliographic citation
Shwetali Shimangaud, Lola Burgueño, Jörg Kienzle, Rijul Saini, Detecting semantic alignments between textual specifications and domain models, Information and Software Technology, Volume 197, 2026, 108154, ISSN 0950-5849, https://doi.org/10.1016/j.infsof.2026.108154.
Collections
Endorsement
Review
Supplemented By
Referenced by
Creative Commons license
Except where otherwised noted, this item's license is described as Attribution-NonCommercial 4.0 International













