Towards standarized benchmarks of LLMs in software modeling tasks: a conceptual framework

dc.centro: E.T.S.I. Informática [es_ES]
dc.contributor.author: Cámara-Moreno, Javier
dc.contributor.author: Burgueño-Caballero, Lola
dc.contributor.author: Troya-Castilla, Javier
dc.date.accessioned: 2024-09-18T12:15:35Z
dc.date.available: 2024-09-18T12:15:35Z
dc.date.issued: 2024-09-03
dc.departamento: Instituto de Tecnología e Ingeniería del Software de la Universidad de Málaga
dc.description.abstract: The integration of Large Language Models (LLMs) in software modeling tasks presents both opportunities and challenges. This Expert Voice addresses a significant gap in the evaluation of these models, advocating for the need for standardized benchmarking frameworks. Recognizing the potential variability in prompt strategies, LLM outputs, and solution space, we propose a conceptual framework to assess their quality in software model generation. This framework aims to pave the way for standardization of the benchmarking process, ensuring consistent and objective evaluation of LLMs in software modeling. Our conceptual framework is illustrated using UML class diagrams as a running example. [es_ES]
dc.description.sponsorship: Funding for open access charge: Universidad de Málaga / CBUA [es_ES]
dc.identifier.citation: Cámara, J., Burgueño, L. & Troya, J. Towards standarized benchmarks of LLMs in software modeling tasks: a conceptual framework. Softw Syst Model (2024). https://doi.org/10.1007/s10270-024-01206-9 [es_ES]
dc.identifier.doi: 10.1007/s10270-024-01206-9
dc.identifier.uri: https://hdl.handle.net/10630/32630
dc.language.iso: eng [es_ES]
dc.publisher: Springer [es_ES]
dc.rights: Attribution 4.0 International [*]
dc.rights.accessRights: open access [es_ES]
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/ [*]
dc.subject: Companies - Management [es_ES]
dc.subject.other: Modeling [es_ES]
dc.subject.other: LLMs [es_ES]
dc.subject.other: Benchmarking [es_ES]
dc.title: Towards standarized benchmarks of LLMs in software modeling tasks: a conceptual framework [es_ES]
dc.type: journal article [es_ES]
dc.type.hasVersion: VoR [es_ES]
dspace.entity.type: Publication
relation.isAuthorOfPublication: 20052283-aeaf-42b8-85ee-52d9589e5797
relation.isAuthorOfPublication: 31808e70-d2ec-4318-8ead-dded38954d40
relation.isAuthorOfPublication: 3ea98dd7-8c4e-4639-9c87-2228ad0f56be
relation.isAuthorOfPublication.latestForDiscovery: 20052283-aeaf-42b8-85ee-52d9589e5797

Files

Original bundle

Name: s10270-024-01206-9 (1).pdf
Size: 1.41 MB
Format: Adobe Portable Document Format