<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-06-01T08:40:10Z</responseDate><request verb="GetRecord" identifier="oai:riuma.uma.es:10630/32630" metadataPrefix="qdc">https://riuma.uma.es/rest/oai/request</request><GetRecord><record><header><identifier>oai:riuma.uma.es:10630/32630</identifier><datestamp>2026-02-03T11:05:42Z</datestamp><setSpec>com_10630_2254</setSpec><setSpec>col_10630_37953</setSpec></header><metadata><qdc:qualifieddc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dcterms="http://purl.org/dc/terms/" xmlns:doc="http://www.lyncode.com/xoai" xmlns:qdc="http://dspace.org/qualifieddc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://purl.org/dc/elements/1.1/ http://dublincore.org/schemas/xmls/qdc/2006/01/06/dc.xsd http://purl.org/dc/terms/ http://dublincore.org/schemas/xmls/qdc/2006/01/06/dcterms.xsd http://dspace.org/qualifieddc/ http://www.ukoln.ac.uk/metadata/dcmi/xmlschema/qualifieddc.xsd">
   <dc:title>Towards standarized benchmarks of LLMs in software modeling tasks: a conceptual framework</dc:title>
   <dc:creator>Cámara-Moreno, Javier</dc:creator>
   <dc:creator>Burgueño-Caballero, Lola</dc:creator>
   <dc:creator>Troya-Castilla, Javier</dc:creator>
   <dc:subject>Empresas - Gestión</dc:subject>
   <dcterms:abstract>The integration of Large Language Models (LLMs) in software modeling tasks presents both opportunities and challenges.&#xd;
This Expert Voice addresses a significant gap in the evaluation of these models, advocating for the need for standardized&#xd;
benchmarking frameworks. Recognizing the potential variability in prompt strategies, LLM outputs, and solution space, we&#xd;
propose a conceptual framework to assess their quality in software model generation. This framework aims to pave the way&#xd;
for standardization of the benchmarking process, ensuring consistent and objective evaluation of LLMs in software modeling.&#xd;
Our conceptual framework is illustrated using UML class diagrams as a running example.</dcterms:abstract>
   <dcterms:dateAccepted>2024-09-18T12:15:35Z</dcterms:dateAccepted>
   <dcterms:available>2024-09-18T12:15:35Z</dcterms:available>
   <dcterms:created>2024-09-18T12:15:35Z</dcterms:created>
   <dcterms:issued>2024-09-03</dcterms:issued>
   <dc:type>journal article</dc:type>
   <dc:identifier>Cámara, J., Burgueño, L. &amp; Troya, J. Towards standarized benchmarks of LLMs in software modeling tasks: a conceptual framework. Softw Syst Model (2024). https://doi.org/10.1007/s10270-024-01206-9</dc:identifier>
   <dc:identifier>https://hdl.handle.net/10630/32630</dc:identifier>
   <dc:identifier>10.1007/s10270-024-01206-9</dc:identifier>
   <dc:language>eng</dc:language>
   <dc:rights>http://creativecommons.org/licenses/by/4.0/</dc:rights>
   <dc:rights>open access</dc:rights>
   <dc:rights>Atribución 4.0 Internacional</dc:rights>
   <dc:publisher>Springer</dc:publisher>
</qdc:qualifieddc>
</metadata></record></GetRecord></OAI-PMH>