<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-06-05T14:41:20Z</responseDate><request verb="GetRecord" identifier="oai:riuma.uma.es:10630/22685" metadataPrefix="qdc">https://riuma.uma.es/rest/oai/request</request><GetRecord><record><header><identifier>oai:riuma.uma.es:10630/22685</identifier><datestamp>2026-02-03T12:04:32Z</datestamp><setSpec>com_10630_2254</setSpec><setSpec>col_10630_37959</setSpec></header><metadata><qdc:qualifieddc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dcterms="http://purl.org/dc/terms/" xmlns:doc="http://www.lyncode.com/xoai" xmlns:qdc="http://dspace.org/qualifieddc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://purl.org/dc/elements/1.1/ http://dublincore.org/schemas/xmls/qdc/2006/01/06/dc.xsd http://purl.org/dc/terms/ http://dublincore.org/schemas/xmls/qdc/2006/01/06/dcterms.xsd http://dspace.org/qualifieddc/ http://www.ukoln.ac.uk/metadata/dcmi/xmlschema/qualifieddc.xsd">
   <dc:title>Detection of tumor morphology mentions in clinical reports in Spanish using transformers</dc:title>
   <dc:creator>López-García, Guillermo</dc:creator>
   <dc:creator>Jerez-Aragonés, José Manuel</dc:creator>
   <dc:creator>Ribelles, Nuria</dc:creator>
   <dc:creator>Alba-Conejo, Emilio</dc:creator>
   <dc:creator>Veredas-Navarro, Francisco Javier</dc:creator>
   <dc:subject>Oncología</dc:subject>
   <dcterms:abstract>The aim of this study is to systematically examine the performance of transformer-based models for the detection of tumor morphology mentions in clinical documents in Spanish. For this purpose, we analyzed 3 transformer models supporting the Spanish language, namely multilingual BERT, BETO and XLM-RoBERTa. By means of a transfer- learning-based approach, the models were first pretrained on a collection of real-world oncology clinical cases with the goal of adapting trans- formers to the distinctive features of the Spanish oncology domain. The resulting models were further fine-tuned on the Cantemist-NER task, addressing the detection of tumor morphology mentions as a multi-class sequence-labeling problem. To evaluate the effectiveness of the proposed approach, we compared the obtained results by the domain-specific ver- sion of the examined transformers with the performance achieved by the general-domain version of the models. The results obtained in this pa- per empirically demonstrated that, for every analyzed transformer, the clinical version outperformed the corresponding general-domain model on the detection of tumor morphology mentions in clinical case reports in Spanish. Additionally, the combination of the transfer-learning-based approach with an ensemble strategy exploiting the predictive capabilities of the distinct transformer architectures yielded the best obtained results, achieving a precision value of 0.893, a recall of 0.887 and an F1-score of 0.89, which remarkably surpassed the prior state-of-the-art performance for the Cantemist-NER task.</dcterms:abstract>
   <dcterms:dateAccepted>2021-07-23T06:19:22Z</dcterms:dateAccepted>
   <dcterms:available>2021-07-23T06:19:22Z</dcterms:available>
   <dcterms:created>2021-07-23T06:19:22Z</dcterms:created>
   <dcterms:issued>2021</dcterms:issued>
   <dc:type>conference output</dc:type>
   <dc:identifier>https://hdl.handle.net/10630/22685</dc:identifier>
   <dc:language>eng</dc:language>
   <dc:relation>International Work Conference on Artificial Neural Networks (IWANN 2021)</dc:relation>
   <dc:relation>Madeira, Portugal</dc:relation>
   <dc:relation>Junio, 2021</dc:relation>
   <dc:rights>open access</dc:rights>
</qdc:qualifieddc>
</metadata></record></GetRecord></OAI-PMH>