<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-05-28T06:27:16Z</responseDate><request verb="GetRecord" identifier="oai:riuma.uma.es:10630/22685" metadataPrefix="rdf">https://riuma.uma.es/rest/oai/request</request><GetRecord><record><header><identifier>oai:riuma.uma.es:10630/22685</identifier><datestamp>2026-02-03T12:04:32Z</datestamp><setSpec>com_10630_2254</setSpec><setSpec>col_10630_37959</setSpec></header><metadata><rdf:RDF xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:doc="http://www.lyncode.com/xoai" xmlns:ds="http://dspace.org/ds/elements/1.1/" xmlns:ow="http://www.ontoweb.org/ontology/1#" xmlns:rdf="http://www.openarchives.org/OAI/2.0/rdf/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/rdf/ http://www.openarchives.org/OAI/2.0/rdf.xsd">
   <ow:Publication rdf:about="oai:riuma.uma.es:10630/22685">
      <dc:title>Detection of tumor morphology mentions in clinical reports in Spanish using transformers</dc:title>
      <dc:creator>López-García, Guillermo</dc:creator>
      <dc:creator>Jerez-Aragonés, José Manuel</dc:creator>
      <dc:creator>Ribelles, Nuria</dc:creator>
      <dc:creator>Alba-Conejo, Emilio</dc:creator>
      <dc:creator>Veredas-Navarro, Francisco Javier</dc:creator>
      <dc:subject>Oncología</dc:subject>
      <dc:description>The aim of this study is to systematically examine the performance of transformer-based models for the detection of tumor morphology mentions in clinical documents in Spanish. For this purpose, we analyzed 3 transformer models supporting the Spanish language, namely multilingual BERT, BETO and XLM-RoBERTa. By means of a transfer- learning-based approach, the models were first pretrained on a collection of real-world oncology clinical cases with the goal of adapting trans- formers to the distinctive features of the Spanish oncology domain. The resulting models were further fine-tuned on the Cantemist-NER task, addressing the detection of tumor morphology mentions as a multi-class sequence-labeling problem. To evaluate the effectiveness of the proposed approach, we compared the obtained results by the domain-specific ver- sion of the examined transformers with the performance achieved by the general-domain version of the models. The results obtained in this pa- per empirically demonstrated that, for every analyzed transformer, the clinical version outperformed the corresponding general-domain model on the detection of tumor morphology mentions in clinical case reports in Spanish. Additionally, the combination of the transfer-learning-based approach with an ensemble strategy exploiting the predictive capabilities of the distinct transformer architectures yielded the best obtained results, achieving a precision value of 0.893, a recall of 0.887 and an F1-score of 0.89, which remarkably surpassed the prior state-of-the-art performance for the Cantemist-NER task.</dc:description>
      <dc:date>2021-07-23T06:19:22Z</dc:date>
      <dc:date>2021-07-23T06:19:22Z</dc:date>
      <dc:date>2021-07-22</dc:date>
      <dc:date>2021</dc:date>
      <dc:type>conference output</dc:type>
      <dc:identifier>https://hdl.handle.net/10630/22685</dc:identifier>
      <dc:language>eng</dc:language>
      <dc:relation>International Work Conference on Artificial Neural Networks (IWANN 2021)</dc:relation>
      <dc:relation>Madeira, Portugal</dc:relation>
      <dc:relation>Junio, 2021</dc:relation>
      <dc:rights>open access</dc:rights>
   </ow:Publication>
</rdf:RDF>
</metadata></record></GetRecord></OAI-PMH>