Advancing Deep Learning Solutions in the Era of Precision Medicine: From Omics Data to Clinical Narratives.

Loading...
Thumbnail Image

Identifiers

Publication date

Reading date

2024-02-06

Authors

López-García, Guillermo

Collaborators

Tutors

Editors

Journal Title

Journal ISSN

Volume Title

Publisher

UMA Editorial

Metrics

Google Scholar

Share

Research Projects

Organizational Units

Journal Issue

Abstract

The main objective of this PhD Thesis is the development of deep learning (DL)-based approaches to tackle inherently complex predictive problems in the domain of precision medicine. We have focused on two of the most important data modalities in personalized medicine nowadays: omics data and clinical narratives. For both, we have proposed and developed several strategies that have proven successful in overcoming the challenges encountered by DL methods when applied to clinical data. Transfer learning (TL) has emerged as a pivotal methodology in this Thesis. TL-based approaches have been developed to counterbalance the scarcity of annotated samples, crucial for training sophisticated DL algorithms on omics data and clinical notes. For instance, TL has been leveraged in proteomics to predict methionine oxidation sites utilizing extensive phosphorylation data. Moreover, in transcriptomics, TL methodologies have facilitated effective survival prediction on a specific cancer type using a large dataset of gene expression samples from thirty other tumor types. TL strategies have also played a significant role in adapting Transformer-based models to the particularities of clinical narratives. The resulting models have being assessed on multiple tasks in the clinical Natural Language Processing (NLP) domain. Across all predictive problems examined in this Thesis, the developed domain-specific transformers have established new state-of-the-art (SOTA) performances.

Description

Moreover, in this Thesis other methodologies beyond TL have been developed to address challenges associated with applying DL models to medical data. In this way, novel approaches have been designed for generating structured representation from omics data. Furthermore, we have developed strategies to enhance the explainability of DL models in processing clinical narratives. Finally, work has been performed to assess the robustness of advanced DL methods in true clinical settings, validating the viability of deploying these SOTA systems in real-world medical scenarios.

Bibliographic citation

Collections

Endorsement

Review

Supplemented By

Referenced by

Creative Commons license

Except where otherwised noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 Internacional