High-performance computing in bioinformatics: accelerating de novo assembly.

dc.contributor.advisorPlata-González, Óscar Guillermo
dc.contributor.advisorLarrosa-Jiménez, Rafael
dc.contributor.authorEspinosa García, Elena María
dc.date.accessioned2025-11-21T08:43:37Z
dc.date.available2025-11-21T08:43:37Z
dc.date.created2025
dc.date.issued2025
dc.date.submitted2025-10-10
dc.description.abstractAdvances in sequencing technologies—currently divided into short-read and long-read platforms—have enabled the study of genomes from a large number of organisms with high coverage and resolution. However, those advances have also created an urgent need for efficient, scalable, and specialized software solutions that are continually evolving to keep pace with the rapid improvements in sequencing methods. In particular, de novo assembly has been one of the most significant challenges in bioinformatics due to its complexity and high computational cost. However, advances in sequencing technologies have significantly improved the accuracy of genome assemblies and made the process more feasible. In parallel, new data structures, efficient algorithms, and computational techniques have been developed to address these challenges. Despite these improvements, de novo assembly still demands substantial computational resources, and remains an active area of research with ongoing development of diverse methods and strategies. In this thesis, we conduct an in-depth study of de novo genome assembly and its main bottlenecks. Building on these findings, we propose software- and hardware-level solutions to accelerate de novo genome assembly and make its computation as energy-efficient as possible. To this end, this work includes a comprehensive review of de novo genome assembly, a benchmark analysis of the most widely used assemblers to identify key bottlenecks, and the proposal of two acceleration tools: SeqMatcher and GenTEK.es_ES
dc.identifier.urihttps://hdl.handle.net/10630/40862
dc.language.isoenges_ES
dc.publisherUMA Editoriales_ES
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.accessRightsopen accesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectArquitectura de ordenadores - Tesis doctoraleses_ES
dc.subjectBioinformáticaes_ES
dc.subjectDispositivos FPGAses_ES
dc.subjectMatrices lógicas programables por el usuarioes_ES
dc.subjectGenómicaes_ES
dc.subjectSecuenciación de ácidos nucleicoses_ES
dc.subject.otherApproximate string matchinges_ES
dc.subject.otherAVX-512es_ES
dc.subject.otherFPGAes_ES
dc.subject.otherGenome assemblyes_ES
dc.subject.otherHiFies_ES
dc.titleHigh-performance computing in bioinformatics: accelerating de novo assembly.es_ES
dc.typedoctoral thesises_ES
dspace.entity.typePublication
relation.isAdvisorOfPublication34b85e22-88ce-4035-a53e-2bafb0c3310b
relation.isAdvisorOfPublicationc5afb8e7-f15e-4e86-b402-399d9016f30a
relation.isAdvisorOfPublication.latestForDiscovery34b85e22-88ce-4035-a53e-2bafb0c3310b

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
TD_ ESPINOSA_GARCÍA_Elena_María.pdf
Size:
15.83 MB
Format:
Adobe Portable Document Format
Description:

Collections