Boosting Backward Search Throughput for FM-Index Using a Compressed Encoding

dc.centroE.T.S.I. Informáticaen_US
dc.contributor.authorHerruzo-Ruiz, José Manuel
dc.contributor.authorGonzález-Navarro, Sonia
dc.contributor.authorIbáñez, Pablo
dc.contributor.authorViñals, Victor
dc.contributor.authorAlastruey-Benedé, Jesús
dc.contributor.authorPlata-González, Óscar Guillermo
dc.date.accessioned2019-04-11T11:31:35Z
dc.date.available2019-04-11T11:31:35Z
dc.date.created2019-04-11
dc.date.issued2019-04-11
dc.departamentoArquitectura de Computadores
dc.description.abstractThe rapid development of DNA sequencing technologies has demanded for com- pressed data structures supporting fast pattern matching queries. FM-index is a widely-used compressed data structure that also supports fast pattern matching queries. It is common for the exact matching algorithm to be memory bound, resulting in poor performance. Searching several symbols in a single step improves data locality, although the memory bandwidth requirements remains the same. We propose a new data-layout of FM-index, called Split bit-vector, that compacts all data needed to search k symbols in a single step (k-step), reducing both memory movement and computing requirements at the cost of increasing memory footprint.en_US
dc.description.sponsorshipUniversidad de Málaga. Campus de Excelencia Internacional Andalucía Tech.en_US
dc.identifier.urihttps://hdl.handle.net/10630/17525
dc.language.isoengen_US
dc.relation.eventdateMarzo 26-29, 2019en_US
dc.relation.eventplaceSnowbird, Utah, USAen_US
dc.relation.eventtitle(DCC 2019) 2019 Data Compression Conferenceen_US
dc.rights.accessRightsopen accessen_US
dc.subjectADN - Análisis secuencialen_US
dc.subject.otherADNen_US
dc.titleBoosting Backward Search Throughput for FM-Index Using a Compressed Encodingen_US
dc.typeconference outputen_US
dspace.entity.typePublication
relation.isAuthorOfPublication8ac2ec53-179e-4e61-949a-0b1887431677
relation.isAuthorOfPublication047ee521-a1cb-4839-a050-9cb2cfd62aec
relation.isAuthorOfPublication34b85e22-88ce-4035-a53e-2bafb0c3310b
relation.isAuthorOfPublication.latestForDiscovery8ac2ec53-179e-4e61-949a-0b1887431677

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
DCC 2019.pdf
Size:
120.32 KB
Format:
Adobe Portable Document Format
Description: