Artículo
Efficient repeat finding in sets of strings via suffix arrays
Barenbaum, Pablo
; Becher, Veronica Andrea
; Deymonnaz, Alejandro; Halsband, Melisa; Heiber, Pablo Ariel
Fecha de publicación:
04/2013
Editorial:
Discrete Mathematics and Theoretical Computer Science
Revista:
Discrete Mathematics and Theoretical Computer Science
ISSN:
1365-8050
Idioma:
Inglés
Tipo de recurso:
Artículo publicado
Clasificación temática:
Resumen
We consider two repeat finding problems relative to sets of strings: (a) Find the largest substrings that occur in every string of a given set; (b) Find the maximal repeats in a given string that occur in no string of a given set. Our solutions are based on the suffix array construction, requiring O(m) memory, where m is the length of the longest input string, and O(n &log;m) time, where n is the the whole input size (the sum of the length of each string in the input). The most expensive part of our algorithms is the computation of several suffix arrays. We give an implementation and experimental results that evidence the efficiency of our algorithms in practice, even for very large inputs.
Palabras clave:
Stringology
,
Repeats
,
Suffix Array
,
Longest Maximal Substring
Archivos asociados
Licencia
Identificadores
Colecciones
Articulos(OCA CIUDAD UNIVERSITARIA)
Articulos de OFICINA DE COORDINACION ADMINISTRATIVA CIUDAD UNIVERSITARIA
Articulos de OFICINA DE COORDINACION ADMINISTRATIVA CIUDAD UNIVERSITARIA
Citación
Barenbaum, Pablo; Becher, Veronica Andrea; Deymonnaz, Alejandro; Halsband, Melisa; Heiber, Pablo Ariel; Efficient repeat finding in sets of strings via suffix arrays; Discrete Mathematics and Theoretical Computer Science; Discrete Mathematics and Theoretical Computer Science; 15; 2; 4-2013; 59-70
Compartir