Propuesta de un prototipo de sistema para almacenamiento de información de secuencias biológicas en estructuras de árboles de posiciones

Quiroga Rivas, Julie Alexandra

dc.contributor.advisor	Parra, Carlos Arturo
dc.contributor.author	Quiroga Rivas, Julie Alexandra
dc.contributor.cvlac	Parra, Carlos Arturo [0000746274]	spa
dc.contributor.orcid	Parra, Carlos Arturo [0000-0003-3593-9504]	spa
dc.coverage.campus	UNAB Campus Bucaramanga	spa
dc.coverage.spatial	Bucaramanga, Santander (Colombia)	spa
dc.date.accessioned	2024-10-08T20:22:53Z
dc.date.available	2024-10-08T20:22:53Z
dc.date.issued	1998
dc.degree.name	Systems Engineer (Ingeniero de Sistemas)	spa
dc.description.abstract	Muchos investigadores han estado estudiando proyectos de secuenciamiento, generando cantidades de información y grandes volúmenes de secuencias imposibles de analizar sin el uso de herramientas computacionales. Motivados por esta necesidad se ha desarrollado un prototipo de sistema de software que consta de un conjunto de operaciones en árboles de posiciones que permiten la construcción de los mismos y la localización rápida de subsecuencias en archivos de secuencias biológicas para un máximo de cien secuencias. La información de secuencias es comúnmente almacenada en locaciones contiguas de memoria de acuerdo a las secuencias biológicas en las moléculas. Este método de almacenamiento no es eficiente para el procesamiento de aplicaciones de grandes grupos de secuencias de datos. El problema clave está en el hecho de que los datos almacenados secuencialmente tienen que ser procesados secuencialmente. La información dentro de una secuencia frecuentemente codificada a través de la presencia de una cierta subsecuencia de moléculas, por ejemplo, una secuencia de DNA codifica una cierta proteína. Para detectar la presencia de cualquier subsecuencia dada, se tiene que accesar la secuencia completa y para detectar una subsecuencia en un conjunto de secuencias, cada secuencia debe ser accesada secuencialmente. En la medida en que aumenta el volumen de información y el tiempo de acceso a la secuencia de datos, se convierte en un factor limítrofe de recuperación de la información de la secuencia independientemente de la velocidad de comparación de secuencias.	spa
dc.description.abstractenglish	Many researchers have been working on sequencing projects, generating large amounts of information and volumes of sequences that are impossible to analyze without computational tools. Motivated by this need, a prototype software system has been developed. It consists of a set of operations on position trees that allow the trees to be built and subsequences to be located rapidly in biological sequence files holding up to one hundred sequences. Sequence information is commonly stored in contiguous memory locations, following the order of the biological sequences in the molecules. This storage method is inefficient for applications that process large sets of sequence data. The key problem is that data stored sequentially has to be processed sequentially. The information within a sequence is often encoded by the presence of a certain subsequence of molecules; for example, a DNA sequence encodes a certain protein. To detect the presence of any given subsequence, the entire sequence has to be accessed, and to detect a subsequence in a set of sequences, each sequence must be accessed sequentially. As the volume of information grows, the access time to the sequence data becomes a limiting factor for retrieving sequence information, regardless of the speed of sequence comparison.	spa
dc.description.degreelevel	Undergraduate	spa
dc.description.learningmodality	In-person modality	spa
dc.description.tableofcontents	INTRODUCTION; PROJECT PRESENTATION; BIOLOGICAL SEQUENCES; PROBLEM STATEMENT; SOLUTIONS TO THE PROBLEM; BASIC OPERATIONS ON POSITION TREES; DESIGN OF A PROTOTYPE SYSTEM FOR STORING BIOLOGICAL SEQUENCE INFORMATION; APPLICATIONS OF POSITION TREE AND GENERALIZED POSITION TREE OPERATIONS; CONCLUSIONS; RECOMMENDATIONS; BIBLIOGRAPHIC REFERENCES; APPENDICES	spa
dc.format.mimetype	application/pdf	spa
dc.identifier.instname	instname:Universidad Autónoma de Bucaramanga - UNAB	spa
dc.identifier.reponame	reponame:Repositorio Institucional UNAB	spa
dc.identifier.repourl	repourl:https://repository.unab.edu.co	spa
dc.identifier.uri	http://hdl.handle.net/20.500.12749/26874
dc.language.iso	spa	spa
dc.publisher.faculty	Facultad Ingeniería	spa
dc.publisher.grantor	Universidad Autónoma de Bucaramanga UNAB	spa
dc.publisher.program	Pregrado Ingeniería de Sistemas	spa
dc.relation.references	AHO, HOPCROFT, ULLMAN, Estructura de Datos y Algoritmos, Ed. Addison-Wesley, 1988	spa
dc.relation.references	ALTSCHUL, S. F., Gish, W., Miller, W., Myers, E. and Lipman, D. J. (1990). Basic Local Alignment Search Tool. J. Mol. Biol. 215:403 p.	spa
dc.relation.references	AHO, HOPCROFT, ULLMAN, The Design and Analysis of Computer Algorithms, 1974. 346-357 p.	spa
dc.relation.references	GOTTFRIED, Byron S., Programación en C. Ed. McGraw-Hill, 1992	spa
dc.relation.references	BODMER W. (1995), Where Will Genome Analysis Lead Us Forty Years On. Ann. NY. Acad.	spa
dc.relation.references	EWE, Thorwald. Conceptos Actuales Ingeniería Genética, 1987	spa
dc.relation.references	FISCHER, M.J. and M.S. Paterson [1974]. "String-Matching and Other Products". Project MAC Technical Memorandum 41, MIT, Cambridge, Mass.	spa
dc.relation.references	FU LiMin; Neural Networks in Computer Intelligence; Mc Graw-Hill, 1994	spa
dc.relation.references	GOLDBERG David E.; Genetic Algorithms in Search, Optimization, and Machine Learning; Addison-Wesley Pub. Co. 1989.	spa
dc.relation.references	HIRSCHBERG, D.S. [1973]. "A linear space algorithm for computing maximal common subsequences", TR-138, Computer Science Laboratory, Dept. of Electrical Engineering, Princeton University, Princeton, N.J.	spa
dc.relation.references	KARP, R.M., R.E. Miller, and A.L. Rosenberg [1972]. "Rapid Identification of Repeated Patterns in Strings, Trees and Arrays", Proc. 4th Annual ACM Symposium on Theory of Computing, 125-136 p.	spa
dc.relation.references	KNUTH, D.E. [1973b]. "Notes on Pattern Matching". University of Trondheim, Norway.	spa
dc.relation.references	LIPMAN, D. J. and Pearson, W. R. (1985). Rapid and Sensitive Protein Similarity Searches. Science 227:1435 p.	spa
dc.relation.references	PERRY, Greg. Aprendiendo Programación Orientada a Objetos con Turbo C++ en 21 días. 1995	spa
dc.relation.references	SCHILDT Herbert, Turbo C/C++ Manual de Referencia. 1992	spa
dc.relation.references	SCHILDT Herbert, Programación en Turbo C, ed. Mc Graw-Hill	spa
dc.relation.references	SOUCEK Branco, The IRIS Group; Dynamic, Genetic, and Chaotic Programming: The Sixth Generation; Wiley-Interscience, 1993.	spa
dc.relation.references	RICH Elaine, Artificial Intelligence, McGraw-Hill Book Company, 1983.	spa
dc.relation.references	WAGNER, R.A. and M.J. Fischer [1974]. "The string-to-string correction problem", J. ACM, 21:1, 168-173 p.	spa
dc.relation.references	WEINER, P. [1973]. "Linear Pattern Matching Algorithms", Conference Record, IEEE 14th Annual Symposium on Switching and Automata Theory, 1-11 p.	spa
dc.relation.references	WINSTON Patrick, Inteligencia Artificial. 3ra edición. 1994. Pág. 70-85	spa
dc.relation.references	Técnicas de Programación. Universidad Autónoma de Bucaramanga, TEC de Monterrey. Abril 1997.	spa
dc.relation.references	Accessing Databases. http://arep.med.harvard.edu/seganal/db.html	spa
dc.relation.references	Database Artifacts. http://wod.med.harvard.edu/seganal/contam.html	spa
dc.relation.references	EDELKAMP Stefan, Multi Suffix Trees. Institut für Informatik, Universität Freiburg. edelkamp@informatik.uni-freiburg.de	spa
dc.rights.accessrights	info:eu-repo/semantics/openAccess	spa
dc.rights.creativecommons	Atribución-NoComercial-SinDerivadas 2.5 Colombia	*
dc.rights.local	Open (Full Text)	spa
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/2.5/co/	*
dc.subject.keywords	Systems engineer	spa
dc.subject.keywords	Technological innovations	spa
dc.subject.keywords	Bioinformatics	spa
dc.subject.keywords	Biocomputing	spa
dc.subject.keywords	Computational techniques	spa
dc.subject.keywords	Information retrieval	spa
dc.subject.keywords	Prototype development	spa
dc.subject.keywords	Information storage and retrieval systems	spa
dc.subject.lemb	Ingeniería de sistemas	spa
dc.subject.lemb	Innovaciones tecnológicas	spa
dc.subject.lemb	Recuperación de información	spa
dc.subject.lemb	Desarrollo de prototipos	spa
dc.subject.lemb	Sistemas de almacenamiento y recuperación de información	spa
dc.subject.proposal	Bioinformática	spa
dc.subject.proposal	Biocomputación	spa
dc.subject.proposal	Técnicas computacionales	spa
dc.title	Propuesta de un prototipo de sistema para almacenamiento de información de secuencias biológicas en estructuras de árboles de posiciones	spa
dc.title.translated	Proposal for a prototype system for storing biological sequence information in position tree structures	spa
dc.type.coar	http://purl.org/coar/resource_type/c_7a1f
dc.type.coarversion	http://purl.org/coar/version/c_ab4af688f83e57aa	spa
dc.type.driver	info:eu-repo/semantics/bachelorThesis
dc.type.hasversion	info:eu-repo/semantics/acceptedVersion
dc.type.local	Undergraduate Thesis (Trabajo de Grado)	spa
dc.type.redcol	http://purl.org/redcol/resource_type/TP
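
Illustrative note on the abstract: the record does not include the thesis code, so the following is only a minimal sketch of the idea the abstract describes, namely replacing a sequential scan with a lookup in a position tree. It assumes the textbook definition of a position tree (each position in the sequence is identified by the shortest substring that starts there and occurs nowhere else) and handles a single sequence, not the up-to-one-hundred sequences of the prototype. The names `Node`, `build_position_tree`, and `find_occurrences` are hypothetical, and the construction is deliberately naive rather than efficient.

```python
from typing import Dict, List, Optional

END = "$"  # end marker; assumed not to appear in the sequence or in patterns


class Node:
    """Trie node; `position` is set only on leaves."""

    def __init__(self) -> None:
        self.children: Dict[str, "Node"] = {}
        self.position: Optional[int] = None


def count_overlapping(text: str, piece: str) -> int:
    """Count occurrences of `piece` in `text`, overlaps included."""
    last_start = len(text) - len(piece)
    return sum(1 for i in range(last_start + 1) if text.startswith(piece, i))


def build_position_tree(sequence: str) -> Node:
    """Naive construction: insert each position's shortest unique substring."""
    text = sequence + END
    root = Node()
    for start in range(len(text)):
        length = 1
        # Grow the substring until it occurs exactly once; the end marker
        # guarantees that the full suffix is unique, so this terminates.
        while count_overlapping(text, text[start:start + length]) > 1:
            length += 1
        node = root
        for symbol in text[start:start + length]:
            node = node.children.setdefault(symbol, Node())
        node.position = start
    return root


def collect_positions(node: Node, found: List[int]) -> None:
    """Gather the positions stored on all leaves below `node`."""
    if node.position is not None:
        found.append(node.position)
    for child in node.children.values():
        collect_positions(child, found)


def find_occurrences(root: Node, sequence: str, pattern: str) -> List[int]:
    """Return the sorted start positions of `pattern` in `sequence`."""
    if not pattern:
        return []
    node = root
    for symbol in pattern:
        if node.position is not None:
            # A leaf was reached before the pattern ran out: only this one
            # position can match, so check it directly against the sequence.
            start = node.position
            return [start] if sequence.startswith(pattern, start) else []
        next_node = node.children.get(symbol)
        if next_node is None:
            return []
        node = next_node
    found: List[int] = []
    collect_positions(node, found)
    return sorted(found)


if __name__ == "__main__":
    dna = "ACGTACGA"
    tree = build_position_tree(dna)
    print(find_occurrences(tree, dna, "ACG"))  # [0, 4]
    print(find_occurrences(tree, dna, "TT"))   # []
```

Once the tree is built, a query follows at most one root-to-leaf path plus the leaves below the node it stops at, instead of rescanning the whole sequence for every pattern.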

Files

Original bundle

Name: 1998_Quiroga_Rivas_Julie.pdf
Size: 16.59 MB
Format: Adobe Portable Document Format
Description: Thesis

Collections

Pregrado Ingeniería de Sistemas