Sistema web de reconocimiento y clasificación de patologías a través de imágenes médicas basado en técnicas de aprendizaje de máquina

dc.contributor.advisorOrtiz Beltrán, Arielspa
dc.contributor.authorArias Trillos, Yhary Estefaníaspa
dc.contributor.cvlacOrtiz Beltrán, Ariel [0001459925]*
dc.contributor.researchgateOrtiz Beltrán, Ariel [Ariel-Ortiz-Beltran]*
dc.contributor.researchgroupGrupo de Investigación Tecnologías de Información - GTIspa
dc.contributor.researchgroupGrupo de Investigaciones Clínicasspa
dc.coverage.campusUNAB Campus Bucaramangaspa
dc.date.accessioned2020-07-27T19:19:09Z
dc.date.available2020-07-27T19:19:09Z
dc.date.issued2019
dc.degree.nameIngeniero de Sistemasspa
dc.description.abstractLa detección de cáncer de tiroides es un proceso que en la actualidad se realiza mediante la interpretación manual que realizan radiólogos especialistas, estas se clasifican utilizando una prueba de tamizaje (discriminatoria) conocida como EU- TIRADS 2017 [2], que determina el grado de malignidad del nódulo tiroideo. La escasez de profesionales y la creciente demanda de este tipo de estudios plantea el problema de la automatización a través de algoritmos de aprendizaje de máquina como los basados en Deep Learning y específicamente, las Redes Neuronales Convolucionales, que han sido probadas anteriormente con éxito para la clasificación de otro tipo de imágenes médicas. En un trabajo anterior, con un dataset de 2000 imágenes balanceado entre 4 categorías (TI-RADS2 - TI-RADS5) se logró una medida de precisión (accuracy) cercana del 65% y una pérdida logarítmica (cross-entropy loss) cercana a 0.78. Sin embargo, este artículo plantea el estudio exploratorio para una posible optimización del algoritmo a través de diferentes pruebas medibles en su parametrización. Las variables que serán ajustadas son: El número de capas convolucionales, el tamaño de la máscara de convolución, las funciones de activación, el número de neuronas en la capa densa, el uso de más capas densas para el aprendizaje, el uso de dropouts aleatorios para controlar el sobreajuste (overfitting), entre otros. La medición comparativa se realiza a través de los valores de precisión, pérdida, la matriz de confusión, y el área bajo la curva ROC. Al final del documento se describe la mejor combinación de los parámetros evaluados y las observaciones pertinentes.spa
dc.description.abstractenglishDetection of thyroid cancer is a process which is currently done through manual interpretation who perform specialist radiologists, these are classified using a screening test (discriminatory) known as EU- TIRADS 2017 [2], which determines the degree of malignancy of the thyroid nodule. The shortage of professionals and the growing Demand for this type of study raises the problem of automation through machine learning algorithms such as those based on Deep Learning and specifically, Networks Convolutionary neurons, which have been tested formerly successfully for the classification of another type of medical images In a previous job, with a 2000 dataset balanced images between 4 categories (TI-RADS2 - TI-RADS5) a measure of accuracy (accuracy) close to 65% was achieved and a logarithmic loss (cross-entropy loss) close to 0.78. Without However, this article raises the exploratory study for a possible algorithm optimization through different tests Measurable in its parameterization. The variables that will be Fitted are: The number of convolutional layers, the size of the convolution mask, the activation functions, the number of neurons in the dense layer, using more dense layers for the learning, the use of random dropouts to control the overfitting, among others. The comparative measurement is performs through the values of precision, loss, the matrix of confusion, and the area under the ROC curve. At the end of the document describes the best combination of the parameters evaluated and the relevant observations.eng
dc.description.degreelevelPregradospa
dc.description.learningmodalityModalidad Presencialspa
dc.description.tableofcontentsResumen 8 Abstract 8 Introducción 9 Planteamiento del problema 9 Pregunta de investigación 10 Objetivos 10 Objetivo General 10 Objetivos específicos 10 Revisión de literatura 11 Tabla de proyectos 11 Tabla de aplicaciones: 14 Tabla de artículos: 16 Estado del arte 17 Proyectos: 17 Aplicaciones 22 Artículos 26 Marco Teórico 30 Manejo de valores nulos 31 Imputación 32 Estandarización 32 Manejo de variables categóricas 32 La multicolinealidad y su impacto 33 Teorema de Fourier 33 Ultrasonido 34 Extracción de características 37 Pruebas en imágenes 38 Gammagrafía 38 Métricas para evaluar un algoritmo de aprendizaje 39 Precisión de Clasificación 39 Pérdida logarítmica 39 Matriz de confusión 39 Algoritmos de predicción 40 Área bajo la curva 41 Aprendizaje de maquina 42 Metodología 44 Tipo de estudio 45 Población/muestra de referencia 45 Muestra elegible 46 Procesamiento de la imagen 46 Análisis de la imagen: 48 Arquitectura de la red neuronal CNN 48 Entrenamiento de la red neuronal 49 Mediciones de desempeño 50 Resultados 51 Resultados de la prueba 52 Conclusiones 53 Cronograma de actividades 54 Actividades 54 Presupuesto 56 Equipo necesario para el desarrollo 56 Costos: 56 Herramientas: 57 Glosario 58 Referencias 62spa
dc.format.mimetypeapplication/pdfspa
dc.identifier.instnameinstname:Universidad Autónoma de Bucaramanga - UNABspa
dc.identifier.reponamereponame:Repositorio Institucional UNABspa
dc.identifier.repourlrepourl:https://repository.unab.edu.cospa
dc.identifier.urihttp://hdl.handle.net/20.500.12749/7053
dc.language.isospaspa
dc.publisher.facultyFacultad Ingenieríaspa
dc.publisher.grantorUniversidad Autónoma de Bucaramanga UNABspa
dc.publisher.programPregrado Ingeniería de Sistemasspa
dc.relation.references[1] Chapter 1: Supervised Learning and Naive Bayes Classification — Part 2 (Coding). (2017). Retrieved from https://medium.com/machine-learning101/chapter-1supervised-learning-and-naive-bayes-classification-part-2coding-5966f25f1475spa
dc.relation.references[2] clinicalkey.es. (2018). Thyroid Imaging. clinicalkey. Retrieved from https://wwwclinicalkey-es.aure.unab.edu.co/service/content/pdf/watermarked/3s2.0B9780323189071000792.pdf?locale=es_ESspa
dc.relation.references[3] clinicalkey.es. (2018). Thyroid Neoplasia. Retrieved from https://wwwclinicalkeyes.aure.unab.edu.co/service/content/pdf/watermarked/3s2.0B9780323189071000925.pdf?locale=es_ESspa
dc.relation.references[4] ClinicalKey.es. (2018). The Thyroid Gland. Estados Unidos. Retrieved from https://www-clinicalkey- es.aure.unab.edu.co/service/content/pdf/watermarked/3- s2.0B9780323401715000195.pdf?locale=es_ESspa
dc.relation.references[5] Department of Computer Science. (1999). Correlation-based Feature Selection for Machine Learning. Hamilton, NewZealand. Retrieved from https://www.lri.fr/~pierres/donn%E9es/save/these/articles/lprqueue/hall99correlationb ased.pdfspa
dc.relation.references[6] Department of Computer Science University of Waikato. (1999). Feature Selection for Machine Learning: Comparing a Correlation-based Filter Approach to the Wrapper. New Zealand: waikato. Retrieved from http://www.aaai.org/Papers/FLAIRS/1999/FLAIRS99-042.pdfspa
dc.relation.references[7] ESMERILADO - Definición - Significado. (2018). Retrieved from https://diccionario.motorgiga.com/esmeriladospa
dc.relation.references[8] Exámenes PET CT y RM: ¿Qué son y cómo usarlos en la medicina diagnóstica?. (2015). Retrieved from http://www.mv.com.br/es/blog/examenes- -pet-ct-y-rm---que-son-y-como-usarlos-en-la-medicina-diagnosticarspa
dc.relation.references[9] Medicina nuclear: SPECT y PET en tumores primarios del Sistema Nervioso | NeuroWikia. (2010). Retrieved from http://www.neurowikia.es/content/medicina-nuclear-spect-y-pet-en-tumoresprimariosdel-snspa
dc.relation.references[10] Northside Radiology Associates. (2016). MIBG scintiscan. Atlanta: Editorial team. Retrieved from https://medlineplus.gov/ency/article/003830.htmspa
dc.relation.references[11] Positron. (2018). Retrieved from https://en.wikipedia.org/wiki/Positronspa
dc.relation.references[12] Pruebas para detectar el cáncer de tiroides. (2016). Retrieved from https://www.cancer.org/es/cancer/cancer-de-tiroides/detecciondiagnosticoclasificacion-por-etapas/como-se-diagnostica.htmlspa
dc.relation.references[13] University of Washington School of Medicine. (2016). Gammagrafía de la tiroides. Seattle: Editorial Director y A.D.A.M. Editorial team. Retrieved from https://medlineplus.gov/spanish/ency/article/003829.htmspa
dc.relation.references[14] working paper series. (2000). Correlation - based feature selection ffor discrete and numeric class Machine Learning. New Zealand. Retrieved from https://researchcommons.waikato.ac.nz/bitstream/handle/10289/1024/uow-cswp- 2000-08.pdf?sequence=1&isAllowed=yspa
dc.relation.references[15] Yodo-131. (2018). Retrieved from https://es.wikipedia.org/wiki/Yodo-131spa
dc.relation.references[16] ¿Qué es una API REST? - Idento. (2018). Retrieved from https://www.idento.es/blog/desarrollo-web/que-es-una-api-rest/spa
dc.relation.references[17] Cloud AutoML - Custom Machine Learning Models | AutoML | Google Cloud. (2018). Retrieved from https://cloud.google.com/automl/spa
dc.relation.references[18] Convolutional Neural Networks for Visual Recognition. (2018). Retrieved from http://cs231n.github.io/convolutional-networks/spa
dc.relation.references[19] general, M. (2018). Salarios de Médico/a general en Colombia | Indeed.com. Retrieved from https://co.indeed.com/salaries/M%C3%A9dico/a-generalSalariesspa
dc.relation.references[20] Journal of Physics: Conference Series. (2016). Recent development of feature extraction and classification multispectral/hyperspectral images: a systematic literature review. Retrieved from http://iopscience.iop.org/article/10.1088/17426596/801/1/012045/pdfspa
dc.relation.references[21] Judith Marcin, M. (2018). MRI Scans: Definition, uses, and procedure. Retrieved from https://www.medicalnewstoday.com/articles/146309.phpspa
dc.relation.references[22] Judith Marcin, M. (2018). MRI Scans: Definition, uses, and procedure. Retrieved from https://www.medicalnewstoday.com/articles/146309.phpspa
dc.relation.references[21] Principal Component Analysis [PCA]. (2017). Retrieved from https://medium.com/100-days-of-algorithms/day-92-pca-bdb66840a8fbspa
dc.relation.references[22] ProClass Software - 2018 Reviews, Pricing & Demo. (2018). Retrieved from https://www.softwareadvice.com/registration/proclass-profile/spa
dc.relation.references[23] ProClass Software - 2018 Reviews, Pricing & Demo. (2018). Retrieved from https://www.softwareadvice.com/registration/proclass-profile/spa
dc.relation.references[24] Purves, D., Augustine, G., Fitzpatrick, D., Katz, L., LaMantia, A., McNamara, J., & Williams, S. (2018). Types of Eye Movements and Their Functions. Retrieved from https://www.ncbi.nlm.nih.gov/books/NBK10991/spa
dc.relation.references[25] Robust image hashing using ring partition-PGNMF and local features. (2016). Retrieved from https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5118381/spa
dc.relation.references[26] School of Electronics and Control Engineering Chang'an. (2012). Hyperspectral Image Classification Based on Spectral-Spatial Feature Extraction. Xi’an, China. Retrieved from https://ieeexplore- ieeeorg.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=7958808&tag=1spa
dc.relation.references[27] sistemas, I. (2018). Salarios de Ingeniero/a en sistemas en Colombia | Indeed.com. Retrieved from https://co.indeed.com/salaries/Ingeniero/a-ensistemas-Salariesspa
dc.relation.references[26] What is a data set? - Quora. (2017). Retrieved from https://www.quora.com/What-is-adata-setspa
dc.relation.references[27] Brownlee, J. (2016). What is Deep Learning? Retrieved from https://machinelearningmastery.com/what-is-deep-learningspa
dc.relation.references[27] Brownlee, J. (2016). What is Deep Learning? Retrieved from https://machinelearningmastery.com/what-is-deep-learningspa
dc.relation.references[29] Raicea, R. (2018). Want to know how Deep Learning works? Here’s a quick guide for everyone. Retrieved from https://medium.freecodecamp.org/want-toknow-how-deeplearning-works-heres-a-quick-guide-for-everyone1aedeca88076spa
dc.relation.references[30] Tiempo, C. (2018). Sueldo de un profesional con posgrado. Retrieved from https://www.portafolio.co/economia/empleo/un-trabajador-con-posgrado-ganaenpromedio-3-3-millones-mas-que-un-bachiller-512462spa
dc.relation.references[31] School of Electronics and Control Engineering Chang'an University (2017). Hyperspectral Image Classification Based on SpectralSpatial Feature Extraction. Xi’an, China. Retrieved from https://ieeexplore-ieeorg.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=7958808spa
dc.relation.references[32] School of Electronics and Control Engineering. (2013). Using Nonnegative Matrix Factorization with Projected Gradient for Hyperspectral Images Feature Extraction. Retrieved from https://ieeexplore-ieee- org.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=6566423&ta g=1spa
dc.relation.references[33] Dharmsinh Desai University. (2009). Project Classification Using Soft Computing. Nadiad, Gujarat. Retrieved from https://ieeexplore- ieeeorg.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=5376511&ta g=1spa
dc.relation.references[34] Dharmsinh Desai University. (2009). Project Classification Using Soft Computing. Nadiad, Gujarat. Retrieved from https://ieeexplore- ieeeorg.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=5376511&ta g=1spa
dc.relation.references[35] Department of Computer Science California State Polytechnic University. (2016). Oculomotor Plant Feature Extraction from Human Saccadic Eye Movements. Pomona, USA. Retrieved from https://ieeexplore-ieee- org.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=8424698spa
dc.relation.references[36] Bharathidasan College of Arts & Science. (2016). Fusion of Big Data and Neural Networks for Predicting Thyroid. Tamilnadu, India. Retrieved from https://ieeexplore-ieee- org.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=7955223spa
dc.relation.references[36] Bharathidasan College of Arts & Science. (2016). Fusion of Big Data and Neural Networks for Predicting Thyroid. Tamilnadu, India. Retrieved from https://ieeexplore-ieee- org.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=7955223spa
dc.relation.references[37] Bernard, O., Lalande, A., Zotti, C., & Cervenansky, F. (2018). Deep Learning Techniques for Automatic MRI Cardiac Multi-structures Segmentation and Diagnosis: Is the Problem Solved? Retrieved from https://ieeexplore-ieee- org.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=8360453spa
dc.relation.references[38] School of Informatics and Computing, Indiana University. (2015). Temporal Pattern and Association Discovery of Diagnosis Codes using Deep Learning. IN, USA. Retrieved from https://ieeexploreieee- org.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=7349719&ta g=1spa
dc.relation.references[39] Amazon Rekognition – Videos e imágenes – AWS. (2018). Retrieved from https://aws.amazon.com/es/rekognition/?hp=tile&soexp=belowspa
dc.relation.references[40] AMI de aprendizaje profundo de Amazon. (2018). Retrieved from https://aws.amazon.com/es/machine-learning/amis/?hp=tile&soexp=belowspa
dc.relation.references[41] Amazon Polly. (2018). Retrieved from https://aws.amazon.com/es/polly/?hp=tile&so-exp=below pruébelo, V. (2018). Chatbot | Deep learning | Amazon Lex. Retrieved from https://aws.amazon.com/es/lex/?hp=tile&soexp=belowspa
dc.relation.references[42] Amazon Translate – Traducción de máquina neural – AWS. (2018). Retrieved from https://aws.amazon.com/es/translate/?hp=tile&soexp=belowspa
dc.relation.references[43] What is Watson. (2018). Retrieved from https://www.ibm.com/watson/about/index.html ©2018 IEEE. (2018). Delta Univ. for Science and Technology. Gamasa City, Egypt, from https://ieeexplore-ieee- org.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=8358209spa
dc.relation.references[44] National Chung Hsing University. (2009). Optimal Grouping by using Genetic Algorithm and Support Vector Machines. Retrieved from https://ieeexplore-ieee- org.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=5420079spa
dc.relation.references[45] EEE TRANSACTIONS ON BIOMEDICAL ENGINEERING. (2010). Thyroid Segmentation and Volume Estimation in Ultrasound Images. Retrieved from https://ieeexplore-ieee- org.aure.unab.edu.co/stamp/stamp.jsp tp=&arnumber=5415666&tag=1spa
dc.relation.references[46] IEEE. (2017). Tiroid Kanserinde BilgisayarlÕ Tomografi Temelli Yeni Öznitelikler Computerized Tomography Based Novel Features in Thyroid Cancer. Retrieved from https://ieeexplore-ieee- org.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=8238050spa
dc.relation.references[47] IEEE. (2009). Computer-Aided Diagnosis of Thyroid Malignancy Using an Artificial Immune System Classification Algorithm. Retrieved from https://ieeexplore-ieee- org.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=4539694spa
dc.relation.references[48] AIMBE. (2012). Automated Benign & Malignant Thyroid Lesion Characterization and Classification in 3D Contrast-Enhanced Ultrasound. IEEE. Retrieved from https://ieeexplore-ieee- org.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=6345965spa
dc.relation.references[49] The department of thyroid surgery. (2012). Discussion about misdiagnosed reasons and reoperation of thyroid cancer. Changchun city, China. Retrieved from https://ieeexplore- ieeeorg.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=6291383spa
dc.relation.references[50] Department of Electronics and Computer Engineering, Gifu University Yanagido. (1992). Neural Network Approach for the ComputerAided Diagnosis of Coronary Artery Diseases in Nuclear Medicine. Retrieved from https://ieeexplore-ieeeorg.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=227168spa
dc.relation.references[51] Shandong Provincial Key Laboratory of Computer Networks. (2016). Web Identification Image Recognition Based on Deep Learning. Jinan, China. Retrieved from https://ieeexplore-ieeeorg.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=7726261spa
dc.relation.references[52] College of Computer Science and Technology. (2012). Interface Schema Matching with the Machine Learning for Deep Web. Harbin, P.R. China. Retrieved from https://ieeexplore-ieee.org.aure.unab.edu.co/stamp/stamp.jsp?tp=&arnumber=6526056spa
dc.relation.references[53] Towards Data Science. (2019). Metrics to Evaluate your Machine Learning Algorithm. [online] Available at: https://towardsdatascience.com/metrics-to-evaluate-your-machinelearning-algorithm-f10ba6e38234 [Accessed 5 Jun. 2019].spa
dc.rights.accessrightsinfo:eu-repo/semantics/openAccessspa
dc.rights.accessrightshttp://purl.org/coar/access_right/c_abf2spa
dc.rights.creativecommonsAtribución-NoComercial-SinDerivadas 2.5 Colombia*
dc.rights.localAbierto (Texto Completo)spa
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/2.5/co/*
dc.subject.keywordsSystems engineereng
dc.subject.keywordsTechnological innovationseng
dc.subject.keywordsData seteng
dc.subject.keywordsDeep learningeng
dc.subject.keywordsMachingeng
dc.subject.keywordsData increaseeng
dc.subject.keywordsNeural networkseng
dc.subject.keywordsArtificial intelligenceeng
dc.subject.keywordsUltrasoundeng
dc.subject.keywordsRadiologyeng
dc.subject.keywordsArea under the curveeng
dc.subject.keywordsConfusion matrixeng
dc.subject.keywordsConcordance studyeng
dc.subject.keywordsCancer diagnosiseng
dc.subject.keywordsX-rayseng
dc.subject.keywordsMedical examinationseng
dc.subject.keywordsDiagnostic serviceeng
dc.subject.keywordsEndocrine glandseng
dc.subject.lembIngeniería de sistemasspa
dc.subject.lembInnovaciones tecnológicasspa
dc.subject.lembInteligencia artificialspa
dc.subject.lembCáncer diagnósticospa
dc.subject.lembRayos xspa
dc.subject.lembExámenes médicosspa
dc.subject.lembServicio de diagnósticospa
dc.subject.lembGlándulas endocrinasspa
dc.subject.proposalConjunto de datosspa
dc.subject.proposalAprendizaje profundospa
dc.subject.proposalAumento de datosspa
dc.subject.proposalRedes neuronalesspa
dc.subject.proposalUltrasonidospa
dc.subject.proposalRadiologíaspa
dc.subject.proposalÁrea bajo la curvaspa
dc.subject.proposalMatriz de confusiónspa
dc.subject.proposalEstudio de concordanciaspa
dc.titleSistema web de reconocimiento y clasificación de patologías a través de imágenes médicas basado en técnicas de aprendizaje de máquinaspa
dc.title.translatedWeb recognition and classification system pathologies through medical images based on machine learning techniqueseng
dc.type.coarhttp://purl.org/coar/resource_type/c_7a1f
dc.type.driverinfo:eu-repo/semantics/bachelorThesis
dc.type.hasversioninfo:eu-repo/semantics/acceptedVersion
dc.type.localTrabajo de Gradospa
dc.type.redcolhttp://purl.org/redcol/resource_type/TP

Archivos

Bloque original

Mostrando 1 - 2 de 2
Cargando...
Miniatura
Nombre:
2019_Tesis_YharyEstefania_Arias_Trillos.pdf
Tamaño:
1.58 MB
Formato:
Adobe Portable Document Format
Descripción:
Tesis
Cargando...
Miniatura
Nombre:
2019_Licencia_YharyEstefania_Arias_Trillos.pdf
Tamaño:
117.02 KB
Formato:
Adobe Portable Document Format
Descripción:
Licencia

Bloque de licencias

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
license.txt
Tamaño:
1.71 KB
Formato:
Item-specific license agreed upon to submission
Descripción: