Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/425230
Title: Cancer Classification using Microarray Gene Expression Data
Researcher: NIMRITA KOUL
Guide(s): SUNILKUMAR S. MANVI
Keywords: Computer Science
Computer Science Theory and Methods
Engineering and Technology
University: REVA University
Completed Date: 2022
Abstract: Accurate and early identification of the molecular nature of cancer is very important in prescribing an effective treatment for it. This has the potential to improve the quality of life for the patients and reduce the toxicity caused by general chemotherapy. Morpho- logical and clinical classification of tumors is effective only in the late stages of tumor development and does not capture molecular level variations. Gene-level data analysis can help in the early diagnosis and classification of cancers. newlineGene expression data made available by the Deoxyribose Nucleic Acid Microar- ray and Next-Generation Sequencing technologies allow the use of data analysis and machine learning techniques to identify useful patterns in this data. The focus of this thesis is on achieving an accurate classification of microarray gene expression profiles through innovations in the machine learning-based methods of feature selection, and classification. newlineOur original contributions to knowledge through this thesis are - the development of an adaptive ensemble feature selection technique with a new score metric, develop- ment of a scheme for globally optimized feature selection using simulated annealing and partial least squares regression, development of a multipurpose scheme for inference of gene regulatory networks and network-based feature selection, and development of an end-to-end optimized framework for accurate classification of gene expression data with few genes. newlineThe first contribution is a two-stage feature selection scheme using adaptive filter- ing of genes with a varying threshold at stage one, followed by an ensemble of feature selection methods at stage two. We identified a small number of relevant genes using this scheme and obtained a classification accuracy of about 100% for each of the six microarray cancer gene expression datasets with these genes. newlineThe second contribution is a feature selection scheme capable of selecting the globally-best features.
URI: http://hdl.handle.net/10603/425230
Appears in Departments:School of Computing and Information Technology

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File55.13 kBAdobe PDFView/Open
02_prelim pages.pdf5.36 MBAdobe PDFView/Open
03_content.pdf42.34 kBAdobe PDFView/Open
04_abstarct .pdf12.51 kBAdobe PDFView/Open
05_chapter 1.pdf821.24 kBAdobe PDFView/Open
06_chapter 2.pdf937.45 kBAdobe PDFView/Open
07_chapter 3.pdf1.84 MBAdobe PDFView/Open
08_chapter 4.pdf1.07 MBAdobe PDFView/Open
09_chapter 5.pdf725.33 kBAdobe PDFView/Open
10_annexures .pdf1.83 MBAdobe PDFView/Open
80_recommendation.pdf124.45 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: