Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/324381
Title: Enhanced Feature Selection Techniques for Classification of Microarray Gene Expression Data
Researcher: Rabindra Kumar Singh
Guide(s): Sivabalakrishnan, M
Keywords: Computer Science
Computer Science Interdisciplinary Applications
Engineering and Technology
University: VIT University
Completed Date: 2020
Abstract: Thanks to the breakthroughs that have taken place in bioinformatics during recent years. Data in the field of bioinformatics is accumulating at an unseen pace. Data-intensive, large scale biological problems from the computational point of view were addressed in bioinformatics. These data are still analyzed using some old statistical methods. Microarray technology has been used extensively to study the gene expression of Deoxy Ribonucleic Acid (DNA) data for the diagnosis and prognosis of diseases, including cancer in the organism. However, advancement in microarray technology has facilitated the researcher to the next level for analyzing the gene expression datasets. Hence, microarray technology has evolved as one of the powerful tools for analysing gene expression levels for an organism, that facilitated the researcher to apply Machine learning algorithms like classification for analyzing and extracting exciting patterns of gene expression. However, microarray datasets contain a very massive amount of features( genes) but comparatively small samples size, because of the mentioned characteristics of microarray data, the efficiency of machine learning algorithms are taxed. Hence, there is a need for feature engineering for achieving better outcomes of machine learning algorithms for obtaining interesting gene expression patterns. This thesis focuses on improvising the efficiency of the machine learning algorithm by using feature engineering on microarray datasets. The works presented in this thesis are improvised feature engineering techniques with the help of proper resampling with efficient feature engineering and hybridization of mRMR and AGA method. The experiments were conducted using RFEVSS and RSRV as an efficient feature selection on four microarray datasets, which was the first contribution. The microarray gene expression data common characteristics are high dimension along with a small newlinesample size. The performance of the learning algorithms is never up to mark because of these characteristics. Another c
Pagination: i-xii, 1-117
URI: http://hdl.handle.net/10603/324381
Appears in Departments:School of Computing Science and Engineering -VIT-Chennai

Files in This Item:
File Description SizeFormat 
01_tiltle page.pdfAttached File218.3 kBAdobe PDFView/Open
02_declaration & certificate.pdf249.51 kBAdobe PDFView/Open
03_abstract.pdf123.83 kBAdobe PDFView/Open
04_acknowledgement.pdf63.11 kBAdobe PDFView/Open
05_table of contents.pdf198.14 kBAdobe PDFView/Open
06_list of figures.pdf132.75 kBAdobe PDFView/Open
07_list of tables.pdf68.44 kBAdobe PDFView/Open
08_list of terms and abbreviations.pdf228.39 kBAdobe PDFView/Open
09_chapter_01.pdf1 MBAdobe PDFView/Open
10_chapter_02.pdf1.35 MBAdobe PDFView/Open
11_chapter_03.pdf816.39 kBAdobe PDFView/Open
12_chapter_04.pdf3.31 MBAdobe PDFView/Open
13_chapter_05.pdf2.77 MBAdobe PDFView/Open
14_chapter_06.pdf136.52 kBAdobe PDFView/Open
15_references.pdf1.15 MBAdobe PDFView/Open
16_list of publications.pdf62.01 kBAdobe PDFView/Open
80_recommendation.pdf355.2 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: