Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/342421
Title: Distributed fuzzy clustering techniques for cancer diagnosis with protein sequence data
Researcher: Thenmozhi, K
Guide(s): Karthikeyani Visalakshi, N
Keywords: clustering
Cancer disease
Protein sequences
University: Anna University
Completed Date: 2019
Abstract: The cell of a human body has genes in De oxyribo Nucleic Acid (DNA) which controls the functions of the cell by making proteins. The mutation of one or more genes in a cell causes cancer. This mutation causes the abnormal protein creation which has different information in nucleotide sequence, protein structure than a normal protein. Forming a chain of amino acids is called protein. Proteins are utilized in the early stage of cancer detection because the protein sequences and protein structure motif establish amino acid sequence patterns in all parts of a protein. Biomedical data analysis reveals that the protein data is non-numerical in nature. Data mining plays a major role in extracting the information from big volumes of non-numerical protein sequences. The data can be grouped based on similarities which enables the speedy convergence for clustering different sizes of data. Usually, analyzing the huge volume of data requires more computation time for identifying cancerous protein sequences during the diagnosis of cancer disease. The current research utilizes distributed clustering to deal with the computational time complexity issue by distributing the huge data into different local clusters. Additionally, it utilizes the soft computing approach to group similar protein sequences into specific clusters by finding the global clustering from the results of local clusters. The uncertainty of protein sequences is addressed with the utilization of a fuzzy logic system. This, in turn, leads to achieve the main objective of the research which is identifying the cancerous protein sequences at an early stage from the distributed large data and to improve accuracy, F-measure, precision and recall in clustering the protein sequences with the reduction of the False Positive Rate (FPR) as well as newline
Pagination: xxvii,187p.
URI: http://hdl.handle.net/10603/342421
Appears in Departments:Faculty of Science and Humanities

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File24.44 kBAdobe PDFView/Open
02_certificates.pdf453.11 kBAdobe PDFView/Open
03_vivaproceedings.pdf208.93 kBAdobe PDFView/Open
04_bonafidecertificate.pdf149.53 kBAdobe PDFView/Open
05_abstracts.pdf115.82 kBAdobe PDFView/Open
06_acknowledgements.pdf184.38 kBAdobe PDFView/Open
07_contents.pdf164.24 kBAdobe PDFView/Open
08_listoftables.pdf288.62 kBAdobe PDFView/Open
09_listoffigures.pdf183.43 kBAdobe PDFView/Open
10_listofabbreviation.pdf160.61 kBAdobe PDFView/Open
11_chapter1.pdf386.78 kBAdobe PDFView/Open
12_chapter2.pdf311.67 kBAdobe PDFView/Open
13_chapter3.pdf529.23 kBAdobe PDFView/Open
14_chapter4.pdf635.1 kBAdobe PDFView/Open
15_chapter5.pdf641.52 kBAdobe PDFView/Open
16_chapter6.pdf784.32 kBAdobe PDFView/Open
17_conclusion.pdf163.82 kBAdobe PDFView/Open
18_references.pdf199.37 kBAdobe PDFView/Open
19_listofpublications.pdf97.56 kBAdobe PDFView/Open
80_recommendation.pdf163.33 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: