Distributed fuzzy clustering techniques for cancer diagnosis with protein sequence data

Thenmozhi, K

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/342421

Title:	Distributed fuzzy clustering techniques for cancer diagnosis with protein sequence data
Researcher:	Thenmozhi, K
Guide(s):	Karthikeyani Visalakshi, N
Keywords:	clustering Cancer disease Protein sequences
University:	Anna University
Completed Date:	2019
Abstract:	The cell of a human body has genes in De oxyribo Nucleic Acid (DNA) which controls the functions of the cell by making proteins. The mutation of one or more genes in a cell causes cancer. This mutation causes the abnormal protein creation which has different information in nucleotide sequence, protein structure than a normal protein. Forming a chain of amino acids is called protein. Proteins are utilized in the early stage of cancer detection because the protein sequences and protein structure motif establish amino acid sequence patterns in all parts of a protein. Biomedical data analysis reveals that the protein data is non-numerical in nature. Data mining plays a major role in extracting the information from big volumes of non-numerical protein sequences. The data can be grouped based on similarities which enables the speedy convergence for clustering different sizes of data. Usually, analyzing the huge volume of data requires more computation time for identifying cancerous protein sequences during the diagnosis of cancer disease. The current research utilizes distributed clustering to deal with the computational time complexity issue by distributing the huge data into different local clusters. Additionally, it utilizes the soft computing approach to group similar protein sequences into specific clusters by finding the global clustering from the results of local clusters. The uncertainty of protein sequences is addressed with the utilization of a fuzzy logic system. This, in turn, leads to achieve the main objective of the research which is identifying the cancerous protein sequences at an early stage from the distributed large data and to improve accuracy, F-measure, precision and recall in clustering the protein sequences with the reduction of the False Positive Rate (FPR) as well as newline
Pagination:	xxvii,187p.
URI:	http://hdl.handle.net/10603/342421
Appears in Departments:	Faculty of Science and Humanities

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	24.44 kB	Adobe PDF	View/Open
02_certificates.pdf		453.11 kB	Adobe PDF	View/Open
03_vivaproceedings.pdf		208.93 kB	Adobe PDF	View/Open
04_bonafidecertificate.pdf		149.53 kB	Adobe PDF	View/Open
05_abstracts.pdf		115.82 kB	Adobe PDF	View/Open
06_acknowledgements.pdf		184.38 kB	Adobe PDF	View/Open
07_contents.pdf		164.24 kB	Adobe PDF	View/Open
08_listoftables.pdf		288.62 kB	Adobe PDF	View/Open
09_listoffigures.pdf		183.43 kB	Adobe PDF	View/Open
10_listofabbreviation.pdf		160.61 kB	Adobe PDF	View/Open
11_chapter1.pdf		386.78 kB	Adobe PDF	View/Open
12_chapter2.pdf		311.67 kB	Adobe PDF	View/Open
13_chapter3.pdf		529.23 kB	Adobe PDF	View/Open
14_chapter4.pdf		635.1 kB	Adobe PDF	View/Open
15_chapter5.pdf		641.52 kB	Adobe PDF	View/Open
16_chapter6.pdf		784.32 kB	Adobe PDF	View/Open
17_conclusion.pdf		163.82 kB	Adobe PDF	View/Open
18_references.pdf		199.37 kB	Adobe PDF	View/Open
19_listofpublications.pdf		97.56 kB	Adobe PDF	View/Open
80_recommendation.pdf		163.33 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET