Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/342421
Title: | Distributed fuzzy clustering techniques for cancer diagnosis with protein sequence data |
Researcher: | Thenmozhi, K |
Guide(s): | Karthikeyani Visalakshi, N |
Keywords: | clustering Cancer disease Protein sequences |
University: | Anna University |
Completed Date: | 2019 |
Abstract: | The cell of a human body has genes in De oxyribo Nucleic Acid (DNA) which controls the functions of the cell by making proteins. The mutation of one or more genes in a cell causes cancer. This mutation causes the abnormal protein creation which has different information in nucleotide sequence, protein structure than a normal protein. Forming a chain of amino acids is called protein. Proteins are utilized in the early stage of cancer detection because the protein sequences and protein structure motif establish amino acid sequence patterns in all parts of a protein. Biomedical data analysis reveals that the protein data is non-numerical in nature. Data mining plays a major role in extracting the information from big volumes of non-numerical protein sequences. The data can be grouped based on similarities which enables the speedy convergence for clustering different sizes of data. Usually, analyzing the huge volume of data requires more computation time for identifying cancerous protein sequences during the diagnosis of cancer disease. The current research utilizes distributed clustering to deal with the computational time complexity issue by distributing the huge data into different local clusters. Additionally, it utilizes the soft computing approach to group similar protein sequences into specific clusters by finding the global clustering from the results of local clusters. The uncertainty of protein sequences is addressed with the utilization of a fuzzy logic system. This, in turn, leads to achieve the main objective of the research which is identifying the cancerous protein sequences at an early stage from the distributed large data and to improve accuracy, F-measure, precision and recall in clustering the protein sequences with the reduction of the False Positive Rate (FPR) as well as newline |
Pagination: | xxvii,187p. |
URI: | http://hdl.handle.net/10603/342421 |
Appears in Departments: | Faculty of Science and Humanities |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 24.44 kB | Adobe PDF | View/Open |
02_certificates.pdf | 453.11 kB | Adobe PDF | View/Open | |
03_vivaproceedings.pdf | 208.93 kB | Adobe PDF | View/Open | |
04_bonafidecertificate.pdf | 149.53 kB | Adobe PDF | View/Open | |
05_abstracts.pdf | 115.82 kB | Adobe PDF | View/Open | |
06_acknowledgements.pdf | 184.38 kB | Adobe PDF | View/Open | |
07_contents.pdf | 164.24 kB | Adobe PDF | View/Open | |
08_listoftables.pdf | 288.62 kB | Adobe PDF | View/Open | |
09_listoffigures.pdf | 183.43 kB | Adobe PDF | View/Open | |
10_listofabbreviation.pdf | 160.61 kB | Adobe PDF | View/Open | |
11_chapter1.pdf | 386.78 kB | Adobe PDF | View/Open | |
12_chapter2.pdf | 311.67 kB | Adobe PDF | View/Open | |
13_chapter3.pdf | 529.23 kB | Adobe PDF | View/Open | |
14_chapter4.pdf | 635.1 kB | Adobe PDF | View/Open | |
15_chapter5.pdf | 641.52 kB | Adobe PDF | View/Open | |
16_chapter6.pdf | 784.32 kB | Adobe PDF | View/Open | |
17_conclusion.pdf | 163.82 kB | Adobe PDF | View/Open | |
18_references.pdf | 199.37 kB | Adobe PDF | View/Open | |
19_listofpublications.pdf | 97.56 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 163.33 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: