Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/253191
Title: A novel approach for efficient mining and discrimination of gene sequencing in protein sequence database
Researcher: Jeyabharathi J
Guide(s): Shanthi D
Keywords: Engineering and Technology,Computer Science,Computer Science Information Systems
Gene Sequencing
Mining
Novel Approach
Protein Sequence
University: Anna University
Completed Date: 2018
Abstract: Sequential pattern mining is the task of identifying the patterns present in a certain number of data instances. The existing sequence mining algorithms mainly focus on mining for sub sequences. However, a wide range of applications such as biological DNA and protein motif mining needs an effective mining for identifying the approximate frequent patterns. The existing approximate frequent pattern mining algorithms have some delimitation such as lack of knowledge to finding the patterns, poor scalability and complexity to adapt into some other applications. The algorithm Generalised Approximate Pattern mining Algorithm (GAPA) is proposed to efficiently mine the approximate frequent patterns in the protein sequence database. Pearsonand#8223;s coefficient correlation is computed among the protein sequence database items to analyse the approximate frequent patterns. This work proposes a novel Enhanced Sequence Identification (ESI) approach to effectively find the frequent patterns from the huge dataset. The Hybrid Frequent Pattern Mining (HFPM) algorithm employs the tree-based structure that achieves a significant reduction in the space complexity. Association rules are used for mining the frequent patterns by identifying the relationship between the items and finding the approximate frequent patterns from the databases. The frequent items with dependency are added down to the leaves of the tree. The proposed HFPM-ESI algorithm shows high performance newlinewith less memory consumption and lower run time than the existing algorithms. The proposed algorithm ensures the effective extraction of frequent patterns with newlinethe optimization of resource constraints newline newline
Pagination: xiv, 115p.
URI: http://hdl.handle.net/10603/253191
Appears in Departments:Faculty of Information and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File112.14 kBAdobe PDFView/Open
02_certificates.pdf408.29 kBAdobe PDFView/Open
03_abstract.pdf172.31 kBAdobe PDFView/Open
04_acknowledgement.pdf95.16 kBAdobe PDFView/Open
05_table of contents.pdf222.38 kBAdobe PDFView/Open
06_list_of_tables.pdf170.5 kBAdobe PDFView/Open
07_list_of_figures.pdf97.38 kBAdobe PDFView/Open
08_list_of_symbols and abbreviations.pdf171.6 kBAdobe PDFView/Open
09_chapter1.pdf312.12 kBAdobe PDFView/Open
10_chapter2.pdf383.67 kBAdobe PDFView/Open
11_chapter3.pdf576.91 kBAdobe PDFView/Open
12_chapter4.pdf748.13 kBAdobe PDFView/Open
13_chapter5.pdf881.18 kBAdobe PDFView/Open
14_chapter6.pdf1.17 MBAdobe PDFView/Open
15_conclusion.pdf181.15 kBAdobe PDFView/Open
16_references.pdf205.57 kBAdobe PDFView/Open
17_list_of_publications.pdf175.88 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: