Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/253191
Title: | A novel approach for efficient mining and discrimination of gene sequencing in protein sequence database |
Researcher: | Jeyabharathi J |
Guide(s): | Shanthi D |
Keywords: | Engineering and Technology,Computer Science,Computer Science Information Systems Gene Sequencing Mining Novel Approach Protein Sequence |
University: | Anna University |
Completed Date: | 2018 |
Abstract: | Sequential pattern mining is the task of identifying the patterns present in a certain number of data instances. The existing sequence mining algorithms mainly focus on mining for sub sequences. However, a wide range of applications such as biological DNA and protein motif mining needs an effective mining for identifying the approximate frequent patterns. The existing approximate frequent pattern mining algorithms have some delimitation such as lack of knowledge to finding the patterns, poor scalability and complexity to adapt into some other applications. The algorithm Generalised Approximate Pattern mining Algorithm (GAPA) is proposed to efficiently mine the approximate frequent patterns in the protein sequence database. Pearsonand#8223;s coefficient correlation is computed among the protein sequence database items to analyse the approximate frequent patterns. This work proposes a novel Enhanced Sequence Identification (ESI) approach to effectively find the frequent patterns from the huge dataset. The Hybrid Frequent Pattern Mining (HFPM) algorithm employs the tree-based structure that achieves a significant reduction in the space complexity. Association rules are used for mining the frequent patterns by identifying the relationship between the items and finding the approximate frequent patterns from the databases. The frequent items with dependency are added down to the leaves of the tree. The proposed HFPM-ESI algorithm shows high performance newlinewith less memory consumption and lower run time than the existing algorithms. The proposed algorithm ensures the effective extraction of frequent patterns with newlinethe optimization of resource constraints newline newline |
Pagination: | xiv, 115p. |
URI: | http://hdl.handle.net/10603/253191 |
Appears in Departments: | Faculty of Information and Communication Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 112.14 kB | Adobe PDF | View/Open |
02_certificates.pdf | 408.29 kB | Adobe PDF | View/Open | |
03_abstract.pdf | 172.31 kB | Adobe PDF | View/Open | |
04_acknowledgement.pdf | 95.16 kB | Adobe PDF | View/Open | |
05_table of contents.pdf | 222.38 kB | Adobe PDF | View/Open | |
06_list_of_tables.pdf | 170.5 kB | Adobe PDF | View/Open | |
07_list_of_figures.pdf | 97.38 kB | Adobe PDF | View/Open | |
08_list_of_symbols and abbreviations.pdf | 171.6 kB | Adobe PDF | View/Open | |
09_chapter1.pdf | 312.12 kB | Adobe PDF | View/Open | |
10_chapter2.pdf | 383.67 kB | Adobe PDF | View/Open | |
11_chapter3.pdf | 576.91 kB | Adobe PDF | View/Open | |
12_chapter4.pdf | 748.13 kB | Adobe PDF | View/Open | |
13_chapter5.pdf | 881.18 kB | Adobe PDF | View/Open | |
14_chapter6.pdf | 1.17 MB | Adobe PDF | View/Open | |
15_conclusion.pdf | 181.15 kB | Adobe PDF | View/Open | |
16_references.pdf | 205.57 kB | Adobe PDF | View/Open | |
17_list_of_publications.pdf | 175.88 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: