Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/256107
Title: Transcription factor binding site prediction using deep learning
Researcher: Mohamed Divan Masood M
Guide(s): Manjula D
Keywords: ANN
Engineering and Technology,Computer Science,Computer Science Information Systems
Transcription Factor
University: Anna University
Completed Date: 2018
Abstract: A majority of the human genome consists of sequences that do not code for a particular protein, so called non-coding DNA. The non-coding region plays a major role in gene expression. These non-coding regions of the DNA contain cis-regulatory components such as promoters and enhancers, and can be bound by Transcription Factor (TF) proteins and thereby control the rate of transcription of DNA to messenger RNA. Next Generation Sequencing (NGS) techniques helps to identifying and studying the genomic factors such as Transcription Factor Binding Sites (TFBSs). Knowing the sequence specificities of DNA and RNA-binding proteins are essential for developing models of the regulatory processes in biological systems and for identifying causal disease variants. This research mainly focus on sequence specificities that can be ascertained from experimental data with deep learning techniques, which offer a scalable, flexible and unified computational approach for pattern discovery. Discovering the TFBS has immense significance in terms of developing techniques and evaluating regulatory processes in biological systems. The uniqueness of genetic sequences can be discovered with the TF. A genetic disease can be cured by determining the specificities of the gene sequences concerned. In molecular biology, identifying accurate TF binding sites is a challenge for researchers, and an efficient methodology is needed to recognize such binding sites. Currently, deep learning techniques have been yielding great results, especially in computational biology. In the present approach, the Convolutional Neural Network (CNN) and Artificial Neural Network (ANN) have been used. In the first experiment, CNN models were built to predict TFBSs from gene sequence datasets. newline newline newline
Pagination: xvii, 131p.
URI: http://hdl.handle.net/10603/256107
Appears in Departments:Faculty of Information and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File26.91 kBAdobe PDFView/Open
02_certificates.pdf473.81 kBAdobe PDFView/Open
03_abstract.pdf127.23 kBAdobe PDFView/Open
04_acknowledgement.pdf4.62 kBAdobe PDFView/Open
05_table of contents.pdf310.05 kBAdobe PDFView/Open
06_list_of_symbols and abbreviations.pdf5.89 kBAdobe PDFView/Open
07_chapter1.pdf993.56 kBAdobe PDFView/Open
08_chapter2.pdf395.22 kBAdobe PDFView/Open
09_chapter3.pdf431.31 kBAdobe PDFView/Open
10_chapter4.pdf502.01 kBAdobe PDFView/Open
11_chapter5.pdf567.1 kBAdobe PDFView/Open
12_chapter6.pdf1.48 MBAdobe PDFView/Open
13_conclusion.pdf16.78 kBAdobe PDFView/Open
14_references.pdf138.01 kBAdobe PDFView/Open
15_list_of_publications.pdf91.73 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: