Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/240181
Full metadata record
DC FieldValueLanguage
dc.coverage.spatial
dc.date.accessioned2019-04-30T07:05:00Z-
dc.date.available2019-04-30T07:05:00Z-
dc.identifier.urihttp://hdl.handle.net/10603/240181-
dc.description.abstractTandemly repeated structural motifs form integrated assemblies with multiple binding sites and facilitate various protein-protein/DNA/RNA interactions. The evolutionary conservation and disease association of repeat proteins depict their structural and functional importance, and the low sequence conservation suggests the need for their identification at the structural level. newlineIn this thesis, we propose graph based approaches for the identification of repeat proteins and use them to build a database of structural repeats. We observe that the eigenvector centrality of the protein contact network is able to capture the conserved residue interaction pattern within and between the repeating units. The efficacy of the measure is tested by developing a rule based approach, AnkPred, for identifying Ankyrin repeat proteins. We extended the approach by developing a generalized algorithm, PRIGSA, to identify members of known families and also de novo identify previously uncharacterized novel repeats. The performance of the approach is compared to state-of-the-art methods on benchmark datasets and predictions of the approach for 13 known families are compared with UniProt annotation. The PRIGSA algorithm is executed on the complete PDB to build a database of structural repeats, DbStRiPs. The de novo predicted repeats are clustered and manually curated. We report one novel repeat family and 31 De novo Protein Repeat Clusters (DPRCs). In DbStRiPs, 11901 repeats are reported in 10816 PDB chains, which are categorized into known protein repeat families, DPRC or Unclassified. newlineWe also developed a webserver for network analysis of protein structures, NAPS, providing a comprehensive platform for construction of 5 types of networks and their analyses such as centrality, shortest path, k-clique and graph spectral analyses. The web server provides browser independent interactive platform for visual analysis of protein structure and network along with options to download the results in suitable formats.
dc.format.extent
dc.languageEnglish
dc.relation
dc.rightsself
dc.titleNetwork based Identification and Analysis of Structural Repeat Proteins
dc.title.alternative
dc.creator.researcherBroto Chakrabarty
dc.subject.keyworddatabase of structural repeats
dc.subject.keywordEngineering and Technology,Engineering,Engineering Multidisciplinary
dc.subject.keywordidentification of structural repeats
dc.subject.keywordProtein contact network
dc.subject.keywordProtein structure graph
dc.subject.keywordStructural repeat proteins
dc.description.note
dc.contributor.guideNita Parekh
dc.publisher.placeHyderabad
dc.publisher.universityInternational Institute of Information Technology, Hyderabad
dc.publisher.institutionBioinformatics
dc.date.registered25/07/2013
dc.date.completed2018
dc.date.awarded29/12/2018
dc.format.dimensions
dc.format.accompanyingmaterialNone
dc.source.universityUniversity
dc.type.degreePh.D.
Appears in Departments:Bioinformatics

Files in This Item:
File Description SizeFormat 
appendix.pdfAttached File513.85 kBAdobe PDFView/Open
chapter1.pdf1.5 MBAdobe PDFView/Open
chapter2.pdf2.52 MBAdobe PDFView/Open
chapter3.pdf2.3 MBAdobe PDFView/Open
chapter4.pdf2.93 MBAdobe PDFView/Open
chapter5.pdf2.05 MBAdobe PDFView/Open
chapter6.pdf310.59 kBAdobe PDFView/Open
references.pdf398.99 kBAdobe PDFView/Open
startingpages.pdf639.45 kBAdobe PDFView/Open


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: