Data constrained deep learning

Keshari, Rohit

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/566313

Title:	Data constrained deep learning
Researcher:	Keshari, Rohit
Guide(s):	Singh, Richa and Vatsa, Mayank
Keywords:	Computer Science Computer Science Artificial Intelligence Engineering and Technology
University:	Indraprastha Institute of Information Technology, Delhi (IIIT-Delhi)
Completed Date:	2024
Abstract:	Deep Neural Networks (DNNs) have achieved remarkable success across various machine learning and computer vision tasks, especially when abundant training samples are available. In Convolutional Neural Network (CNN) research, it has been established that a model s generalization capability improves with the combination of complex architectures, strong regularization, domain- specific loss functions, and extensive databases. However, training DNNs in environments with limited data remains a significant challenge, calling for attention from the research community. Many applications lack the requisite volume of data needed to train models effectively. Data constraint in this context is influenced by factors such as 1) a scarcity of domain experts, 2) long-tail distribution in large datasets, 3) insufficient domain-specific data, and 4) the challenge of mimicking human cognition and learning. The issues above are common challenges encountered while designing deep models, underscoring the importance of addressing Data Constrained Learning (DCL). This thesis investigates the formulation of deep learning strategies explicitly tailored for scenarios with DCL. The objective is to ensure that the training of numerous parameters does not adversely affect the model s ability to learn meaningful patterns, as this could elevate the risk of overfitting and result in suboptimal generalization performance. To address the DCL challenge, we introduce a novel strength parameter in deep learning named SSF-CNN, which concentrates on learning both the quotstructurequot and quotstrengthquot of filters. The filter structure is initialized using a dictionary-based filter learning algorithm, while the strength is learned under data-constrained settings. This architecture demonstrates adaptability, delivering robust performance even when used with small databases and consistently attaining high accuracy. We validate the effectiveness of our algorithm on databases such as MNIST, CIFAR10, and NORB, with varying training sample sizes. The results indicate
Pagination:	168 p.
URI:	http://hdl.handle.net/10603/566313
Appears in Departments:	Department of Computer Science and Engineering

Files in This Item:

File	Description	Size	Format
01-title.pdf	Attached File	54.64 kB	Adobe PDF	View/Open
02_prelim pages.pdf		424.52 kB	Adobe PDF	View/Open
03_content.pdf		99.43 kB	Adobe PDF	View/Open
04_abstract.pdf		67.6 kB	Adobe PDF	View/Open
05_chapter 1.pdf		2.17 MB	Adobe PDF	View/Open
06_chapter 2.pdf		1.85 MB	Adobe PDF	View/Open
07_chapter 3.pdf		1.5 MB	Adobe PDF	View/Open
08_chapter 4.pdf		1.64 MB	Adobe PDF	View/Open
09_chapter 5.pdf		380.81 kB	Adobe PDF	View/Open
10_annexures.pdf		555.26 kB	Adobe PDF	View/Open
80_recommendation.pdf		560.84 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET