Improving Robustness of Deep Reinforcement Learning Systems

Gupta, Surbhi

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/363195

Title:	Improving Robustness of Deep Reinforcement Learning Systems
Researcher:	Gupta, Surbhi
Guide(s):	Singal, Gaurav and Garg, Deepak
Keywords:	Computer Science Computer Science Information Systems Engineering and Technology Reinforcement Learning
University:	Bennett University
Completed Date:	2021
Abstract:	Reinforcement Learning (RL), a branch of machine learning, is used to solve problems that cannot be dealt with supervised and unsupervised learning techniques. In RL, the actor interacts with the environment and learns by getting a reward signal in a trial-and-error fashion. Reinforcement learning is scaled to deep reinforcement learning (DRL) to handle huge state or action space-based problems. In DRL, deep learning models are used for approximation, auto feature extraction, and generalisation across unvisited states and unexplored actions. Though deep reinforcement learning models are generalisable, they may perform catastrophic actions due to noise in the environment or the perceived state. Incorporating robustness to DRL models is of great importance as when these models are deployed to the real systems, may cause irrelevant behaviour due to noisy scenario. Hence, it would cause hardware damage with increased cost and decreased reliability. This thesis presents a study on deep reinforcement learning that covers applications of DRL in different industry verticals, the evolution of DRL, insight of designing Markov decision process (MDP) for various problems, usable simulation tools to apply DRL, and a list of challenges with future direction. We have also assembled a sensor-enabled robot to find the problem (dimensionality perturbation) and considered the robustness aspect of DRL for applications such as industrial control, autonomous driving, autonomous flight, and planetary exploration. These applications motivate us to consider the robustness aspect as a wrong decision in a noisy state will incur a huge cost.
Pagination:
URI:	http://hdl.handle.net/10603/363195
Appears in Departments:	School of Computer Science Engineering and Technology

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	173.2 kB	Adobe PDF	View/Open
02_table of contents.pdf		101.55 kB	Adobe PDF	View/Open
03_declaration.pdf		127.79 kB	Adobe PDF	View/Open
04_certificate.pdf		133.01 kB	Adobe PDF	View/Open
05_acknowledgement.pdf		99.33 kB	Adobe PDF	View/Open
06_abstract.pdf		100.26 kB	Adobe PDF	View/Open
07_list of acronyms.pdf		127.53 kB	Adobe PDF	View/Open
08_list of figures.pdf		120.61 kB	Adobe PDF	View/Open
09_list of tables.pdf		118.34 kB	Adobe PDF	View/Open
10_chapter 1.pdf		209.03 kB	Adobe PDF	View/Open
11_chapter 2.pdf		1.34 MB	Adobe PDF	View/Open
12_chapter 3.pdf		512.94 kB	Adobe PDF	View/Open
14_chapter 5.pdf		1.02 MB	Adobe PDF	View/Open
15_chapter 6.pdf		2.01 MB	Adobe PDF	View/Open
16_chapter 7.pdf		105.47 kB	Adobe PDF	View/Open
17_references.pdf		229.43 kB	Adobe PDF	View/Open
18_appendix.pdf		169.76 kB	Adobe PDF	View/Open
80_recommendation.pdf		278.23 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET