Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/363195
Title: Improving Robustness of Deep Reinforcement Learning Systems
Researcher: Gupta, Surbhi
Guide(s): Singal, Gaurav and Garg, Deepak
Keywords: Computer Science
Computer Science Information Systems
Engineering and Technology
Reinforcement Learning
University: Bennett University
Completed Date: 2021
Abstract: Reinforcement Learning (RL), a branch of machine learning, is used to solve problems that cannot be handled by supervised and unsupervised learning techniques. In RL, the actor interacts with the environment and learns by trial and error from a reward signal. Reinforcement learning is scaled to deep reinforcement learning (DRL) to handle problems with very large state or action spaces. In DRL, deep learning models are used for function approximation, automatic feature extraction, and generalisation across unvisited states and unexplored actions. Though deep reinforcement learning models are generalisable, they may take catastrophic actions due to noise in the environment or in the perceived state. Incorporating robustness into DRL models is of great importance because, when deployed to real systems, these models may behave inappropriately in noisy scenarios, which can cause hardware damage, increase cost, and decrease reliability. This thesis presents a study on deep reinforcement learning that covers applications of DRL in different industry verticals, the evolution of DRL, insights into designing a Markov decision process (MDP) for various problems, usable simulation tools for applying DRL, and a list of challenges with future directions. We have also assembled a sensor-enabled robot to identify the problem (dimensionality perturbation) and considered the robustness aspect of DRL for applications such as industrial control, autonomous driving, autonomous flight, and planetary exploration. These applications motivate the focus on robustness, since a wrong decision in a noisy state incurs a huge cost.
URI: http://hdl.handle.net/10603/363195
Appears in Departments: School of Computer Science Engineering and Technology

Files in This Item:
File                        Description     Size        Format
01_title.pdf                Attached File   173.2 kB    Adobe PDF
02_table of contents.pdf                    101.55 kB   Adobe PDF
03_declaration.pdf                          127.79 kB   Adobe PDF
04_certificate.pdf                          133.01 kB   Adobe PDF
05_acknowledgement.pdf                      99.33 kB    Adobe PDF
06_abstract.pdf                             100.26 kB   Adobe PDF
07_list of acronyms.pdf                     127.53 kB   Adobe PDF
08_list of figures.pdf                      120.61 kB   Adobe PDF
09_list of tables.pdf                       118.34 kB   Adobe PDF
10_chapter 1.pdf                            209.03 kB   Adobe PDF
11_chapter 2.pdf                            1.34 MB     Adobe PDF
12_chapter 3.pdf                            512.94 kB   Adobe PDF
14_chapter 5.pdf                            1.02 MB     Adobe PDF
15_chapter 6.pdf                            2.01 MB     Adobe PDF
16_chapter 7.pdf                            105.47 kB   Adobe PDF
17_references.pdf                           229.43 kB   Adobe PDF
18_appendix.pdf                             169.76 kB   Adobe PDF
80_recommendation.pdf                       278.23 kB   Adobe PDF


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
