Sequential Controlled Sensing to Detect an Anomalous Process

Narayanaprasad, Karthik Periyapattana

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/429856

Title:	Sequential Controlled Sensing to Detect an Anomalous Process
Researcher:	Narayanaprasad, Karthik Periyapattana
Guide(s):	Sundaresan, Rajesh
Keywords:	Engineering Engineering and Technology Engineering Electrical and Electronic
University:	Indian Institute of Science Bangalore
Completed Date:	2021
Abstract:	In this thesis, we study the problem of identifying an anomalous arm in a multi-armed bandit as quickly as possible, subject to an upper bound on the error probability. Also known as odd arm identification, this problem falls within the class of optimal stopping problems in decision theory and can be embedded within the framework of active sequential hypothesis testing. Prior works on odd arm identification dealt with independent and identically distributed observations from each arm. We provide the first known extension to the case of Markov observations from each arm. Our analysis and results are in the asymptotic regime of vanishing error probability. We associate with each arm an ergodic discrete time Markov process that evolves on a common, finite state space. The notion of anomaly is that the transition probability matrix (TPM) of the Markov process of one of the arms (the {\em odd arm}) is some $P_{1}$, and that of each non-odd arm is a different $P_{2}$, $P_{2}\neq P_{1}$. A learning agent whose goal it is to find the odd arm samples the arms sequentially, one at any given time, and observes the state of the Markov process of the sampled arm. The Markov processes of the unobserved arms may either remain frozen at their last observed states until their next sampling instant ({\em rested arms}) or continue to undergo state evolution ({\em restless arms}). The TPMs $P_{1}$ and $P_{2}$ may be known to the learning agent beforehand or unknown. We analyse the following cases: (a) rested arms with TPMs unknown, (b) restless arms with TPMs known, and (c) restless arms with TPMs unknown. For each of these cases, we first present an asymptotic lower bound on the {\color{black} growth rate of the }expected time required to find the odd arm, and subsequently devise a sequential arm selection policy which we show achieves the lower bound and is therefore asymptotically optimal. A key ingredient in our analysis of the setting of rested arms is the observation that for the Markov process of each arm, the long term fr...
URI:	http://hdl.handle.net/10603/429856
Appears in Departments:	Electrical Communication Engineering

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	168.27 kB	Adobe PDF	View/Open
02_prelim pages.pdf		194.58 kB	Adobe PDF	View/Open
03_abstract.pdf		116.9 kB	Adobe PDF	View/Open
04_table of contents.pdf		138.4 kB	Adobe PDF	View/Open
05_chapter 1.pdf		199.87 kB	Adobe PDF	View/Open
06_chapter 2.pdf		519.24 kB	Adobe PDF	View/Open
07_chapter 3.pdf		547.03 kB	Adobe PDF	View/Open
08_chapter 4.pdf		603.67 kB	Adobe PDF	View/Open
09_annexure.pdf		88.05 kB	Adobe PDF	View/Open
80_recommendation.pdf		369.13 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET