Pankayaraj Pathmanathan

Pankayaraj Pathmanathan
University of Peradeniya | UOP · Department of Computer Engineering

Bachelor of Science

About

8
Publications
1,187
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
7
Citations
Citations since 2017
8 Research Items
7 Citations
20172018201920202021202220230.00.51.01.52.02.53.0
20172018201920202021202220230.00.51.01.52.02.53.0
20172018201920202021202220230.00.51.01.52.02.53.0
20172018201920202021202220230.00.51.01.52.02.53.0
Introduction
Skills and Expertise

Publications

Publications (8)
Preprint
Full-text available
In recent years, extensive research has been carried out in the field of autonomous aerial vehicle control, motivated by the rapid advancements in Machine Learning (ML). In particular , Reinforcement Learning (RL) has gained immense interest in developing control algorithms. In this work, we examine key control problems related to the operation of...
Preprint
Full-text available
In multi-agent systems, an agent's behaviour is affected by the dynamicity of the environment and that of the interactions among agents. Thus, in learning to cooperate to solve complex tasks, such considerations should be taken into account. Reinforcement learning has gained immense interest in this line of research as it allows agents to learn use...
Preprint
Full-text available
This paper considers a multi-armed bandit (MAB) problem in which multiple mobile agents receive rewards by sampling from a collection of spatially dispersed stochastic processes, called bandits. The goal is to formulate a decentralized policy for each agent, in order to maximize the total cumulative reward over all agents, subject to option availab...
Conference Paper
Full-text available
This paper proposes a novel policy for a group of agents to, individually as well as collectively, solve a multi armed bandit (MAB) problem. The policy relies solely on the information that an agent has obtained through sampling of the options on its own and through communication with neighbors. The option selection policy is based on an Upper Conf...
Preprint
Full-text available
Sleep apnea is a breathing disorder where a person repeatedly stops breathing in sleep. Early detection is crucial for infants because it might bring long term adversities. The existing accurate detection mechanism (pulse oximetry) is a skin contact measurement. The existing non-contact mechanisms (acoustics, video processing) are not accurate enou...
Preprint
Full-text available
Existing studies of the Multi Agent Multi Armed Bandit (MAMAB) problem, with the exception of a very few, consider the case where the agents observe their neighbors according to a static network graph. They also mostly rely on a running consensus for the estimation of the option rewards. Two of the exceptions consider a problem where agents observe...

Network

Cited By