Conference Paper

Lighting control system using an Actor - Critic type learning algorithm

Fac. of Life & Med. Sci., Doshisha Univ., Kyoto, Japan
DOI: 10.1109/NABIC.2010.5716381 Conference: Second World Congress on Nature & Biologically Inspired Computing, NaBIC 2010, 15-17 December 2010, Kitakyushu, Japan
Source: DBLP

ABSTRACT

A novel lighting control system using the Actor- Critic algorithm was developed, in which users can set the brightness of the system through sensory operation, such as “much brighter” or “slightly darker”. During development, this system must learn two states, i.e., the demands of the user and the brightness around the user. The Actor - Critic algorithm was applied for this purpose, and a simplified algorithm was developed. The effectiveness and usefulness of the proposed algorithm are discussed here through numerical simulations.

Full-text preview

Available from: ac.jp
  • [Show abstract] [Hide abstract]
    ABSTRACT: This study proposes the air conditioner (AC) control system based on users' sensations. The purpose is to realize the control system that improves low-performance ACs in terms of energy efficiency and comfortableness performance. The system consists of wireless sensor nodes and user nodes such as PCs and smartphones, and it is applicable to already installed ACs. The users enter their sensation such as cold, good, a little hot and very hot through the user node, and the system determines the appropriate control according to the users' sensations. The appropriate control policy is determined based on Q-Learning, which is a reinforcement learning method. For multiple users, several methods for integrating users' sensations are presented. These makes our proposed system applicable to a large number of users. Further, in order to reduce the energy consumption and the number of users' inputs, several types of reward functions are presented. In the simulation, four types of methods are evaluated in terms of the time needed for providing a comfortable environment and the energy consumption. We clarify the effective methods among them.
    No preview · Article · Jan 2015
  • [Show abstract] [Hide abstract]
    ABSTRACT: This study proposes the air conditioner (AC) control system based on users' sensations. The purpose of the system is to improve low-performance ACs in terms of energy efficiency and comfortableness performance. The system consists of wireless sensor nodes and user nodes such as PCs and smartphones, and it is applicable to already installed ACs. The users enter their sensation such as cold, good, a little hot and very hot through the user node, and the system determines the appropriate control according to the users' sensations. The control signal is transmitted via an equipped IR remote controller. The appropriate control policy is determined based on Q-Learning, which is a reinforcement learning method. In this study, we propose several types of methods and investigate effective methods in terms of energy efficiency and comfortableness performance. For multiple users, several methods for integrating users' sensations are presented. These make our proposed system applicable to a large number of users. Further, in order to reduce the energy consumption and the number of users' inputs, several types of reward functions are presented. In addition to the reward functions proposed in [8], in this paper, we propose a new reward function, which improves the performance for the large number of users. In the simulation, five types of methods including a new proposal in this paper, are evaluated in terms of the time needed for providing a comfortable environment and the energy consumption. We clarify the effective methods among them and the tendency of the scalability against the number of users.
    No preview · Article · Jan 2015 · IAENG International Journal of Computer Science