Figure 1 - uploaded by Vladimir Andreevich Marochko
Content may be subject to copyright.
Source publication
Catastrophic forgetting has a serious impact in reinforcement learning, as the data distribution is
generally sparse and non-stationary over time. The purpose of this study is to investigate whether
pseudorehearsal can increase performance of an actor-critic agent with neural-network based policy
selection and function approximation in a pole balan...
Similar publications
Catastrophic forgetting has a significant negative impact in reinforcement learning. The purpose of this study is to investigate how pseudorehearsal can change performance of an actor-critic agent with neural-network function approximation. We tested agent in a pole balancing task and compared different pseudorehearsal approaches. We have found tha...
Catastrophic forgetting has a significant negative impact in reinforcement learning. The purpose of this study is to investigate how pseudorehearsal can change performance of an actor-critic agent with neural-network function approximation. We tested agent in a pole balancing task and compared different pseudorehearsal approaches. We have found tha...
Citations
... Actor-critics can also learn an explicitly stochastic policy which is very useful in continuous learning problems." [8]. The actor receives the current state from the environment and performs an action depending on its policy. ...
Artificial intelligence (AI) has an increasing influence in the manufacturing and processing industries. An increasingly interesting area of application is the design of controllers for technical systems through reinforcement learning. This allows the design of a controller with reduced effort for human experts, which is normally involved in the design process, for example by eliminating the determination of suitable controller parameters. In this paper a neural controller for a rotational inverted pendulum is developed by using artificial neural networks in combination with reinforcement learning. The rotational inverted pendulum is a classic example of a nonlinear and unstable system and has been little covered in previous publications on this subject. Furthermore, the behaviour of the neural controller is compared with a conventional controller when swinging up and balancing a rotational inverted pendulum.