Learning curves on MuJoCo tasks with the mean (solid line) and standard deviation (shaded area) across 5 runs.

Learning curves on MuJoCo tasks with the mean (solid line) and standard deviation (shaded area) across 5 runs.

Source publication
Preprint
Full-text available
Distributional Reinforcement Learning (RL) maintains the entire probability distribution of the reward-to-go, i.e. the return, providing more learning signals that account for the uncertainty associated with policy performance, which may be beneficial for trading off exploration and exploitation and policy learning in general. Previous works in dis...

Context in source publication

Context 1
... amounts of variance displayed throughout training for both algorithms may be due to that they both involve adversarial training. As shown in Figure 3, however, our model outperforms the benchmark in all cases with distinct margins. We believe this is because WGAN does not take expectations across an amortised inference space that accounts for better generalisation. ...

Similar publications

Article
Full-text available
This study aimed at providing information to help HR practitioners understand the uncertainties caused by COVID-19 by addressing questions on what certainties are faced by HR practitioners in the education sector; what factors are seen as stressors, what characteristics need to be developed, and what solutions are proposed to overcome uncertainties...
Conference Paper
Full-text available
Numerous blockchain networks are inherently closed systems, lacking the ability to communicate with other networks. This inherent design limitation results in restricted interoperability for many blockchain networks. Traditionally, obtaining an asset on a different chain necessitated the sale of an asset on one chain and the acquisition of a corres...
Article
Full-text available
Color has a crucial impact on students’ perception. It encourages the learning atmosphere to be affiliated with the anticipated learning outcomes. The purpose of this study is to investigate the impacts of contextual colors on student’s perception of interior spaces and to validate previous related studies that emphasize on colors as a media to...
Thesis
Full-text available
The purpose of this research is to investigate the relationship between the quality of teacher-child interactions and the development of spatial reasoning among four year old children in full time kindergarten in a disadvantaged environment. The sample is made up on one hand of 415 children data (215 girls, 200 boys) aged 58.29 months (SD=4.93), an...