Reward, punishment and internal expectation for training the random neural network with reinforcement

1998-01-01
The reinforcement learning scheme proposed in [8] for random neural networks [5] is based on reward and performs well for a stationary environment. However, when the environment is not stationary extinction becomes an important problem to be considered. In this paper, the reinforcement learning scheme is extended by introducing a weight update rule that takes into consideration the internal expectation of reinforcement. Such a scheme has made extinction possible while resulting in a good convergence to the most rewarding action.
Citation Formats
U. Halıcı, “Reward, punishment and internal expectation for training the random neural network with reinforcement,” Antalya, Turkey, 1998, vol. 53, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/52892.