DQN - target values vs action values?

https://datascience.stackexchange.com/questions/56116

reinforcement-learning
deep-learning
q-learning

02-11-2019

Question

I'm trying to understand the difference between target-values and action-values in Deep Q Networks.

From what I understand, the action-value tries to approximate the reward of a given action (at some state). The target-value is also an approximation of the reward. How are they different?
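To make the two terms concrete, here is a minimal sketch of how they appear in the loss of the referenced DQN paper: the action-value is the online network's current estimate Q(s, a; θ), while the target-value is y = r + γ · max over a' of Q(s', a'; θ⁻), computed with a periodically copied target network. The tiny linear Q-function, the weights, and the transition numbers below are assumptions made up for illustration and are not taken from the paper.

```python
# Toy stand-in for the deep network: Q(s, a; w) = dot(w[a], s).
# All weights and transition values below are hypothetical.

def q_values(weights, state):
    """Return the Q-value of every action in `state` for the given weights."""
    return [sum(w * s for w, s in zip(w_a, state)) for w_a in weights]

def action_value(online_weights, state, action):
    """Q(s, a; theta): the online network's estimate for the action taken."""
    return q_values(online_weights, state)[action]

def target_value(target_weights, reward, next_state, gamma, done):
    """y = r + gamma * max_a' Q(s', a'; theta^-); just r at a terminal state."""
    if done:
        return reward
    return reward + gamma * max(q_values(target_weights, next_state))

# Online parameters (trained every step) and target parameters
# (an older copy of the online parameters, refreshed only occasionally).
online_weights = [[0.5, 0.0], [0.0, 1.0]]
target_weights = [[0.4, 0.0], [0.0, 0.8]]

# One hypothetical transition (s, a, r, s').
state, action, reward, next_state = [1.0, 2.0], 1, 1.0, [2.0, 1.0]
gamma = 0.9

q_sa = action_value(online_weights, state, action)              # prediction
y = target_value(target_weights, reward, next_state, gamma, False)  # regression target
loss = (y - q_sa) ** 2  # squared TD error that the online network minimises

print(q_sa, y, loss)
```

In this sketch both quantities estimate the same thing (the expected discounted return, not the immediate reward), but the action-value is the prediction being trained, while the target-value is the bootstrapped label it is regressed toward, held fixed with respect to the online parameters.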

Reference
- https://storage.googleapis.com/deepmind-media/dqn/DQNNaturePaper.pdf

No correct solution

License: CC-BY-SA with attribution

Not affiliated with datascience.stackexchange