DQN - target values vs action values?
-
02-11-2019 - |
Pergunta
I'm trying to understand the difference between target-values and action-values in Deep Q Networks.
From what I understand, action-value tries to approximate the reward of a given action (at some state). The target-value is also an approximate of the reward. How are they different?
Reference
- https://storage.googleapis.com/deepmind-media/dqn/DQNNaturePaper.pdf
Nenhuma solução correta
Licenciado em: CC-BY-SA com atribuição
Não afiliado a datascience.stackexchange