Question

When I use a Q-table to store state-action values in reinforcement learning, some states never (or rarely) occur, so their state-action values remain zero until the maximum number of iterations is reached. I have therefore decided to estimate the Q-values online with a neural network instead of using a Q-table.

Which type of neural network can estimate this kind of problem most accurately, and will this approach help?

Solution

I used Locally Weighted Regression (LWR) as the function approximator and simply replaced the table with this function.
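
A minimal sketch of how LWR might stand in for the table, assuming a numeric state vector, an action encoded as a number, a Gaussian kernel, and a small ridge term for numerical stability. The names LWRQFunction, bandwidth, ridge, and td_target are hypothetical and chosen only for illustration; the original answer does not give implementation details.

```python
import numpy as np


class LWRQFunction:
    """Estimates Q(s, a) from stored samples with locally weighted linear regression."""

    def __init__(self, bandwidth=1.0, ridge=1e-6):
        self.bandwidth = bandwidth  # kernel width (assumed hyperparameter)
        self.ridge = ridge          # small regularizer to keep the solve stable
        self.X = []                 # stored inputs: state features plus action
        self.y = []                 # stored Q targets

    def add(self, state, action, target):
        """Store one (state, action, target) sample."""
        self.X.append(np.append(np.asarray(state, dtype=float), float(action)))
        self.y.append(float(target))

    def predict(self, state, action):
        """Fit a weighted linear model around the query and evaluate it there."""
        if not self.X:
            return 0.0  # same default as an untouched table entry
        q = np.append(np.asarray(state, dtype=float), float(action))
        X = np.asarray(self.X)
        y = np.asarray(self.y)
        # Gaussian kernel: samples near the query get larger weights.
        d2 = np.sum((X - q) ** 2, axis=1)
        w = np.exp(-d2 / (2.0 * self.bandwidth ** 2))
        # Weighted least squares with a bias column.
        A = np.hstack([X, np.ones((len(X), 1))])
        AtW = A.T * w
        theta = np.linalg.solve(AtW @ A + self.ridge * np.eye(A.shape[1]), AtW @ y)
        return float(np.append(q, 1.0) @ theta)


def td_target(qf, reward, next_state, actions, gamma=0.9):
    """Standard Q-learning target, using the approximator instead of a table lookup."""
    return reward + gamma * max(qf.predict(next_state, a) for a in actions)
```

In a Q-learning loop, the table read Q[s, a] would become qf.predict(s, a), and the table write would become qf.add(s, a, td_target(qf, r, s_next, actions)). Because nearby samples influence the estimate, rarely visited states get a value from their neighbours rather than staying at zero. Treating the action as a numeric feature is an assumption; for unordered discrete actions, one model per action may fit better.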

License: CC-BY-SA with attribution
Not affiliated with StackOverflow