How to implement clipping the reward in DQN in keras
How to implement clipping the reward in DQN in keras? especially how to implement clipping the reward?
Is this pseudo code correct:
if reward-threshold reward=-1
elseif rewardthreshold reward=1
elseif -thresholdrewardthreshold reward=reward/threshold
And if reward is always positive how we can change clipping the reward?
Topic dqn keras-rl training tensorflow deep-learning
Category Data Science