The discount factor, represented by the Greek letter gamma, is a scalar value between zero and one that determines the present value of future rewards. In reinforcement learning, an agent aims to maximize the cumulative return, which is the sum of rewards collected over time. The mathematical role of the discount factor is to multiply future rewards by increasing powers of gamma, expressed as the sum of gamma raised to....
Log in to view the answer