According to Bellman’s Principle of Optimality, an optimal policy has the property that, whatever the initial state and initial decision are, the remaining decisions form an optimal policy for the state that results from the first decision. A policy is a sequence of decisions made across the stages of a process. A multi-stage decision process is a sequence of steps in which each choice influences the state of the system and the potential future rew....
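To make the principle concrete, here is a minimal Python sketch of a multi-stage decision process solved by recursing from the last stage back to the first. The three-stage table `STAGES`, its state names, actions, and reward values are all hypothetical, chosen only for illustration; the function `solve` is likewise an illustrative name, not part of any library.

```python
# Hypothetical 3-stage process (illustrative numbers only).
# STAGES[stage][state][action] = (immediate reward, next state)
STAGES = [
    {"A": {"left": (2, "B"), "right": (5, "C")}},
    {"B": {"left": (10, "D"), "right": (1, "E")},
     "C": {"left": (3, "D"), "right": (4, "E")}},
    {"D": {"stop": (1, None)}, "E": {"stop": (6, None)}},
]


def solve(stage, state):
    """Return (best total reward, optimal action sequence) from `state` at `stage`."""
    if stage == len(STAGES):
        # Past the final stage: nothing left to decide, no reward left to collect.
        return 0, []
    best_value, best_plan = float("-inf"), []
    for action, (reward, next_state) in STAGES[stage][state].items():
        # Value of the remaining decisions, solved optimally for the resulting state.
        future_value, future_plan = solve(stage + 1, next_state)
        total = reward + future_value
        if total > best_value:
            best_value, best_plan = total, [action] + future_plan
    return best_value, best_plan


value, plan = solve(0, "A")
print(value, plan)

# Principle of Optimality: after the first decision ("right", leading to C),
# the rest of the plan is itself optimal for the sub-problem starting at C.
_, tail_plan = solve(1, "C")
print(plan[1:] == tail_plan)
```

In this toy example the greedy first step is not what matters: each candidate action is scored by its immediate reward plus the optimal value of the state it leads to, which is exactly the structure the principle describes.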