We consider a discrete time Markov Decision Process with infinite horizon. The criterion to be maximized is the sum of a number of standard discounted rewards, each with a different discount factor.
Priya Kartik is an executive coach and leadership expert with decades of experience enhancing the success factor of teams and organizations. Starting the clock on a critical project. Approving a ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results