Categories recording Markov Decision Processes with Ordinal Rewards: Reference Point-Based Preferences Post date June 20, 2011 Watch on videolectures.net Paul WengMarkov Decision Processes with Ordinal Rewards: Reference Point-Based Preferences ← Algorithms for Solving the Multiagent Simple Temporal Problem → Controlling Robots Across Intermediate Time Delays