Description
Towards Minimax Optimality of Model-based Robust Reinforcement Learning Pierre Clavier, Erwan Le Pennec, Matthieu Geist https://openreview.net/pdf?id=mcmMbWLkfQ
Optimistic Regret Bounds for Online Learning in Adversarial Markov Decision Processes Sang Bin Moon, Abolfazl Hashemi https://openreview.net/pdf?id=tdz5SyQ2CX
Group Fairness in Predict-Then-Optimize Settings for Restless Bandits Shresth Verma, Yunfan Zhao, Sanket Shah, Niclas Boehmer, Aparna Taneja, Milind Tambe https://openreview.net/pdf?id=GJlZbpLWX3
Recursively-Constrained Partially Observable Markov Decision Processes Qi Heng Ho, Tyler Becker, Benjamin Kraske, Zakariya Laouar, Martin S. Feather, Federico Rossi, Morteza Lahijanian, Zachary N Sunberg https://openreview.net/pdf?id=cC2c4KhHni