Distributionally robust control for partially observable linear systems (TAC'24)
Distributionally robust differential dynamic programming (LCSS-CDC'23)
Distributionally robust optimization with unscented transform for learning-based MPC (ICRA'23)
Distributionally robust risk map for learning-based motion planning and control (T-RO'23)
Approximate Thompson sampling for learning LQR with O(\sqrt{T}) regret (L4DC'25 Oral)
Task-relevant loss functions in meta-RL (L4DC'24)
Infusing MPC into meta-RL (RAL-IROS'22)
Hamilton-Jacobi deep Q-learning in continuous time (JMLR'21)
Anderson Acceleration for POMDPs (Automatica'24)
Unified Nesterov's accelerated gradient methods (ICML'23 Oral)
Riemannian Nesterov accelerated gradient methods (ICML'22)