Miruna Oprescu, Andrew Bennett, Nathan Kallus
(2024).
Low-rank MDPs with Continuous Action Spaces.
International Conference on Artificial Intelligence and Statistics (AISTATS).
Masatoshi Uehara, Haruka Kiyohara, Andrew Bennett, Victor Chernozhukov, Nan Jiang, Nathan Kallus, Chengchun Shi, Wen Sun
(2023).
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs.
Conference on Neural Information Processing Systems (NeurIPS).
Andrew Bennett, Nathan Kallus
(2023).
The Variational Method of Moments.
Journal of the Royal Statistical Society Series B: Statistical Methodology (JRSS:B).