Research Group

Web & User Data Processing

Publication

Authors: Cief, M., Kveton, B., Kompan, M.

Published in Proceedings of the AAAI Conference on Artificial Intelligence

Cross-Validated Off-Policy Evaluation

Cief, M., Kveton, B.¹, Kompan, M.

¹ Adobe Research

We study estimator selection and hyper-parameter tuning in off-policy evaluation. Although cross-validation is the most popular method for model selection in supervised learning, off-policy evaluation relies mostly on theory, which provides only limited guidance to practitioners. We show how to use cross-validation for off-policy evaluation. This challenges a popular belief that cross-validation in off-policy evaluation is not feasible. We evaluate our method empirically and show that it addresses a variety of use cases.

Cite: Cief, M., Kveton, B., & Kompan, M. (2025). Cross-validated off-policy evaluation. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 39, No. 15, pp. 16073-16081).

Authors

Matej Čief

Michal Kompan