
Teodor Vanislavov Marinov
My main research interests are in the field of Theoretical Machine Learning. Recently, my research has focused on Reinforcement Learning with applications to compiler optimization and to making Large Language Models (LLMs) more factual. On the more theoretical side, I am interested in Bandit Problems, more efficient algorithms for Reinforcement Learning beyond worst-case settings, and understanding the emergent abilities of LLMs.
Authored Publications
Multiple-policy High-confidence Policy Evaluation
Mohammad Ghavamzadeh
International Conference on Artificial Intelligence and Statistics (2023), pp. 9470-9487