Thomas Yang Kleine Büning
Publikasjoner
-
Kleine Büning, Thomas & Saha, Aadirupa (2023). ANACONDA: An Improved Dynamic Regret Algorithm for Adaptive Non-Stationary Dueling Bandits. Proceedings of Machine Learning Research (PMLR). ISSN 2640-3498.
-
Kleine Büning, Thomas; Dimitrakakis, Christos; Eriksson, Hannes; Grover, Divya & Jorge, Emilio (2023). Minimax-Bayes Reinforcement Learning. Proceedings of Machine Learning Research (PMLR). ISSN 2640-3498. 206.
Publisert
24. mars 2021 16:48
- Sist endret
24. mars 2021 16:48