arxiv:2604.19698
Michal Valko
AI & ML interests
large language models, reasoning, fine-tuning, test-time computation, reinforcement learning with human feedback, world models
Recent Activity
authored a paper about 9 hours ago
Budgeted Online Influence Maximization authored a paper about 9 hours ago
Planning in entropy-regularized Markov decision processes and games authored a paper about 9 hours ago
On two ways to use determinantal point processes for Monte Carlo integration