|
[MRR+26]
Anirban Majumdar, Ritam Raha, Rajarshi Roy, David Parker and Marta Kwiatkowska.
About Time: Model-free Reinforcement Learning with Timed Reward Machines.
Technical report 2512.17637, arxiv.
2026.
[pdf]
https://arxiv.org/abs/2512.17637
|
|
Available from:
https://arxiv.org/abs/2512.17637
|