EU Horizon 2020
Horizon 2020
HomeNewsResearch ThemesPeopleKey Prior PublicationsPublicationsWorkshop
[SBK26] Daqian Shao, Thomas Kleine Buening and Marta Kwiatkowska. Causal Imitation Learning under Expert-Observable and Expert-Unobservable Confounding. In Proc. International Conference on Learning Representation (ICLR'26). To appear. 2026. [pdf]
Downloads:  pdf pdf (1.53 MB)
Notes: Preprint available from https://arxiv.org/abs/2502.07656v2.
Abstract. We propose a general framework for causal Imitation Learning (IL) with hidden confounders, which subsumes several existing settings. Our framework accounts for two types of hidden confounders: (a) variables observed by the expert but not by the imitator, and (b) confounding noise hidden from both. By leveraging trajectory histories as instruments, we reformulate causal IL in our framework into a Conditional Moment Restriction (CMR) problem. We propose DML-IL, an algorithm that solves this CMR problem via instrumental variable regression, and upper bound its imitation gap. Empirical evaluation on continuous state-action environments, including Mujoco tasks, demonstrates that DML-IL outperforms existing causal IL baselines.