[SBK25]
Daqian Shao, Thomas Kleine Buening, Marta Kwiatkowska.
A Unifying Framework for Causal Imitation Learningwith Hidden Confounders.
In Workshop on Spurious Correlation and Shortcut Learning: Foundations and Solutions, Workshop at The International Conference on Learning Representations (ICLR) 2025.
2025.
[pdf]
[bib]
https://arxiv.org/abs/2502.07656
|
Available from:
https://arxiv.org/abs/2502.07656
|
Abstract.
We propose a general and unifying framework for causal Imitation Learning (IL) with hidden confounders that subsumes several existing confounded IL settings from the literature. Our framework accounts for two types of hidden confounders: (a) those observed by the expert, which thus influence the expert's policy, and (b) confounding noise hidden to both the expert and the IL algorithm. For additional flexibility, we also introduce a confounding noise horizon and time-varying expert-observable hidden variables. We show that causal IL in our framework can be reduced to a set of Conditional Moment Restrictions (CMRs) by leveraging trajectory histories as instruments to learn a history-dependent policy. We propose DML-IL, a novel algorithm that uses instrumental variable regression to solve these CMRs and learn a policy. We provide a bound on the imitation gap for DML-IL, which recovers prior results as special cases. Empirical evaluation on a toy environment with continues state-action spaces and multiple Mujoco tasks demonstrate that DML-IL outperforms state-of-the-art causal IL algorithms.
|