FUN2MODEL Case Studies

FUN2MODEL Research Themes:

Robustness Guarantees for Probabilistic Neural Networks

Adversarial robustness for Gaussian processes models. Provable guarantees.

We study adversarial robustness for Gaussian process models, defined as invariance of the model’s decision to bounded perturbations, in contrast to distributional robustness. We develop a comprehensive theory, anytime algorithms and implementation based on branch-and-bound optimisation for computing provable guarantees of adversarial robustness of Gaussian process models, for both multi-class classification and regression. This involves computing lower and upper bounds on its prediction range. The image illustrates the working of the method, where a region R is refined into R1 and R2 to improve the bounds.

Probabilistic safety and reachability for Bayesian neural networks.

Since adversarial examples are arguably intuitively related to uncertainty, Bayesian neural networks (BNNs), i.e., neural networks with a probability distribution placed over their weights and biases, have the potential to provide stronger robustness properties. BNNs also enable principled evaluation of model uncertainty, which can be taken into account at prediction time to enable safe decision making. We study probabilistic safety for BNNs, defined as the probability that for all points in a given input set the prediction of the BNN is in a specified safe output set. In adversarial settings, this translates into computing the probability that adversarial perturbations of an input result in small variations in the BNN output, which represents a probabilistic variant of local robustness for deterministic neural networks.

We propose a framework based on relaxation techniques from non-convex optimisation (interval and linear bound propagation) for the analysis of probabilistic safety for BNNs with general activation functions and multiple hidden layers. We evaluate the methods on the VCAS autonomous aircraft controller. The image shows the geometry of VCAS (left), visualisation of ground truth labels (centre) and the computed safe regions (right).

Certified training for Bayesian neural networks.

We develop the first principled framework for adversarial training of Bayesian neural networks (BNNs) with certifiable guarantees, enabling applications in safety-critical contexts. We rely on techniques from constraint relaxation of nonconvex optimisation problems and modify the standard cross-entropy error model to enforce posterior robustness to worst-case perturbations in ϵ-balls around input points.

The plot shows the average certified radius for images from MNIST (right), and CIFAR-10 (left) using CNN-Cert. We observe that robust training with IBP (Interval Bound Propagation) roughly doubles the maximum verifiable radius compared with standard training and that obtained by training on PGD adversarial examples.

Adversarial robustness certification for Bayesian neural networks. Probabilistic and decision robustness.

We study the problem of certifying the robustness of Bayesian neural networks (BNNs) to adversarial input perturbations. We define two notions of robustness for BNNs in an adversarial setting: probabilistic robustness and decision robustness. Probabilistic robustness is the probability that for all points in a given input set T the output of a BNN sampled from the posterior is in a safe set S. On the other hand, decision robustness considers the optimal decision of a BNN and checks if for all points in T the optimal decision of the BNN for a given loss function lies within the output set S. Although exact computation of these robustness properties is challenging due to the probabilistic and non-convex nature of BNNs, we present a unified computational framework for efficiently and formally bounding them, and evaluate the effectiveness of our methods on various regression and classification tasks.

The image shows different training image resolutions on a training image sample from the PneumoniaMNIST dataset (left), for which we plot (right) computed lower bounds on decision robustness as we vary the resolution.

Probabilistic reach-avoid for Bayesian neural networks. Certifiable controller synthesis for learned BNN models.

Model-based reinforcement learning seeks to simultaneously learn the dynamics of an unknown stochastic environment and synthesise an optimal policy for acting in it. Ensuring the safety and robustness of sequential decisions made through a policy in such an environment is a key challenge for policies intended for safety-critical scenarios. In this work, we investigate two complementary problems: first, computing reach-avoid probabilities for iterative predictions made with dynamical models, with dynamics described by Bayesian neural network (BNN); second, synthesising control policies that are optimal with respect to a given reach-avoid specification (reaching a "target" state, while avoiding a set of "unsafe" states) and a learned BNN model. The computed lower bounds provide safety certification for the given policy and BNN model. We then introduce control synthesis algorithms to derive policies maximizing said lower bounds on the safety probability. We demonstrate the effectiveness of our method on a series of control benchmarks characterized by learned BNN dynamics models.

The images show the lower-bound reach-avoid probabilities for a BNN dynamics model as vary the depth from one to two and three later BNNs.

To know more about these models and analysis techniques, follow the links below.

Sort by: date, type, title

14 publications:

2025

[ZWG+25] Xiyue Zhang, Zifan Wang, Yulong Gao, Licio Romao, Alessandro Abate, Marta Kwiatkowska. Risk-Averse Certification of Bayesian Neural Networks. Technical report , arXiv:2411.19729 . Paper under submission. 2025. [pdf] [bib]

2024

[VSLK24] Jon Vadillo, Roberto Santana, Jose A. Lozano, Marta Kwiatkowska. Uncertainty-Aware Explanations Through Probabilistic Self-Explainable Neural Networks. Technical report , arXiv:2403.13740 . Paper under submission. 2024. [pdf] [bib] https://arxiv.org/abs/2403.13740
[WPL+24] Matthew Wicker, Andrea Patane, Luca Laurenti, Marta Kwiatkowska. Adversarial Robustness Certification for Bayesian Neural Networks. In Proc. 26th International Symposium on Formal Methods (FM'24 invited paper), Springer. To appear. 2024. [pdf] [bib] https://arxiv.org/abs/2306.13614
[WLP+24] Matthew Wicker, Luca Laurenti, Andrea Patane, Nicola Paoletti, Alessandro Abate, Marta Kwiatkowska.. Probabilistic Reach-Avoid for Bayesian Neural Networks. Artificial Intelligence. To appear in Artificial Intelligence. 2024. [pdf] [bib] https://arxiv.org/abs/2310.01951

2022

[Wic22] Matthew Wicker. Adversarial Robustness of Bayesian Neural Networks. Ph.D. thesis, Department of Computer Science, University of Oxford. 2022. [pdf] [bib]
[Fal22] Rhiannon Falconmore. On the Role of Explainability and Uncertainty in Ensuring Safety of AI Applications. Ph.D. thesis, Department of Computer Science, University of Oxford. 2022. [pdf] [bib]
[Kwi22] Marta Kwiatkowska. Robustness Guarantees for Bayesian Neural Networks. In Proc. 19th International Conference on Quantitative Evaluation of SysTems (QEST 2022). 2022. [pdf] [bib]
[PBL+22] Andrea Patane, Arno Blaas, Luca Laurenti, Luca Cardelli, Stephen Roberts and Marta Kwiatkowska. Adversarial Robustness Guarantees for Gaussian Processes. Journal of Machine Learning Research, 23, pages 1-55. 2022. [pdf] [bib]

2021

[WLP+21a] Matthew Wicker, Luca Laurenti, Andrea Patane, Nicola Paoletti, Alessandro Abate and Marta Kwiatkowska. Certification of Iterative Predictions in Bayesian Neural Networks. In 37th Conference on Uncertainty in Artificial Intelligence (UAI'21). May 2021. [pdf] [bib]
[WLP+21] Matthew Wicker, Luca Laurenti, Andrea Patane, Zhuotong Chen, Zheng Zhang and Marta Kwiatkowska. Bayesian Inference with Certifiable Adversarial Robustness. In International Conference on Artificial Intelligence and Statistics (AISTATS'21), PMLR. April 2021. [pdf] [bib]
[Pat21] Andrea Patane. On the Adversarial Robustness of Gaussian Processes. Ph.D. thesis, Department of Computer Science, University of Oxford. February 2021. [pdf] [bib]

2020

[BPL+20] Arno Blaas, Andrea Patane, Luca Laurenti, Luca Cardelli, Marta Kwiatkowska and Stephen Roberts. Adversarial Robustness Guarantees for Classification with Gaussian Processes. In 23rd International Conference on Artificial Intelligence and Statistics (AISTATS'20), PMLR. August 2020. [pdf] [bib]
[WLPK20] Matthew Wicker, Luca Laurenti, Andrea Patane and Marta Kwiatkowska. Probabilistic Safety for Bayesian Neural Networks. In 36th Conference on Uncertainty in Artificial Intelligence (UAI'20), PMLR. August 2020. [pdf] [bib]

2019

[CKLP19] Luca Cardelli, Marta Kwiatkowska, Luca Laurenti and Andrea Patane. Robustness Guarantees for Bayesian Inference with Gaussian Processes. In Proc. Thirty-Third AAAI Conference on Artificial Intelligence (AAAI'19). 2019. [pdf] [bib]

Sort by: date, type, title

« Overview