This repository is the official PyTorch implementation of paper: Adversarial Rademacher Complexity of Deep Neural Networks.
Jiancong Xiao, Yanbo Fan, Ruoyu Sun, Zhi-Quan Luo
arXiv: https://arxiv.org/abs/2211.14966
Deep neural networks (DNNs) are vulnerable to adversarial attacks. Ideally, a robust model shall perform well on both the perturbed training data and the unseen perturbed test data. It is found empirically that fitting perturbed training data is not hard, but generalizing to perturbed test data is quite difficult. Therefore, it is of great interest to study the generalization guarantee from a learning theory perspective and this paper focuses on adversarial Rademacher complexity (ARC), which is concurrently introduced by (yin et al., 2018,khim et al., 2018). Their work mainly focused on linear functions and both pointed out that how to bound ARC of neural networks is largely unclear. Since then, there have been several attempts and the latest result only applied ARC to two-layer neural networks. The main challenge comes from analyzing the adversarial loss. In this paper, we resolve the problem and provide the first bound of adversarial Rademacher complexity of deep neural networks. Our bound is qualitatively comparable to the Rademacher complexity bounds in similar settings. The key ingredient is a bound on the changes in the output of max functions with respect to the change of their weights and inputs, thereby bounding ARC. Finally, we provide experiments to study the poor robust generalization. Our experiments show that the weight norm is an important factor related to the robust generalization gap.
Our main result states an upper bound of adversarial Rademacher complexity. For depth-l, width-h fully-connected neural networks, assume that the weight matrices
we show the experiments of standard and adversarial training VGG on CIFAR-10 and CIFAR-100. The y-axis is in logarithmic scale. The weight norms of adv-trained models are larger than that of std-trained models, i.e.,
The code of calculating the norms is adopted from Neyshabur et al., 2017.
@article{xiao2022adversarial,
title={Adversarial rademacher complexity of deep neural networks},
author={Xiao, Jiancong and Fan, Yanbo and Sun, Ruoyu and Luo, Zhi-Quan},
journal={arXiv preprint arXiv:2211.14966},
year={2022}
}