Code of paper: "DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks"

Last update: Nov 10, 2022

Overview

DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks

Abstract: Adversarial training has been proven to be a powerful regularization method to improve generalization of models. In this work, a novel masked weight adversarial training method, DropAttack, is proposed for improving generalization potential of neural network models. It enhances the coverage and diversity of adversarial attack by intentionally adding worst-case adversarial perturbations to both the input and hidden layers and randomly masking the attack perturbations on a certain proportion weight parameters. It then improves the generalization of neural networks by minimizing the internal adversarial risk generated by exponentially different attack combinations. Further, the method is a general technique that can be adopted to a wide variety of neural networks with different architectures. To validate the effectiveness of the proposed method, five public datasets were used in the fields of natural language processing (NLP) and computer vision (CV) for experimental evaluating. This study compared DropAttack with other adversarial training methods and regularization methods. It was found that the proposed method achieves state-of-the-art performance on all datasets. In addition, the experimental results of this study show that DropAttack method can achieve similar performance when it uses only a half training data required in standard training. Theoretical analysis revealed that DropAttack can perform gradient regularization at random on some of the input and weight parameters of the model. Further, visualization experiments of this study show that DropAttack can push the minimum risk of the neural network model to a lower and flatter loss landscapes.

For technical details and additional experimental results, please refer to our paper:

“DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks”

Experimental results:

DropAttack indeed selects flatter loss landscapes via masked adversarial perturbations.

[The code of loss visualization]

Citation

@article{ni2021dropattack,
  title={DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks},
  author={Ni, Shiwen and Li, Jiawen and Kao, Hung-Yu},
  journal={arXiv preprint arXiv:2108.12805},
  year={2021}
}

Requirements

pytorch
pandas
numpy
nltk
sklearn
torchtext

Please star it, thank you! :）

Code of paper: "DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks"

Related tags

Overview

DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks

For technical details and additional experimental results, please refer to our paper:

Experimental results:

Citation

Requirements

Please star it, thank you! :）

Owner

倪仕文 (Shiwen Ni)

ACL'2021: LM-BFF: Better Few-shot Fine-tuning of Language Models

Code to replicate the key results from Exploring the Limits of Out-of-Distribution Detection

Tensorflow Tutorials using Jupyter Notebook

PyTorch Implementation of SSTNs for hyperspectral image classifications from the IEEE T-GRS paper "Spectral-Spatial Transformer Network for Hyperspectral Image Classification: A FAS Framework."

Official PyTorch implementation of Data-free Knowledge Distillation for Object Detection, WACV 2021.

Rethinking Nearest Neighbors for Visual Classification

Deep Learning Based EDM Subgenre Classification using Mel-Spectrogram and Tempogram Features"

Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a keras CNN model and openCV.

Code repository for the paper: Hierarchical Kinematic Probability Distributions for 3D Human Shape and Pose Estimation from Images in the Wild (ICCV 2021)

Face recognition system using MTCNN, FACENET, SVM and FAST API to track participants of Big Brother Brasil in real time.

a morph transfer UGATIT for image translation.

Codes for “A Deeply Supervised Attention Metric-Based Network and an Open Aerial Image Dataset for Remote Sensing Change Detection”

Exemplo de implementação do padrão circuit breaker em python

Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit

Residual Pathway Priors for Soft Equivariance Constraints

This repo is developed for Strong Baseline For Vehicle Re-Identification in Track 2 Ai-City-2021 Challenges

Code for our paper A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization,

PyTorch implementation of Weak-shot Fine-grained Classification via Similarity Transfer

Dilated Convolution for Semantic Image Segmentation

Video Matting Refinement For Python