Dynamic Backdoor Trigger Elimination using Model Contrastive Learning and Triple Marginal Loss

This repository contains an implementation for eliminating backdoor triggers embedded in images, particularly addressing poison label attacks such as Trojan, BadNets, and Blend. The solution is built upon Model Contrastive Learning and Triple Marginal Loss techniques. Additionally, the code implements a Dynamic Patching algorithm, enabling the model to train with different trigger patterns at runtime for enhanced robustness.

Features:

Model Contrastive Learning: Utilizes contrastive learning techniques to enhance the model's ability to discriminate between clean and poisoned data.
Triple Marginal Loss (TML): Implements TML for training robust models against poison label attacks. TML effectively minimizes the impact of backdoor triggers during model training.
Dynamic Patching Algorithm: Incorporates a dynamic patching algorithm, enabling the model to adapt to different trigger patterns during runtime. This enhances the model's resilience against evolving attack strategies.
K-Means Clustering for Pseudo-Label Generation: Employs K-Means clustering to generate pseudo-labels for training with TML. This helps in effectively identifying and mitigating the influence of poisoned data during training.

MCL++: Quick start with pretrained model

We have already uploaded the all2one pretrained backdoor model(i.e. gridTrigger WRN-16-1, target label 5).

Install the requirements using the following command:

$ pip install -r requirements.txt

For evaluating the performance of MCL++, you can easily run command:

$ python MCL++_defense.py

where the default parameters are shown in config.py.

The trained model will be saved at the path weight/<name>.tar

Please carefully read the MCL++_defense.py and configs.py, then change the parameters for your experiment.

Training your own backdoored model

We have provided a DatasetBD Class in data_loader.py for generating training set of different backdoor attacks.

For implementing backdoor attack(e.g. GridTrigger attack), you can run the below command:

$ python train_badnet.py

This command will train the backdoored model and print clean accuracies and attack rate. You can also select the other backdoor triggers like Grid, Square etc as defined in the data_loader.py class.

Acknowledgements

Much of the code in this repository was adapted from code in this paper by Zhihao et al.

Other source of backdoor attacks

Attack

CL: Clean-label backdoor attacks

SIG: A New Backdoor Attack in CNNS by Training Set Corruption Without Label Poisoning

Paper

WaNet: WaNet-Imperceptible Warping-based Backdoor Attack.

Defense

Fine-tuning && Neural Attention Distillation: Erasing Backdoor Triggers from Deep Neural Networks

I-BAU: Adversarial Unlearning of Backdoors via Implicit Hypergradient.

Neural Cleanse: Identifying and Mitigating Backdoor Attacks in Neural Networks

Library

Note: TrojanZoo provides a universal pytorch platform to conduct security researches (especially backdoor attacks/defenses) of image classification in deep learning.

Backdoors 101 — is a PyTorch framework for state-of-the-art backdoor defenses and attacks on deep learning models.

BackdoorBox — is a Python toolbox for backdoor attacks and defenses.

Contacts

If you have any questions, leave a message below with GitHub.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
data/CIFAR10/cifar-10-batches-py		data/CIFAR10/cifar-10-batches-py
trigger		trigger
.gitattributes		.gitattributes
.gitignore		.gitignore
MCL++_defense.py		MCL++_defense.py
README.md		README.md
config.py		config.py
data_loader.py		data_loader.py
inversion_torch.py		inversion_torch.py
select_dataset.py		select_dataset.py
train_badnet.py		train_badnet.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dynamic Backdoor Trigger Elimination using Model Contrastive Learning and Triple Marginal Loss

Features:

MCL++: Quick start with pretrained model

Training your own backdoored model

Acknowledgements

Other source of backdoor attacks

Attack

Defense

Library

Contacts

About

Releases

Packages

Languages

Isvarya12/MCLplusplus_strengthening-backdoor-defenses-against-backdoor-poisoning-attacks

Folders and files

Latest commit

History

Repository files navigation

Dynamic Backdoor Trigger Elimination using Model Contrastive Learning and Triple Marginal Loss

Features:

MCL++: Quick start with pretrained model

Training your own backdoored model

Acknowledgements

Other source of backdoor attacks

Attack

Defense

Library

Contacts

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages