Skip to content

Generating adversarial examples for classification tasks with a SAT solver

Notifications You must be signed in to change notification settings

owenps/Adversarial_Generator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Generating Adversarial Examples with a SAT Solver

Adversarial examples are specialized inputs created with the intension of confusing a neural network classifier, resulting in a misprediction of the given image.

Often the changes to these inputs are indistinguishable to humans, but still cause the model to mispredict.

We can include these adversarial images back into our train set to help improve model perforamnce.

In this notebook we explore searching for adversarial examples with the Z3 SMT Solver as the core of our engine. The notebook includes two demos:

  1. Simple binary classifier with two features
  2. Multi-class classifier on the infamous MNIST dataset.

This project is for McGill University's Fall 2021 COMP597 Final Project.

References

Athalye, Anish. “Synthesizing Robust Adversarial Examples.” Arxiv.org, 7 June 2018, https://arxiv.org/pdf/1707.07397.pdf.

Pei, Kexin. “DeepXplore: Automated Whitebox Testing of Deep Learning Systems.” Arxiv.org, 24 Sept. 2017, https://arxiv.org/pdf/1705.06640.pdf.

About

Generating adversarial examples for classification tasks with a SAT solver

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published