#

ai-security

Here are 58 public repositories matching this topic...

h4cker

The-Art-of-Hacking / h4cker

This repository is primarily maintained by Omar Santos (@santosomar) and includes thousands of resources related to ethical hacking, bug bounties, digital forensics and incident response (DFIR), artificial intelligence security, vulnerability research, exploit development, reverse engineering, and more.

Updated May 16, 2024
Jupyter Notebook

giskard

Giskard-AI / giskard

🐢 Open-Source Evaluation & Testing for LLMs and ML models

Updated May 30, 2024
Python

THUYimingLi / backdoor-learning-resources

A list of backdoor learning resources

machine-learning deep-learning ai-security backdoor-attacks backdoor-learning backdoor-defense

Updated May 17, 2024

jiep / offensive-ai-compilation

A curated list of useful resources that cover Offensive AI.

artificial-intelligence compilation adversarial-machine-learning ai-security offensive-ai

Updated May 12, 2024
HTML

jay-johnson / train-ai-with-django-swagger-jwt

Train AI (Keras + Tensorflow) to defend apps with Django REST Framework + Celery + Swagger + JWT - deploys to Kubernetes and OpenShift Container Platform

machine-learning jwt deep-neural-networks ai openshift tensorflow rest-api django-rest-framework swagger drf keras celery network-analysis network-security celery-tasks machine-learning-security ai-security anti-nex

Updated Nov 2, 2018
Python

RjDuan / AdvDrop

Code for "Adversarial attack by dropping information." (ICCV 2021)

pytorch adversarial-examples adversarial-attacks ai-security

Updated Jan 13, 2022
Python

CVPR_2019_PNI

elliothe / CVPR_2019_PNI

pytorch implementation of Parametric Noise Injection for adversarial defense

ai-security adversarial-defense

Updated Oct 23, 2019
Python

ZhengyuZhao / AI-Security-and-Privacy-Events

A curated list of academic events on AI Security & Privacy

adversarial-machine-learning adversarial-examples ai-security ai-privacy data-poisoning

Updated May 9, 2024

normster / llm_rules

RuLES: a benchmark for evaluating rule-following in language models

ai-safety ai-security gpt-4

Updated May 24, 2024
Python

YiZeng623 / I-BAU

Official Implementation of ICLR 2022 paper, ``Adversarial Unlearning of Backdoors via Implicit Hypergradient''

deep-learning adversarial-machine-learning adversarial-attacks ai-security backdoor-attacks backdoor-defense

Updated Nov 16, 2022
Jupyter Notebook

AnthenaMatrix / Image-Prompt-Injection

Image Prompt Injection is a Python script that demonstrates how to embed a secret prompt within an image using steganography techniques. This hidden prompt can be later extracted by an AI system for analysis, enabling covert communication with AI models through images.

ai cybersecurity ai-security prompt-engineering aisecurity prompt-injection prompt-injection-tool

Updated Mar 20, 2024
Python

ruoxi-jia-group / Narcissus

The official implementation of the CCS'23 paper, Narcissus clean-label backdoor attack -- only takes THREE images to poison a face recognition dataset in a clean-label way and achieves a 99.89% attack success rate.

adversarial-machine-learning adversarial-attacks ai-security backdoor-attacks deep- poisoning-attacks

Updated May 9, 2023
Python

mitre-atlas / atlas-data

ATLAS tactics, techniques, and case studies data

security machine-learning mitre-attack ai-security mitre-atlas

Updated Apr 29, 2024
Python

AnthenaMatrix / Website-Prompt-Injection

Website Prompt Injection is a concept that allows for the injection of prompts into an AI system via a website's. This technique exploits the interaction between users, websites, and AI systems to execute specific prompts that influence AI behavior.

security ai cybersecurity ai-security prompt-engineering aisecurity prompt-injection

Updated Mar 19, 2024
HTML

zhangzp9970 / MIA

Unofficial pytorch implementation of paper: Model Inversion Attacks that Exploit Confidence Information and Basic Countermeasures

machine-learning research deep-learning ai-security model-inversion-attacks

Updated Oct 6, 2023
Python

AnthenaMatrix / Prompt-Injection-Testing-Tool

The Prompt Injection Testing Tool is a Python script designed to assess the security of your AI system's prompt handling against a predefined list of user prompts commonly used for injection attacks. This tool utilizes the OpenAI GPT-3.5 model to generate responses to system-user prompt pairs and outputs the results to a CSV file for analysis.

ai prompt openai ai-security openai-api prompt-learning prompt-engineering prompting prompt-injection prompt-injection-tool ai-cyber-security

Updated Mar 21, 2024
Python

SEC-CAFE / handbook

安全手册，企业安全实践、攻防与安全研究知识库

ai-security security-wiki awesome-security llm-security security-handbook

Updated May 29, 2024
CSS

Hacking-Notes / VulnScan

Performing website vulnerability scanning using OpenAI technologie

hacking-tool vulnerability-scanners vulnerability-scanning ai-security chatgpt

Updated Apr 19, 2024
Python

Imperio

HKU-TASR / Imperio

[IJCAI 2024] Imperio is an LLM-powered backdoor attack. It allows the adversary to issue language-guided instructions to control the victim model's prediction for arbitrary targets.

ai-security backdoor-attacks llm

Updated Apr 17, 2024
Python

modzy / sdk-javascript

The official JavaScript SDK for the Modzy Machine Learning Operations (MLOps) Platform.

javascript kubernetes machine-learning microservices api-client api-rest model-serving explainable-ai production-machine-learning ai-security mlops drift-detection machine-learning-operations

Updated Sep 22, 2022
TypeScript

Improve this page

Add a description, image, and links to the ai-security topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ai-security topic, visit your repo's landing page and select "manage topics."