Audio Deepfake Classification

This project focuses on building a deep learning model for classifying audio files as either genuine (bonafide) or manipulated (spoof). The objective is to detect audio deepfakes, which are manipulated audio recordings designed to impersonate a genuine audio source. The ASVspoof 2019 dataset is used for training and evaluating the model.

Project Overview

Data: ASVspoof 2019 dataset containing genuine and spoof audio recordings.
Preprocessing: Convert audio files to Mel spectrograms, augment training data.
Model Architecture: Convolutional Neural Network (CNN) with classification layers.
Training: Binary cross-entropy loss, Adam optimizer, monitoring metrics.
Evaluation: Accuracy, F1 score, ROC curve, AUC.
Visualization: Model architecture visualization using plot_model and Netron.

Model Architecture

The model architecture is designed to extract features from Mel spectrograms and make predictions for audio deepfake classification.

Convolutional Layer: Extracts local features from the Mel spectrogram using convolutional filters.
MaxPooling Layer: Performs downsampling to reduce spatial dimensions.
Batch Normalization: Normalizes activations to stabilize training.
ReLU Activation: Introduces non-linearity to the model.
Dropout Layer: Prevents overfitting by deactivating neurons randomly during training.
Global Average Pooling Layer: Aggregates feature maps for global information.
Dense Layer: Performs classification with a sigmoid activation function.

Metrics

Getting Started

Installation

To use this project, follow these steps:

Clone the repository:

   git clone https://github.com/sksmta/audio-deepfake-detection.git
   cd audio-deepfake-detection

Download the ASVspoof 2019 dataset:

Download the dataset from here and extract it into the dataset directory.

Contribution

Contributions are welcome and greatly appreciated. To contribute to this project, follow these steps:

Fork the repository to your own GitHub account.
Clone the forked repository to your local machine:

  git clone https://github.com/sksmta/audio-deepfake-detection.git
  cd audio-deepfake-detection

Create a new branch for your contribution:

   git checkout -b feature/your-feature-name

Make your changes, improvements, or bug fixes.
Commit your changes with a meaningful commit message:

git commit -m "Add your commit message here"

Push your changes to your GitHub repository:

git push origin feature/your-feature-name

Open a pull request on the original repository's main branch. Provide a clear description of your contribution.
Your pull request will be reviewed, and any necessary feedback will be given. Once approved, your contribution will be merged into the main project.

Thank you for your valuable contributions to make this project even better!

Acknowledgments

ASVspoof 2019 dataset: Download
Netron: GitHub Repository

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.vscode		.vscode
TestEvaluation		TestEvaluation
eval		eval
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
audio_classifier.h5		audio_classifier.h5
main.ipynb		main.ipynb
model_architecture.png		model_architecture.png
test_eval.txt		test_eval.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Audio Deepfake Classification

Project Overview

Model Architecture

Metrics

Getting Started

Installation

Contribution

Acknowledgments

About

Releases

Packages

Languages

License

sksmta/audio-deepfake-detection

Folders and files

Latest commit

History

Repository files navigation

Audio Deepfake Classification

Project Overview

Model Architecture

Metrics

Getting Started

Installation

Contribution

Acknowledgments

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages