VoicePassport: Your Trusted Voice Authentication Solution 🎙️🔐

🔒🗣️ VoicePassport is a robust and secure voice authentication system designed to ensure the authenticity of users through their unique voiceprints. Powered by Resemblyzer, VoicePassport leverages advanced voice processing technology to generate voice embeddings, which are compact numerical representations of voice characteristics. These embeddings capture the distinctive features of an individual's voice in a highly accurate and secure manner.

🔍🔐 Using these voice embeddings, VoicePassport employs a similarity search mechanism to authenticate users. By comparing the voice embeddings extracted from an input voice sample with those stored in its database, VoicePassport can determine the likelihood of a match, thereby verifying the identity of the user.

💼💬 VoicePassport offers a reliable and efficient means of authentication, enabling seamless user access to various applications and services while ensuring a high level of security. With its innovative approach to voice-based authentication, VoicePassport provides a convenient and dependable solution for organizations seeking robust identity verification mechanisms.

🙏 I would like to extend my heartfelt gratitude to Karan Shingde for his insightful article published on Medium titled "Build an Audio-driven Speaker Recognition System Using Open Source Technologies: resemblyzer and pyAudioAnalysis". His comprehensive guide served as a significant source of inspiration and a crucial starting point for developing the VoicePassport Architecture project. Karan's expertise and dedication have been instrumental in shaping my understanding and implementation of speaker recognition technologies. I am truly thankful for his invaluable contribution to the field and for sharing his knowledge with the community. 🚀

✨ Features

🔒 Advanced Voice Authentication: VoicePassport harnesses the cutting-edge Resemblyzer technology to analyze and generate unique voice embeddings from user audio samples. These embeddings serve as the foundation for precise and trustworthy user authentication based on voice similarity.
⛓️ Blockchain-Powered Security: With blockchain integration, VoicePassport ensures unparalleled security and immutability in storing user authentication data. Each user's voice authentication details are securely hashed and recorded on the blockchain, establishing a tamper-proof ledger of user interactions.
💾 Efficient Vector Database: VoicePassport leverages a specialized vector database to effectively store and query voice embeddings derived from user audio samples. By employing advanced vector similarity search algorithms, VoicePassport facilitates rapid and accurate matching of voice patterns for seamless user authentication.
🚀 Streamlined Workflow Management: Empowered by Apache Airflow, VoicePassport streamlines the authentication process with robust workflow management capabilities. Through automated task orchestration, including audio processing, embedding generation, and database integration, VoicePassport ensures smooth and dependable operation.
👤 Intuitive User Experience: Designed for user convenience, VoicePassport offers a hassle-free authentication experience. Users can effortlessly enroll their voice profiles and authenticate themselves with a simple voice command, eliminating the complexity of traditional password-based methods.

More Details 📝

For comprehensive information about this project, check out this Medium article.

🛠️ Technologies Used

🎤 Resemblyzer: Advanced voice analysis tool for generating voice embeddings.
🔍 QDrant: Vector database for efficient storage and querying of voice embeddings.
🐍 Python: Programming language used for backend development.
🌐 Flask: Web framework for building the RESTful API.
🔗 Web3: Python library for interacting with Ethereum blockchain.
📝 Solidity: Programming language for writing smart contracts on the Ethereum blockchain.
🟣 Polygon PoS: Scalable Ethereum sidechain for fast and low-cost transactions.
🔬 Alchemy: An analytics platform that provides insights into blockchain transactions, allowing users to monitor and analyze transaction data on the Ethereum blockchain.
📦 MinIO: Object storage service for storing voice samples files.
🍃 MongoDB: NoSQL database for storing user metadata and authentication data.
🌀 Apache Airflow: Workflow management tool for automation of audio processing tasks and database integration.
🐳 Docker: Containerization platform for packaging VoicePassport application components.
🔀 HAProxy: Load balancer for distributing incoming traffic across multiple Docker containers.

Unveiling Architecture 🏛️

In the VoicePassport platform, various architectural components work in harmony to ensure seamless voice authentication and user management functionalities. Let's explore the purpose and role of each element:

Enrollment 🎤: Users enroll and register their voice profiles by providing audio samples. These audio samples are processed by Resemblyzer, an advanced voice analysis tool, to generate unique voice embeddings. These voice embeddings capture the distinctive characteristics of each user's voice accurately and securely. Once generated, these embeddings are stored in a database for later use in authentication.
Authentication 🔐: During authentication, users speak a passphrase, and their voice is compared against the stored voice embeddings using a vector similarity search algorithm. This process determines the likelihood of a match between the user's voice and the previously registered voice embeddings. If the match is sufficiently high, the user is successfully authenticated.
Blockchain Verification 🛡️: User authentication data, including voice embeddings and authentication results, undergoes cryptographic hashing and is recorded on the blockchain. This approach ensures the security and integrity of authentication data by providing an immutable and auditable record of all user interactions with the voice authentication system.
Apache Airflow Integration ⚙️: The entire authentication workflow, from audio processing to task management and workflow orchestration, is handled by Apache Airflow. This integration ensures the efficient execution of audio processing tasks, voice embedding generation, and blockchain integration. Additionally, it enables centralized monitoring and management of the authentication process, ensuring its reliability and scalability.

This architectural approach provides a comprehensive and robust solution for voice authentication, offering an optimal balance of security, efficiency, and user-friendliness for end-users.

Why Blockchain Verification? 🛡️

Blockchain verification is pivotal in ensuring the security, integrity, and transparency of the voice authentication system. Here's why it's essential, especially considering the implementation of the VoiceIDVerifier DApp:

Immutable Record: By recording user authentication data on the blockchain via the VoiceIDVerifier DApp, the system creates an immutable and tamper-proof record of all authentication transactions. This ensures that once authentication data is stored, it cannot be altered or deleted, providing a reliable audit trail of user interactions.
Enhanced Security: Through the VoiceIDVerifier DApp, user authentication data, including the hash of the user ID and the hash of the voice audio stored in MinIO, is cryptographically hashed and securely recorded on the blockchain. This robust security measure ensures that sensitive information remains protected from unauthorized access or tampering.
Transparency and Auditability: The decentralized nature of blockchain technology, facilitated by the VoiceIDVerifier DApp, enables transparent and auditable verification of user authentication data. Stakeholders can easily access and verify the authenticity of recorded transactions, fostering trust and transparency in the authentication process.
Decentralized Trust: The VoiceIDVerifier DApp eliminates the need for centralized authorities or intermediaries to verify user authentication data. Instead, trust is distributed across the network, with consensus mechanisms ensuring the accuracy and validity of recorded transactions. This decentralized trust model enhances the reliability and resilience of the authentication system.

By leveraging the capabilities of the VoiceIDVerifier DApp and blockchain technology, the voice authentication system achieves heightened security, transparency, and trustworthiness, ensuring a robust and reliable mechanism for authenticating user identities.

UML Diagram Explanation for VoiceIDVerifier DApp Deployed on Polygon PoS

The UML diagram provides an overview of the VoiceIDVerifier decentralized application (DApp) deployed on the Polygon Proof of Stake (PoS) blockchain network. This diagram illustrates the key components, interactions, and workflows involved in the authentication process within the DApp.

The Vector Database: A Core Element, Why QDrant? 📊

The vector database plays a crucial role in the voice authentication system, and choosing QDrant as the platform for its implementation offers several significant advantages. Below are some key reasons why QDrant is the ideal choice for managing the vector database in our system:

Scalability and Performance: QDrant is designed to handle large volumes of data and provide exceptional performance in high-load environments. Its distributed architecture and parallel processing capabilities ensure optimal scalability, enabling efficient management of large amounts of voice vectors without compromising system performance.
Advanced Similarity Search: QDrant offers powerful similarity search capabilities that are essential for the voice authentication process. Its vector-based similarity search algorithm ensures accurate and efficient results, allowing for quick and effective comparison of input voice vectors with those stored in the database.
Security and Privacy: QDrant prioritizes data security and privacy, offering robust security measures to protect the integrity and confidentiality of stored voice vectors. Its advanced security features, such as data encryption and granular access controls, ensure that user data is effectively protected against external threats.
Integration with Voice Technologies: QDrant seamlessly integrates with other key voice technologies, such as Resemblyzer, making it easy to generate, store, and search voice vectors in the voice authentication system. This seamless integration ensures optimal interoperability between the various tools and components of the system.

In summary, QDrant provides a comprehensive and highly efficient solution for managing the vector database in our voice authentication system. Its scalability, performance, security, and integration capabilities make it the ideal choice to meet the storage and search needs of voice vectors in a robust and secure voice authentication environment.

Installation

In this section I will provide an explanation about how to setup the whole project architecture.

Deploy VoiceIdVerifier DApp on Polygon PoS Blockhain

The first step is to clone the repository and execute the following command directory to install all the required modules

rake voicepassport:dapp:install_dependencies

After that it will be necessary to create an account in Alchemy, infura or another similar service in order to configure the network on which the Dapp will be deployed.

In my case, I have created a project in Alchemy and I have created a secret.json file to configure the deployment over the Mumbai testnet as you can see in the hardhat.config.ts file of the project:

import { HardhatUserConfig } from "hardhat/config";
import "@nomicfoundation/hardhat-toolbox";
const secret = require('./.secret.json');

const config: HardhatUserConfig = {
  solidity: {
    version: "0.8.9",
    settings: {
      optimizer: {
        enabled: true,
      },
    },
  },
  networks: {
    hardhat: {},
    ganache: {
      url: "http://127.0.0.1:7545",
      allowUnlimitedContractSize: true,
      gas: 2100000,
      gasPrice: 8000000000
    },
    amoy: {
      url: `https://polygon-amoy.g.alchemy.com/v2/${secret.projectId}`,
      accounts: [secret.accountPrivateKey]
    }
  }
};

export default config;

The project has a set of tests to validate the correct behaviour of the contracts and the interaction between them. You can run the following command to launch the test suite on the local EVM:

rake voicepassport:dapp:run_tests

VoiceIDVerifier
    ✔ Should set the right owner (3173ms)
    ✔ register voiceID verification successfully (181ms)
    ✔ disable voiceID verification successfully (164ms)
    ✔ enable voiceID verification successfully (170ms)
    ✔ only the contract's owner can register and verify voice ids (112ms)


  5 passing (4s)

You can deploy your own VoiceIdVerifier DApp instance using the following command:

rake voicepassport:dapp:deploy_contracts

The project has been deployed on the Polygon PoS Amoy testnet, the address of the contract is as follows:

cd VoiceIdVerifierDapp && npx hardhat run --network amoy scripts/deploy.ts
VoiceIDVerifier contract deployed to 0xb23286ffEFa312CB6e828d203BB4a9FF85ee61DD

Environment configuration

It is necessary preparing the environment file called .env placed at the root folder which contains a lot of params to configure the services used in the architecture

## QDrant Configuration
QDRANT_URI=http://voice_passport_qdrant:6333
QDRANT_API_KEY=
QDRANT_COLLECTION=user_voice_embeddings

## VoiceIdVerifierDApp - Alchemy - Polygon PoS
VOICE_ID_VERIFIER_HTTP_PROVIDER=https://polygon-amoy.g.alchemy.com/v2/api_token
VOICE_ID_VERIFIER_CALLER_ADDRESS=CALLER_ADDRESS
VOICE_ID_VERIFIER_CALLER_PRIVATE_KEY=PRIVATE_KEY
VOICE_ID_VERIFIER_CONTRACT_ADDRESS=0xb23286ffEFa312CB6e828d203BB4a9FF85ee61DD
VOICE_ID_VERIFIER_CONTRACT_ABI_NAME=VoiceIDVerifier.json
....................
....................

Platform Setup

Below is the order in which tasks should be executed to set up the project:

Upload Contract ABI to MinIO:

rake voicepassport:upload_contract_abi_to_minio

Build and Push Apache Airflow Image:

rake voicepassport:build_and_push_airflow_image

Build and Push VoicePassport API Image:

rake voicepassport:build_and_push_voice_passport_api_image

Deploy Architecture:
```
rake voicepassport:deploy
```
Create Users in Apache Airflow:

rake voicepassport:create_apache_airflow_users

Screenshots 📷

Here are some screenshots that demonstrate the functionality of Voice Passport:

It is possible to manage the information stored in QDrant by accessing its dashboard. We can visualize the created collections and stored embeddings, and even perform similarity searches.

Each operator implemented in each of the Apache Airflow DAGs stores tracking records in MongoDB that we can analyze and track in order to inspect their operation.

Through the Apache Airflow dashboard, it is possible to monitor the operation of the DAGs and analyze their performance and various metrics.

It's possible to examine details about task execution and the provided execution configuration.

Through the Alchemy dashboard, it's possible to monitor the different transactions executed and various details regarding gas consumption or the block number where the transaction was mined.

It's also possible to visit the Polygon Block Explorer to obtain more details about the mined transaction.

The platform offers a REST API through which it's possible to interact with the system, initiate the registration of new users, and perform authentications.

Through the Apache Airflow UI, it's possible to examine the operation of the registered DAGs, identify the number of failed, queued, and completed tasks...

Task Descriptions

The following table provides descriptions and examples of tasks available in the Rakefile for deploying and managing your environment.

Task	Description	Command
voicepassport:deploy	Deploys the architecture and launches all necessary services and daemons.	`rake voicepassport:deploy`
voicepassport:undeploy	Undeploys the architecture.	`rake voicepassport:undeploy`
voicepassport:start	Starts the containers.	`rake voicepassport:start`
voicepassport:stop	Stops the containers.	`rake voicepassport:stop`
voicepassport:status	Shows the status of the containers.	`rake voicepassport:status`
voicepassport:create_apache_airflow_users	Creates users in Apache Airflow.	`rake voicepassport:create_apache_airflow_users`
voicepassport:build_and_push_airflow_image	Builds and pushes Apache Airflow Docker image to DockerHub.	`rake voicepassport:build_and_push_airflow_image`
voicepassport:build_and_push_voice_passport_api_image	Builds and pushes VoicePassport API Docker image to DockerHub.	`rake voicepassport:build_and_push_voice_passport_api_image`
voicepassport:upload_contract_abi_to_minio	Uploads the contract ABI JSON file to MinIO.	`rake voicepassport:upload_contract_abi_to_minio`
voicepassport:delete_contract_abi_from_minio	Deletes the contract ABI JSON file from MinIO.	`rake voicepassport:delete_contract_abi_from_minio`
voicepassport:check_contract_abi_in_minio	Checks if the contract ABI JSON file exists in MinIO.	`rake voicepassport:check_contract_abi_in_minio`
voicepassport:clean_environment	Cleans the environment.	`rake voicepassport:clean_environment`
voicepassport:check_docker	Checks if Docker and Docker Compose are installed and accessible.	`rake voicepassport:check_docker`
voicepassport:login	Logs in to DockerHub.	`rake voicepassport:login`
voicepassport:check_deployment_file	Checks the existence of the deployment file.	`rake voicepassport:check_deployment_file`

To execute any of these tasks, use the rake command followed by the task name. For example, to deploy VoicePassport, run rake voicepassport:deploy.

Services Overview

Below is a list of services available locally, each with its associated port number and a short description of its purpose. These services are used in the VoicePassport architecture for various functions, including data storage, database management, and API services. Understanding these services and their ports will be helpful when working with the VoicePassport environment.

Service Name	Ports	Purpose
voice_passport_minio1		Object and data storage
voice_passport_minio2		Object and data storage
voice_passport_minio3		Object and data storage
voice_passport_minio_haproxy	9000 (MinIO), 1936 (Stats)	Load balancer for MinIO
voice_passport_mongo	27017	NoSQL database
voice_passport_mongo_express	9001	Web interface to administer MongoDB
voice_passport_redis		Cache storage and message broker for Apache Airflow
voice_passport_postgres	5432	Relational database for Apache Airflow
voice_passport_pgadmin	9002	Web interface to administer PostgreSQL
voice_passport_airflow_webserver	9003	Apache Airflow web server for workflow management
voice_passport_celery_flower	9004 (Celery), 9005 (Web), 9006 (Stats)	Web-based tool to monitor and manage Celery clusters
voice_passport_airflow_scheduler	9007	Task scheduler for Apache Airflow
voice_passport_airflow_worker_1		Apache Airflow task processing
voice_passport_api_service_1		API service for VoicePassport
voice_passport_api_service_2		API service for VoicePassport
voice_passport_api_service_3		API service for VoicePassport
voice_passport_api_service_haproxy	9008 (API), 1937 (Stats)	Load balancer for VoicePassport API services
voice_passport_qdrant	6333 (gRPC), 6334 (HTTP)	Database for storing and querying vectors

Contribution

Contributions to VoicePassport Architecture are highly encouraged! If you're interested in adding new features, resolving bugs, or enhancing the project's functionality, please feel free to submit pull requests.

License

This project is licensed under the MIT License.

Credits

VoicePassport Architecture is developed and maintained by Sergio Sánchez Sánchez (Dream Software). Special thanks to the open-source community and the contributors who have made this project possible. If you have any questions, feedback, or suggestions, feel free to reach out at [email protected].

Acknowledgements 🙏

🙏 I would like to extend my heartfelt gratitude to Karan Shingde for his insightful article published on Medium titled "Build an Audio-driven Speaker Recognition System Using Open Source Technologies: resemblyzer and pyAudioAnalysis". His comprehensive guide served as a significant source of inspiration and a crucial starting point for developing the VoicePassport Architecture project. Karan's expertise and dedication have been instrumental in shaping my understanding and implementation of speaker recognition technologies. I am truly thankful for his invaluable contribution to the field and for sharing his knowledge with the community. 🚀

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VoicePassport: Your Trusted Voice Authentication Solution 🎙️🔐

✨ Features

More Details 📝

🛠️ Technologies Used

Unveiling Architecture 🏛️

Why Blockchain Verification? 🛡️

UML Diagram Explanation for VoiceIDVerifier DApp Deployed on Polygon PoS

The Vector Database: A Core Element, Why QDrant? 📊

Installation

Deploy VoiceIdVerifier DApp on Polygon PoS Blockhain

Environment configuration

Platform Setup

Screenshots 📷

Task Descriptions

Services Overview

Contribution

License

Credits

Acknowledgements 🙏

Visitors Count

Please Share & Star the repository to keep me motivated.

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 141 Commits
VoiceIdVerifierDapp		VoiceIdVerifierDapp
abis		abis
airflow		airflow
api		api
doc		doc
haproxy		haproxy
.dockerignore		.dockerignore
.env		.env
.gitignore		.gitignore
Gemfile		Gemfile
README.md		README.md
Rakefile		Rakefile
docker-compose.yml		docker-compose.yml

sergio11/voice_passport_architecture

Folders and files

Latest commit

History

Repository files navigation

VoicePassport: Your Trusted Voice Authentication Solution 🎙️🔐

✨ Features

More Details 📝

🛠️ Technologies Used

Unveiling Architecture 🏛️

Why Blockchain Verification? 🛡️

UML Diagram Explanation for VoiceIDVerifier DApp Deployed on Polygon PoS

The Vector Database: A Core Element, Why QDrant? 📊

Installation

Deploy VoiceIdVerifier DApp on Polygon PoS Blockhain

Environment configuration

Platform Setup

Screenshots 📷

Task Descriptions

Services Overview

Contribution

License

Credits

Acknowledgements 🙏

Visitors Count

Please Share & Star the repository to keep me motivated.

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages