Medical-ChatBot-using-llama-2

Project - Medical ChatBot

Architecture :

Backend :

Data Ingestion : Medical book (PDF file)
Extract data
Create text chunks (break large content into small parts that can be fed into the model)
Embedding - convert each chunk into a vector
Build semantic index - (connecting vectors)
Knowledge base - (Pinecone vector store)

Frontend :

User questions -> Query embedding -> Knowledge base
Knowledge base -> Ranked results -> LLM (Llama 2) -> User answer

Tech stacks:

Language - Python
Framework - LangChain
Frontend/webapp - Flask, HTML, CSS
LLM - Meta Llama 2
Vector DB - Pinecone

Description

This project is a Medical ChatBot built using the Llama-2 model. The primary goal of this chatbot is to provide accurate and relevant medical information to users based on their queries.

The architecture of the project is divided into two main parts: the backend and the frontend.

Backend:

Data Ingestion: The backend starts with ingesting data from a medical book in PDF format.
Data Extraction: The data from the PDF is then extracted and processed.
Text Chunking: The extracted text is broken down into smaller chunks, so that each piece is small enough to be embedded and later passed to the model.
Embedding: Each chunk of text is converted into a vector representation, also known as embedding.
Semantic Index Building: A semantic index is built over these vectors, so that text chunks with similar meaning can be found by comparing their embeddings.
Knowledge Base: The embeddings are stored in a knowledge base using Pinecone, a vector database.
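
The backend steps above can be illustrated with a minimal sketch. It uses only the Python standard library and is not the project's code: `chunk_text`, `toy_embed`, and `build_index` are hypothetical names, the hashed bag-of-words vector stands in for a real embedding model, and the in-memory list stands in for the Pinecone index. The chunk size and overlap values are assumptions chosen for illustration.

```python
import hashlib
import math
import re


def chunk_text(text, chunk_size=500, overlap=50):
    """Split text into chunks of at most chunk_size characters.

    Consecutive chunks share `overlap` characters so that a sentence
    cut at a boundary still appears whole in one of the chunks.
    """
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be >= 0 and smaller than chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last chunk already reaches the end of the text
    return chunks


def toy_embed(text, dim=64):
    """Toy stand-in for an embedding model (assumption, not a real model).

    Each token is hashed into one of `dim` buckets, and the resulting
    count vector is scaled to unit length.
    """
    vec = [0.0] * dim
    for token in re.findall(r"[a-z0-9]+", text.lower()):
        digest = hashlib.sha256(token.encode("utf-8")).digest()
        vec[int.from_bytes(digest[:4], "big") % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec


def build_index(chunks, dim=64):
    """Build an in-memory list of records: id, vector, and original text.

    A vector database stores comparable records; here a plain list
    plays that role for illustration only.
    """
    return [
        {"id": f"chunk-{i}", "values": toy_embed(chunk, dim), "metadata": {"text": chunk}}
        for i, chunk in enumerate(chunks)
    ]
```

Keeping the original text alongside each vector matters for the later steps: a similarity search returns vectors, but the language model needs the text those vectors were computed from.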

Frontend:

User Query Processing: When a user asks a question, it is converted into a query embedding.
Knowledge Base Lookup: This query embedding is used to search the knowledge base for the most relevant information.
LLM Processing: The ranked results from the knowledge base are then passed to the Llama-2 model together with the user's question.
User Answer Generation: Finally, the model generates an answer to the user's question based on the information it has processed.
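
The query flow above can be sketched in the same spirit. This is an illustrative example using only the standard library, not the project's implementation: `term_vector`, `cosine`, `retrieve`, and `build_prompt` are hypothetical names, term-count vectors stand in for the real query and chunk embeddings, a plain list of passages stands in for the knowledge base, and the prompt wording is an assumption. The resulting prompt string is what would be handed to the language model.

```python
import math
import re
from collections import Counter


def term_vector(text):
    """Toy stand-in for an embedding: lowercase token counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(question, passages, k=2):
    """Return the k passages most similar to the question, best first."""
    query = term_vector(question)
    ranked = sorted(passages, key=lambda p: cosine(query, term_vector(p)), reverse=True)
    return ranked[:k]


def build_prompt(question, context_passages):
    """Combine the retrieved passages and the question into one prompt."""
    context = "\n\n".join(context_passages)
    return (
        "Use the following context to answer the question. "
        "If the answer is not in the context, say you don't know.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    )


passages = [
    "Fever is a rise in body temperature.",
    "A fracture is a broken bone.",
    "Insulin regulates blood sugar.",
]
question = "What causes fever and high body temperature?"
top = retrieve(question, passages, k=2)
prompt = build_prompt(question, top)
```

Only the top-ranked passages are placed in the prompt, which keeps the model's input short and focused on text related to the question.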

The tech stack used for this project includes:

Python: The primary programming language used for developing the chatbot.
LangChain: A framework used for building the chatbot.
Flask, HTML, CSS: These technologies are used for building the frontend or web application of the chatbot.
Meta Llama-2: The language model used for processing and generating responses to user queries.
Pinecone: A vector database used for storing the text embeddings.

In summary, this Medical ChatBot project combines retrieval over an embedded medical book with the Llama-2 model to answer users' medical questions based on the content of that book.

athiyaman-m/Medical-ChatBot-using-llama-2
