
RAG: Research-assistant


This project aims to help researchers find answers in a set of research papers with the help of a customized RAG pipeline and a powerful LLM, all offline and free of cost.

For more details, please check out the blog post about this project.

How it works

(Project architecture diagram)

  1. Download research papers from arXiv
  2. Use LlamaIndex to load, chunk, embed, and store these documents in a Qdrant database (a sketch of this step follows the list)
  3. A FastAPI endpoint receives a query/question, searches the indexed documents, and finds the best-matching chunks
  4. Feed these relevant chunks to the LLM as context
  5. Generate an easy-to-understand answer and return it as an API response along with the cited sources
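
A minimal sketch of the indexing step (2), assuming current llama-index (>= 0.10) package layout, a local Hugging Face embedding model, and a placeholder collection name; the repo's rag/data.py may differ in details:

```python
# Minimal ingestion sketch: load PDFs, chunk + embed them locally, and store
# the vectors in Qdrant. Collection name, folder, and embedding model are assumptions.
import qdrant_client
from llama_index.core import Settings, SimpleDirectoryReader, StorageContext, VectorStoreIndex
from llama_index.embeddings.huggingface import HuggingFaceEmbedding
from llama_index.vector_stores.qdrant import QdrantVectorStore

# Keep everything offline by using a local embedding model.
Settings.embed_model = HuggingFaceEmbedding(model_name="BAAI/bge-small-en-v1.5")

client = qdrant_client.QdrantClient(url="http://localhost:6333")
vector_store = QdrantVectorStore(client=client, collection_name="researchpapers")
storage_context = StorageContext.from_defaults(vector_store=vector_store)

# Load the downloaded papers, split them into chunks, embed, and persist to Qdrant.
documents = SimpleDirectoryReader("./papers").load_data()
index = VectorStoreIndex.from_documents(documents, storage_context=storage_context)
```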

Running the project

Starting a Qdrant docker instance

docker run -p 6333:6333 -v ~/qdrant_storage:/qdrant/storage:z qdrant/qdrant
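
Optionally, you can verify the instance is reachable with the Qdrant Python client before indexing (a quick check, not part of the repo's scripts):

```python
# Optional connectivity check against the local Qdrant instance.
from qdrant_client import QdrantClient

client = QdrantClient(url="http://localhost:6333")
print(client.get_collections())  # a fresh instance returns an empty collection list
```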

Downloading & Indexing data

python rag/data.py --query "LLM" --max 10 --ingest

Starting Ollama LLM server

Follow this article for more info on how to run models from Hugging Face locally with Ollama.

Create model from Modelfile

ollama create research_assistant -f ollama/Modelfile 

Start the model server

ollama run research_assistant

By default, Ollama runs on http://localhost:11434
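
Inside the pipeline, LlamaIndex can talk to this server through its Ollama integration. A small sketch, assuming the llama-index-llms-ollama package is installed; the timeout value is only an example:

```python
# Sketch: pointing LlamaIndex's LLM at the local Ollama server.
from llama_index.llms.ollama import Ollama

llm = Ollama(
    model="research_assistant",         # the model created from ollama/Modelfile
    base_url="http://localhost:11434",  # Ollama's default endpoint
    request_timeout=120.0,              # example value; tune for your hardware
)
print(llm.complete("Summarize retrieval-augmented generation in one sentence."))
```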

Starting the API server

uvicorn app:app --reload
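
The app module behind this command ties retrieval and generation together. A simplified sketch of such an endpoint, reusing the placeholder collection, embedding model, and model name from above; the route and payload shape are illustrative, not the repo's exact API:

```python
# Simplified sketch of the query endpoint: retrieve matching chunks from
# Qdrant and let the local LLM answer with sources. Route name, payload
# shape, and collection name are assumptions.
import qdrant_client
from fastapi import FastAPI
from pydantic import BaseModel
from llama_index.core import Settings, VectorStoreIndex
from llama_index.embeddings.huggingface import HuggingFaceEmbedding
from llama_index.llms.ollama import Ollama
from llama_index.vector_stores.qdrant import QdrantVectorStore

Settings.embed_model = HuggingFaceEmbedding(model_name="BAAI/bge-small-en-v1.5")
Settings.llm = Ollama(model="research_assistant", base_url="http://localhost:11434")

client = qdrant_client.QdrantClient(url="http://localhost:6333")
vector_store = QdrantVectorStore(client=client, collection_name="researchpapers")
index = VectorStoreIndex.from_vector_store(vector_store=vector_store)
query_engine = index.as_query_engine(similarity_top_k=3)

app = FastAPI()

class Query(BaseModel):
    question: str

@app.post("/search")
def search(query: Query):
    # Retrieve the best-matching chunks, generate an answer, and return the
    # source metadata so the caller can cite the original papers.
    response = query_engine.query(query.question)
    sources = [node.metadata for node in response.source_nodes]
    return {"answer": str(response), "sources": sources}
```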

Example

Request

(Screenshot of an example POST request to the API)

Response

(Screenshot of the API response)
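
For reference, a call to an endpoint like the one sketched above could look as follows (the path and payload mirror that sketch, not necessarily the repo's exact schema):

```python
# Illustrative client call against the sketched /search endpoint.
import requests

resp = requests.post(
    "http://localhost:8000/search",
    json={"question": "What are the main limitations of large language models?"},
)
print(resp.json())  # {"answer": "...", "sources": [...]}
```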
