"Word2Vec for Russian text" - my project for course "Scientific Data Computing" in University of Tartu. It was presented as 20-minutes talk on 6th Estonian Digital Humanities Conference at September 2018.
NLP Dataset Creation and Semantic Search Demonstration
Retrieval-Augmented Generation using Azure OpenAI
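A minimal sketch of that RAG flow with the `openai` package's AzureOpenAI client; the endpoint, key, deployment names, and toy documents below are placeholders, not values from this project.

```python
import numpy as np
from openai import AzureOpenAI

client = AzureOpenAI(
    azure_endpoint="https://<your-resource>.openai.azure.com",  # placeholder
    api_key="<your-key>",                                       # placeholder
    api_version="2024-02-01",
)

documents = ["Tartu is a city in Estonia.", "Word2Vec learns word vectors."]

def embed(texts):
    # "text-embedding-ada-002" must match your embedding deployment name
    resp = client.embeddings.create(model="text-embedding-ada-002", input=texts)
    return np.array([d.embedding for d in resp.data])

doc_vecs = embed(documents)
query = "What does Word2Vec do?"
q_vec = embed([query])[0]

# Cosine similarity picks the most relevant document as context
scores = doc_vecs @ q_vec / (np.linalg.norm(doc_vecs, axis=1) * np.linalg.norm(q_vec))
context = documents[int(scores.argmax())]

answer = client.chat.completions.create(
    model="gpt-35-turbo",  # must match your chat deployment name
    messages=[
        {"role": "system", "content": "Answer using only the provided context."},
        {"role": "user", "content": f"Context: {context}\n\nQuestion: {query}"},
    ],
)
print(answer.choices[0].message.content)
```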
Universal-Sentence-Encoder-Multilingual-QA is a model developed by researchers at Google, primarily for question answering. You can use this template to import the model into Inferless.
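Outside the Inferless template, the model can also be loaded directly from TF Hub. A hedged sketch, following the usage documented for universal-sentence-encoder-multilingual-qa; the questions, answers, and contexts are made-up examples.

```python
import numpy as np
import tensorflow as tf
import tensorflow_hub as hub
import tensorflow_text  # noqa: F401 -- registers the SentencePiece ops the model needs

module = hub.load("https://tfhub.dev/google/universal-sentence-encoder-multilingual-qa/3")

questions = ["What is your age?"]
responses = ["I am 20 years old.", "Good morning."]
contexts = ["I will be 21 next year.", "It is a lovely day."]

q_emb = module.signatures["question_encoder"](tf.constant(questions))["outputs"]
r_emb = module.signatures["response_encoder"](
    input=tf.constant(responses), context=tf.constant(contexts)
)["outputs"]

# Dot product ranks candidate answers for each question
print(np.inner(q_emb, r_emb))
```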
"BrightPsych" is a holistic mental health platform featuring a supportive chatbot and detail CBT analysis for disorders. Daily Mood Tracking aids emotional well-being, while data analysis unveils student mental health trends. Guided mindfulness contribute to resilience in a nurturing space. Empower, Engage and Elevate through Community Forum.
BGE-M3 is an innovative project known for its versatility, featuring Multi-Functionality, Multi-Linguality, and Multi-Granularity.
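A short sketch, assuming the FlagEmbedding package and the BAAI/bge-m3 checkpoint, showing the three output types (dense, sparse/lexical, multi-vector) behind the multi-functionality claim; the sentences are toy examples.

```python
from FlagEmbedding import BGEM3FlagModel

model = BGEM3FlagModel("BAAI/bge-m3", use_fp16=True)

sentences = ["What is BGE-M3?", "BGE-M3 supports dense, sparse and multi-vector retrieval."]
out = model.encode(
    sentences, return_dense=True, return_sparse=True, return_colbert_vecs=True
)

print(out["dense_vecs"].shape)       # dense embeddings
print(out["lexical_weights"][0])     # sparse lexical weights (token -> weight)
print(out["colbert_vecs"][0].shape)  # multi-vector (ColBERT-style) representations
```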
MedCPT generates embeddings of biomedical texts that can be used for semantic search (dense retrieval). The MedCPT Query Encoder computes embeddings of short texts (e.g., questions, search queries, sentences). In this template, we will import the MedCPT Query Encoder on the Inferless Platform.
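For reference, a sketch of computing query embeddings directly with Hugging Face transformers (not via Inferless), assuming the ncbi/MedCPT-Query-Encoder checkpoint; the CLS-token output is taken as the embedding, following the model card, and the queries are toy examples.

```python
import torch
from transformers import AutoModel, AutoTokenizer

model_id = "ncbi/MedCPT-Query-Encoder"  # assumed Hugging Face id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

queries = ["diabetes treatment", "How to treat hypertension?"]

with torch.no_grad():
    encoded = tokenizer(
        queries, padding=True, truncation=True, max_length=64, return_tensors="pt"
    )
    # The [CLS] token representation serves as the query embedding
    embeddings = model(**encoded).last_hidden_state[:, 0, :]

print(embeddings.shape)  # (2, hidden_size)
```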
State-of-the-art Ember embedding model for retrieval-augmented generation
This is a sentence embedding model, initialized from xlm-roberta-large and continually trained on a mixture of multilingual datasets. It supports 100 languages from xlm-roberta, but low-resource languages may see performance degradation.
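A hedged sentence-transformers sketch; the model id below is an assumption based on that description (E5-style models additionally expect "query:" / "passage:" prefixes at inference time), and the sentences are toy examples.

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("intfloat/multilingual-e5-large")  # assumed model id

sentences = [
    "query: How does Word2Vec work?",
    "passage: Word2Vec learns vector representations of words from raw text.",
    "passage: Tartu on linn Eestis.",  # Estonian: "Tartu is a city in Estonia."
]
embeddings = model.encode(sentences, normalize_embeddings=True)

# Cosine similarity of the query against both passages
print(util.cos_sim(embeddings[0], embeddings[1:]))
```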
The MS-marco-MiniLM-L-12-v2 model can be used for information retrieval: given a query, encode the query with all candidate passages (e.g., retrieved with Elasticsearch), then sort the passages in decreasing order of score.
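A sketch of that reranking step with the sentence-transformers CrossEncoder class; the query and passages are toy examples.

```python
from sentence_transformers import CrossEncoder

model = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-12-v2")

query = "How many people live in Berlin?"
passages = [
    "Berlin has a population of about 3.7 million registered inhabitants.",
    "Berlin is well known for its museums.",
]

# Score each (query, passage) pair, then sort passages by descending relevance
scores = model.predict([(query, p) for p in passages])
for passage, score in sorted(zip(passages, scores), key=lambda x: x[1], reverse=True):
    print(f"{score:.3f}  {passage}")
```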
Julia experimentation using sequence-based NLP models
OpenAI text embeddings: clean, process, and create vectorized representations of text for indexing and semantic search
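A minimal sketch of that pipeline with the openai Python client; the embedding model name and the texts are illustrative, not taken from the repository.

```python
import numpy as np
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

texts = ["  Word2Vec for Russian text \n", "Semantic search over course materials"]
cleaned = [" ".join(t.split()) for t in texts]  # basic cleaning: collapse whitespace

resp = client.embeddings.create(model="text-embedding-3-small", input=cleaned)
vectors = np.array([item.embedding for item in resp.data])

print(vectors.shape)  # (2, 1536) for text-embedding-3-small
# `vectors` can now be stored in a vector index for semantic search
```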
jina-embeddings-v2-base-en is an English, monolingual embedding model supporting 8192 sequence length. It is based on a BERT architecture (JinaBERT) that supports the symmetric bidirectional variant of ALiBi to allow longer sequence length. The backbone jina-bert-v2-base-en is pretrained on the C4 dataset.
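A short sketch following the usage shown on the model card; trust_remote_code is needed because JinaBERT is a custom architecture, and the two sentences are toy inputs.

```python
from numpy.linalg import norm
from transformers import AutoModel

# trust_remote_code=True pulls in the custom JinaBERT (ALiBi) implementation
model = AutoModel.from_pretrained("jinaai/jina-embeddings-v2-base-en", trust_remote_code=True)

embeddings = model.encode(
    ["How is the weather today?", "What is the current weather like today?"]
)
cos_sim = embeddings[0] @ embeddings[1] / (norm(embeddings[0]) * norm(embeddings[1]))
print(cos_sim)
```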
Functionality of a bot that fills in a particular form on request from the user
Kickstarter project success or failure prediction, using Word2Vec to train the embedding file.
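A hedged sketch of the embedding-training step with gensim's Word2Vec; the toy texts stand in for the real Kickstarter descriptions and the output filename is made up.

```python
from gensim.models import Word2Vec
from gensim.utils import simple_preprocess

# Toy stand-ins for Kickstarter project descriptions
texts = [
    "An innovative board game about space exploration",
    "A documentary film on traditional folk music",
    "Smart watch with open source firmware",
]
sentences = [simple_preprocess(t) for t in texts]

model = Word2Vec(sentences, vector_size=100, window=5, min_count=1, workers=4)
model.wv.save_word2vec_format("kickstarter_embeddings.txt")  # plain-text embedding file

print(model.wv["game"][:5])  # first few dimensions of one word vector
```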
Using the OnionOrNot dataset from Kaggle to train a binary classification model with the Keras deep learning library.
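A minimal Keras sketch of such a binary classifier; the two headlines and labels below are invented placeholders for the real dataset, and the architecture is only one reasonable choice.

```python
import tensorflow as tf
from tensorflow.keras import layers

# Invented placeholders for OnionOrNot headlines (label 1 = satirical)
headlines = ["Man wins lottery twice in one week", "Area dad refuses to ask for directions"]
labels = [0, 1]

vectorizer = layers.TextVectorization(max_tokens=20000, output_sequence_length=64)
vectorizer.adapt(headlines)

model = tf.keras.Sequential([
    tf.keras.Input(shape=(1,), dtype=tf.string),
    vectorizer,
    layers.Embedding(20000, 64),
    layers.GlobalAveragePooling1D(),
    layers.Dense(32, activation="relu"),
    layers.Dense(1, activation="sigmoid"),  # binary output: Onion or not
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.fit(tf.constant(headlines), tf.constant(labels), epochs=3)
```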
Some applications of text embedding models, e.g., semantic retrieval and clustering.
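A combined sketch of both applications with sentence-transformers and scikit-learn; the model id and the small corpus are assumptions for illustration.

```python
from sentence_transformers import SentenceTransformer, util
from sklearn.cluster import KMeans

model = SentenceTransformer("all-MiniLM-L6-v2")  # assumed model id

corpus = [
    "How do I train word2vec on Russian text?",
    "Word2Vec tutorial for Cyrillic corpora",
    "Best pizza places in Tartu",
    "Restaurants near the University of Tartu",
]
embeddings = model.encode(corpus, normalize_embeddings=True)

# Semantic retrieval: rank corpus entries against a query by cosine similarity
query_emb = model.encode("training word embeddings", normalize_embeddings=True)
hits = util.semantic_search(query_emb, embeddings, top_k=2)[0]
print([corpus[hit["corpus_id"]] for hit in hits])

# Clustering: group the same embeddings into two topics
cluster_labels = KMeans(n_clusters=2, n_init=10).fit_predict(embeddings)
print(cluster_labels)
```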