Vector Dimension Issue While Querying #8142

hodgesz · 2023-10-15T23:43:41Z

hodgesz
Oct 15, 2023

I have created an index in Qdrant using Instructor embeddings with 768 dimensions. However, I am running into issues while trying to query it.

Here is my code:

`
embeddings = HuggingFaceInstructEmbeddings(model_name="hkunlp/instructor-xl",
model_kwargs={"device": "mps"})

client = qdrant_client.QdrantClient(url='http://localhost:6333', prefer_grpc=True)
vector_store = QdrantVectorStore(client=client, collection_name=qdrant_index,
embeddings=embeddings)

llm = ChatOpenAI(model_name=gpt3_5_turbo_model_name,
temperature=openai_temperature)

index = VectorStoreIndex.from_vector_store(vector_store=vector_store,
dimensions=768,
embed_model=embeddings)

retriever = VectorIndexRetriever(
index=index,
# vector_store_query_mode=VectorStoreQueryMode.DEFAULT,
similarity_top_k=6
)

service_context = ServiceContext.from_defaults(
llm=llm,
system_prompt=template,
embed_model=embeddings,
chunk_size=400,
chunk_overlap=20
)

response_synthesizer = get_response_synthesizer(
response_mode="refine",
service_context=service_context,
use_async=False,
streaming=False,
)

query_engine = (
RetrieverQueryEngine(
retriever=retriever,
response_synthesizer=response_synthesizer,
))

Write query answer

st.markdown("### Answer:")

result = query_engine.query(query)
print(result.source_nodes)

Display the results

st.write(f"Answer: {str(result)}")
`

Here is my error:

Traceback (most recent call last):
File "/opt/homebrew/Caskroom/miniforge/base/envs/langchain_dev/lib/python3.11/site-packages/streamlit/runtime/scriptrunner/script_runner.py", line 565, in _run_script
exec(code, module.dict)
File "/Users/hodgesz/PycharmProjects/WorkForge/workforge_ai_workshop/code/llamaindex_chat_app.py", line 181, in
raise e
File "/Users/hodgesz/PycharmProjects/WorkForge/workforge_ai_workshop/code/llamaindex_chat_app.py", line 174, in
result = query_engine.query(query)
^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Caskroom/miniforge/base/envs/langchain_dev/lib/python3.11/site-packages/llama_index/indices/query/base.py", line 23, in query
return self._query(str_or_query_bundle)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Caskroom/miniforge/base/envs/langchain_dev/lib/python3.11/site-packages/llama_index/query_engine/retriever_query_engine.py", line 171, in _query
nodes = self.retrieve(query_bundle)
^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Caskroom/miniforge/base/envs/langchain_dev/lib/python3.11/site-packages/llama_index/query_engine/retriever_query_engine.py", line 123, in retrieve
nodes = self._retriever.retrieve(query_bundle)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Caskroom/miniforge/base/envs/langchain_dev/lib/python3.11/site-packages/llama_index/indices/base_retriever.py", line 22, in retrieve
return self._retrieve(str_or_query_bundle)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Caskroom/miniforge/base/envs/langchain_dev/lib/python3.11/site-packages/llama_index/indices/vector_store/retrievers/retriever.py", line 87, in _retrieve
return self._get_nodes_with_embeddings(query_bundle)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Caskroom/miniforge/base/envs/langchain_dev/lib/python3.11/site-packages/llama_index/indices/vector_store/retrievers/retriever.py", line 164, in _get_nodes_with_embeddings
query_result = self._vector_store.query(query, **self._kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Caskroom/miniforge/base/envs/langchain_dev/lib/python3.11/site-packages/llama_index/vector_stores/qdrant.py", line 381, in query
response = self._client.search(
^^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Caskroom/miniforge/base/envs/langchain_dev/lib/python3.11/site-packages/qdrant_client/qdrant_client.py", line 289, in search
return self._client.search(
^^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Caskroom/miniforge/base/envs/langchain_dev/lib/python3.11/site-packages/qdrant_client/qdrant_remote.py", line 401, in search
res: grpc.SearchResponse = self.grpc_points.Search(
^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Caskroom/miniforge/base/envs/langchain_dev/lib/python3.11/site-packages/grpc/_channel.py", line 1030, in call
return _end_unary_response_blocking(state, call, False, None)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Caskroom/miniforge/base/envs/langchain_dev/lib/python3.11/site-packages/grpc/_channel.py", line 910, in _end_unary_response_blocking
raise _InactiveRpcError(state) # pytype: disable=not-instantiable
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
grpc._channel._InactiveRpcError: <_InactiveRpcError of RPC that terminated with:
status = StatusCode.INVALID_ARGUMENT
details = "Wrong input: Vector inserting error: expected dim: 768, got 1536"
debug_error_string = "UNKNOWN:Error received from peer {created_time:"2023-10-15T17:26:00.324433-06:00", grpc_status:3, grpc_message:"Wrong input: Vector inserting error: expected dim: 768, got 1536"}"

For some reason, it is assuming embedding size of 1536 instead of 768 and was wondering if it is set to OpenAI default somewhere.

Any ideas?

Answered by logan-markewich

Oct 17, 2023

@hodgesz you need to pass in the service context when loading the index (with the service context setup with the same embedding model you used to create the index)

index = VectorStoreIndex.from_vector_store(vector_store=vector_store, service_context=service_context)

View full answer

@dosu-bot · 2023-10-15T23:51:41Z

dosubot[bot]
bot Oct 15, 2023

🤖

Hello,

The error message you're seeing is due to a mismatch between the expected and actual dimensions of the embeddings. In your case, the Qdrant index is expecting embeddings of size 768, but it's receiving embeddings of size 1536.

This mismatch is likely because the size of the vector in Qdrant is determined by the first node that is added to the collection. If the first node that was added to the collection had an embedding size of 1536, that would set the vector size for the entire collection to 1536. If you want the vector size to be 768, you need to ensure that the first node added to the collection has an embedding size of 768. This behavior is evident from the add and async_add methods in the QdrantVectorStore class in LlamaIndex.

Here is the relevant code from the add method:

if len(nodes) > 0 and not self._collection_initialized:
    self._create_collection(
        collection_name=self.collection_name,
        vector_size=len(nodes[0].get_embedding()),
    )

And from the async_add method:

if len(nodes) > 0 and not self._collection_initialized:
    await self._async_create_collection(
        collection_name=self.collection_name,
        vector_size=len(nodes[0].get_embedding()),
    )

You can find this code in the QdrantVectorStore class in the LlamaIndex repository.

To resolve this issue, you might need to recreate your Qdrant index ensuring that the first node added has an embedding size of 768.

I hope this helps! Let me know if you have any other questions.

Sources

This response is meant to be useful and save you time. It is not meant to be a precise solution, but rather a starting point for your own research.

Help me be more useful! Please leave a 👍 if this is helpful and 👎 if it is irrelevant.

If you want to continue the conversation, start your reply with @dosu-bot.

1 reply

hodgesz Oct 16, 2023
Author

I deleted and recreated the index in Qdrant and ensured all 333 entries are of 768 dimensions.

When I use the following code when the index is initialized from the documents versus from the vector store, the query code from above works as expected with no errors about vector dimensions.

documents = SimpleDirectoryReader("/Users/hodgesz/LangChain/data/workforge/load/csv/").load_data()
service_context = ServiceContext.from_defaults(chunk_size=400, chunk_overlap=20, embed_model=embeddings)

storage_context = StorageContext.from_defaults(vector_store=vector_store)
index = VectorStoreIndex.from_documents(
    documents, storage_context=storage_context, service_context=service_context, dimensions=768
)

However, when I move back to the following code to initialize the index from Qdrant, I get the same 1536 dimension error I initially reported.

index = VectorStoreIndex.from_vector_store(vector_store=vector_store,
                                                       dimensions=768,
                                                       embed_model=embeddings)

Any other ideas?

logan-markewich · 2023-10-17T17:36:34Z

logan-markewich
Oct 17, 2023
Maintainer

@hodgesz you need to pass in the service context when loading the index (with the service context setup with the same embedding model you used to create the index)

index = VectorStoreIndex.from_vector_store(vector_store=vector_store, service_context=service_context)

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vector Dimension Issue While Querying #8142

{{title}}

Replies: 2 comments 1 reply

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

Vector Dimension Issue While Querying #8142

hodgesz Oct 15, 2023

Write query answer

Display the results

Replies: 2 comments · 1 reply

dosubot[bot] bot Oct 15, 2023

Sources

hodgesz Oct 16, 2023 Author

logan-markewich Oct 17, 2023 Maintainer

hodgesz
Oct 15, 2023

Replies: 2 comments 1 reply

dosubot[bot]
bot Oct 15, 2023

hodgesz Oct 16, 2023
Author

logan-markewich
Oct 17, 2023
Maintainer