New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Question]: on querying responses are totally incorrect #13130
Comments
It sounds like you're encountering issues with the accuracy of responses from your query engine. Let's tackle this step by step to identify potential causes and solutions:
By methodically reviewing these areas, you should be able to identify and resolve the issues causing inaccurate responses.
|
@dosu ok but there is NO documentation/exemple on "PagedCSVReader" and cannot find any readers to install. If I shouldn't use SimpleDirectoryReader as you said, give me simple example to persist simple data.csv to ./persist/ folder using PagedCSVReader. |
@elvisbegovic structured data is pretty special case. Unless it makes sense to embed and retrieve individual rows (maybe you have QA pairs, etc.), using a vector index is probably not the right approach Instead you probably want to put it into a sqlite db or similar and do text-to-sql, or a dataframe and do text-to-pandas https://docs.llamaindex.ai/en/stable/use_cases/q_and_a/#qa-over-structured-data |
Thank you @logan-markewich i will try what you say and I understand what txt-to-sql try to achieve... thanks. let me ask : I want be sure to understand. As "base readers" includes CSV file extension and as far as I know every CSV is "structured". So what explain that llama-index support officially reading CSV as core feature and you say it is "special case? Is there any different way of what im doing here to create documents[] based on csv. Or in other words: what is usecase reading csv using SimpleDirectoryReader? thank you |
From my observation, the built-in readers of As for a sample CSV as below:
there will only be one document with the text: Similarly, for the following JSON: {
"a": "b":
"c": [
{
"n": 1
},
{
"n": 2
}
]
} you will also get a single document that contains the text: @elvisbegovic Returning to the current issue, I think it matters what the content your CSV file contains and what the Q/A behavior you expect. It may be helpful if you can provide more information. |
Question Validation
Question
Hi there, sory to make noise here.
I am impressed how responses are completly incorrect base on a simple "structured" csv file. Am I doing things correctly ?
Step 1, persist csv file with load.py :
Step 2, chat with data with chat.py :
So on code / compilation etc everytihng works perfectly (llama-index is awesome) BUT every question I asking on this CSV file, reponses are completly wrong. What Am I doing wrong here ?
The text was updated successfully, but these errors were encountered: