Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to upload data with DI in portal? #1604

Open
dicktangdev opened this issue May 10, 2024 · 5 comments
Open

How to upload data with DI in portal? #1604

dicktangdev opened this issue May 10, 2024 · 5 comments

Comments

@dicktangdev
Copy link

Please provide us with the following information:

This issue is for a: (mark with an x)

- [ ] bug report -> please search issues before submitting
- [ ] feature request
- [x] documentation issue or request
- [ ] regression (a behavior that used to work and stopped in a new release)

Minimal steps to reproduce

I deployed the demo with quick start, finding a way to upload documents like DOCX, PPTX, XLSX through DI(as document mentioned that these document types are only supported by DI). I cannot open the DI with it's endpoint, it shows 404. And I also cannot find the indexer in AI search. Please kindly let me know how I can upload my data through Azure portal. Thanks!

Any log messages given by the failure

Expected/desired behavior

Upload data in portal

OS and Version?

Windows 7, 8 or 10. Linux (which distribution). macOS (Yosemite? El Capitan? Sierra?)

azd version?

run azd version and copy paste here.

Versions

Mention any other details that might be useful


Thanks! We'll be in touch soon.

@zedhaque
Copy link

Just wondering did you check below:

https://github.com/Azure-Samples/azure-search-openai-demo/blob/main/docs/data_ingestion.md

specifically indexing additional documents section. Maybe this is what you are looking for?

@dicktangdev
Copy link
Author

Hi @zedhaque

Yes I read the indexing additional documents section. I saw that there is an index called "gptkbindex" created and I also upload new PDF files in the blob container "content" folder(the sample PDFs were all uploaded to here).
But I am not quite sure how to "find the index, and run it" in Azure portal. The index cannot be run and there is not any indexer created. I also tested from UI and search explorer in the index "gptkbindex", the new uploaded PDFs content is not available.
I am still try to figure out how to upload data through Azure portal. If you know the details, please kindly share with me. Thanks!

@zedhaque
Copy link

Just to confirm - when you click on the "indexer" in the AI Search menu item - nothing is there/it's an empty page? and your .env has below item as true:
USE_FEATURE_INT_VECTORIZATION true

correct?

Not sure why the indexer and scheduler didn't get created. (https://learn.microsoft.com/en-us/azure/search/vector-search-integrated-vectorization).

You could set up the schedular in the portal manually and see if it picks up the document that you uploaded earlier.

@heartleaf
Copy link

I had a similar question like the original poster here, demo creates an index automatically question is how to index documents that are directly uploaded to storage containers. I know that scheduling indexing requires setting up a new indexer, but how to associate the indexer to the existing index and make sure new documents or updates to existing pdfs are automatically reindexed and made available in chatApp.

@mikedizon
Copy link

interested to know this as well. i am getting auth issues trying to index locally by running ./scripts/prepdocs.sh. the pdf i am trying to index is 7000 pages

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants