You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm trying to use "Recursive URL" Document loaders from "langchain_community.document_loaders.recursive_url_loader" to process load all URLs under a root directory but css or js links are also processed
System Info
System Information
OS: Linux
OS Version: #1 SMP Tue Dec 19 13:14:11 UTC 2023
Python Version: 3.10.13 | packaged by conda-forge | (main, Dec 23 2023, 15:36:39) [GCC 12.3.0]
The text was updated successfully, but these errors were encountered:
dosubotbot
added
Ɑ: doc loader
Related to document loader module (not documentation)
🤖:bug
Related to a bug, vulnerability, unexpected error with an existing feature
labels
May 2, 2024
Hey, @beethogedeon can you provide the URL where you are facing the problem?
For the URL currently given by you (https://example.com/), the problem lies in the extractor. you have used a very basic extractor and the code can be changed to:
Checked other resources
Example Code
Error Message and Stack Trace (if applicable)
No response
Description
I'm trying to use "Recursive URL" Document loaders from "langchain_community.document_loaders.recursive_url_loader" to process load all URLs under a root directory but css or js links are also processed
System Info
System Information
Package Information
The text was updated successfully, but these errors were encountered: