[FR]: Article Extracting #1290
Labels
Type-Enhancement
This is a request for a brand new feature.
Comments
RSS Guard already integrates readability via its "reader mode" feature.
This is something else. It's not a "reader mode": it extracts the article content without opening the whole page, even if the RSS feed only contains a headline or part of the article.
The recent release has this feature implemented, and it works great.
Brief description of the feature request
This is a follow-up to #399. Since this script stopped working (https://github.com/martinrotter/rssguard/blob/master/resources/scripts/scrapers/scrape-full-articles.py; the site it uses for extraction now returns 404), I have been experimenting with a different solution.
IMHO, sending it online and back seems wholly unnecessary.
Would it be possible to integrate something like this?
It's a script that needs the axios, jsdom and @mozilla/readability npm modules as dependencies and takes the site URL as an argument. It spits out the extracted HTML.
Could it be loaded when an article is clicked, instead of trying to extract all URLs unnecessarily? From what I've seen, there's already a Node-based adblock implemented. Parsing everything would be fine too; would it be enough to just put it into post-processing?