Converting HTML to markdown doesn't appear to preserve HTML entities #431
Using the CommonMark dingus, entering
@bjones1 it does need to be preserved in this case: <br> which is converted to <br> and in this case: &amp; is an ampersand and in this case: A big space and in this case: Not a code block
Consider the following HTML example:
When I try converting this to Markdown using Turndown, I get the following output:
I guess I would expect Turndown to preserve HTML entities and to output something like this instead:
I couldn't see an option to turn this on, so unless I'm missing something, I assume I need to use something like https://www.npmjs.com/package/html-entities. But I just wanted to check I'm not missing anything obvious?
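For illustration, here is a minimal sketch of the kind of re-encoding step an entity library would perform, written without any dependency. The names `ENTITY_MAP` and `encodeEntities` are hypothetical and not part of Turndown or `html-entities`, and the set of characters covered is an assumption chosen to keep the example short.

```javascript
// Hypothetical helper, not part of Turndown or html-entities.
// Re-encodes characters that would otherwise be interpreted as markup
// (or silently lost, in the case of non-breaking spaces) when the
// generated Markdown is rendered back to HTML.
const ENTITY_MAP = {
  "&": "&amp;",
  "<": "&lt;",
  ">": "&gt;",
  "\u00A0": "&nbsp;", // non-breaking space
};

function encodeEntities(text) {
  // Single pass over the string, so an "&" produced by one replacement
  // is never re-encoded by another.
  return text.replace(/[&<>\u00A0]/g, (ch) => ENTITY_MAP[ch]);
}

console.log(encodeEntities("Use <br> for a line break & more"));
// prints "Use &lt;br&gt; for a line break &amp; more"
```

One place a function like this might be wired in is Turndown's `escape` method, which the README describes as overridable. Whether that hook sees the text in the form needed here is an assumption that would need checking against the actual output.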
Here's the config I'm using:
Any help much appreciated!