Unicode whitespace is stripped from leading and trailing positions in markdown paragraphs #310

Open
tim-evans opened this issue Nov 25, 2019 · 0 comments
Labels
🐞 Bug This is identified as a bug or is fixing a bug
@tim-evans
Collaborator

Following up on #307, we should handle a longer list of Unicode whitespace characters:

| Name | Code Point | Entity | Size |
| --- | --- | --- | --- |
| No Break Space | \u00A0 |   | 👉 👈 |
| En Quad | \u2000 |   | 👉 👈 |
| Em Quad | \u2001 |   | 👉 👈 |
| En Space | \u2002 |   | 👉 👈 |
| Em Space | \u2003 |   | 👉 👈 |
| Thick Space | \u2004 |   | 👉 👈 |
| Mid Space | \u2005 |   | 👉 👈 |
| Six-per-em Space | \u2006 |   | 👉 👈 |
| Figure Space | \u2007 |   | 👉 👈 |
| Punctuation Space | \u2008 |   | 👉 👈 |
| Thin Space | \u2009 |   | 👉 👈 |
| Hair Space | \u200A |   | 👉 👈 |
| Zero Width Space | \u200B | ​ | 👉​👈 |
| Narrow No-break Space | \u202F |   | 👉 👈 |
| Medium Mathematical Space | \u205F |   | 👉 👈 |
| Ideographic Space | \u3000 | 　 | 👉　👈 |
| Zero Width No-break Space | \uFEFF | ﻿ | 👉﻿👈 |

I think this is a fairly exhaustive list of spaces, but if any more should be added, please comment 😄
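
For illustration, here is a minimal sketch of how the code points in the table could be matched at the start and end of a paragraph so they can be kept instead of dropped. The names `UNICODE_SPACES`, `SplitParagraph`, and `splitUnicodeWhitespace` are hypothetical and are not part of the project's API; how the renderer would then emit the captured whitespace (as entities, escapes, or raw characters) is left open.

```typescript
// Code points from the table above, written as escapes so they stay visible.
// \u2000-\u200B covers En Quad through Zero Width Space.
// Hypothetical constant; not taken from the project's source.
const UNICODE_SPACES = "\u00A0\u2000-\u200B\u202F\u205F\u3000\uFEFF";

const LEADING = new RegExp(`^[${UNICODE_SPACES}]+`);
const TRAILING = new RegExp(`[${UNICODE_SPACES}]+$`);

interface SplitParagraph {
  leading: string; // Unicode whitespace at the start of the text
  body: string; // everything in between, interior whitespace untouched
  trailing: string; // Unicode whitespace at the end of the text
}

// Splits text into leading whitespace, body, and trailing whitespace,
// so that a serializer can handle the edges explicitly.
function splitUnicodeWhitespace(text: string): SplitParagraph {
  const leadingMatch = LEADING.exec(text);
  const leading = leadingMatch ? leadingMatch[0] : "";
  const rest = text.slice(leading.length);

  const trailingMatch = TRAILING.exec(rest);
  const trailing = trailingMatch ? trailingMatch[0] : "";
  const body = rest.slice(0, rest.length - trailing.length);

  return { leading, body, trailing };
}
```

One thing worth keeping in mind when testing: JavaScript's built-in `String.prototype.trim()` removes most of these characters but not the Zero Width Space (`\u200B`), so a check based on `trim()` alone would not cover the whole list.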

tim-evans added the 🐞 Bug label on Nov 25, 2019
tim-evans added this to To do in 🏭 Work on Feb 28, 2020
JJMcConnell added a commit to JJMcConnell/atjson that referenced this issue Sep 27, 2020