New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Incorrect hyphenation in situations including dashes for some languages (e.g. polish) #3235
Comments
I would also like to link the English documentation for the LaTeX polski package providing LaTeX macros for working around this issue. It also defines macros for en-dash ( |
Just for the sharing in return, after seeing this, SILE implemented the intended behavior for Polish in v0.14.15. On our way, we discussed and referenced (but left unaddressed for now, at the time I'm writing this message) which other languages could have the same feature, i.e. (possibly) Czech, Slovak, Portuguese, Spanish Just to be real clear here (one never knows), my intent is to share information back -- not to brag about what SILE does or not -- in the open-minded spirit that we all need to have good solutions for language-specific concerns. |
I can confirm that this behavior is desired in Czech, Slovak, Lower Sorbian (probably also Upper Sorbian), and Croatian as well, as described in their orthographic manuals (i.e. Pravidla českého pravopisu, Pravidlá slovenského pravopisu, Dolnoserbski pšawopis, Hrvatski pravopis):
|
Portuguese grammar also has this hyphenation particularity. According to the Base XX of the "Acordo Ortográfico da Língua Portuguesa de 1990",
Below I bring short excerpts from practical articles that teach how to hyphenate such kind of word.
Footnotes
|
In Spanish we must repeat the hyphen too, according to the RAE1.
... except when the next word is capitalized. E.g.: Ruiz-/ Giménez must not repeat the hyphen on the next line.
Footnotes |
- Czech - Croatian - Lower Sorbian - Polish - Portuguese - Slovak - Spanish Fix typst#3235
- Czech - Croatian - Lower Sorbian - Polish - Portuguese - Slovak - Spanish Fix typst#3235
- Czech - Croatian - Lower Sorbian - Polish - Portuguese - Slovak - Spanish Fix typst#3235
- Czech - Croatian - Lower Sorbian - Polish - Portuguese - Slovak - Spanish Fix typst#3235
- Czech - Croatian - Lower Sorbian - Polish - Portuguese - Slovak - Spanish Fix typst#3235
- Czech - Croatian - Lower Sorbian - Polish - Portuguese - Slovak - Spanish Fix typst#3235
- Czech - Croatian - Lower Sorbian - Polish - Portuguese - Slovak - Spanish Fix typst#3235
- Czech - Croatian - Lower Sorbian - Polish - Portuguese - Slovak - Spanish Fix typst#3235
- Czech - Croatian - Lower Sorbian - Polish - Portuguese - Slovak - Spanish Fix typst#3235
- Czech - Croatian - Lower Sorbian - Polish - Portuguese - Slovak - Spanish Fix typst#3235
- Czech - Croatian - Lower Sorbian - Polish - Portuguese - Slovak - Spanish Fix typst#3235
Description
In some languages—in polish[1], for example—while dividing a phrase consisted of words joined with a hyphen, it is required to split it on the hyphen, leaving it at the end of the line, but also to repeat it in the beginning of a new line. Typst doesn't do it. A minimal example is the following.
The correct hyphenation would be
I wasn't sure whether it should be a bug, or a feature report. I settled on a bug caused by the missing feature.
1: https://web.archive.org/web/20240120154340/https://www.ortograf.pl/zasady-pisowni/lacznik-zasady-pisowni
Reproduction URL
No response
Operating system
Linux
Typst version
The text was updated successfully, but these errors were encountered: