Scripts or algorithms for automatic building of dictionaries based on plain text or wordlist input #177
Replies: 1 comment
-
This is a bit outside of the scope of the tool itself. For shorthands, this is obviously easier (I'm including the general approach for how I built the English one in the file, and reprinted below for reference.) What you are suggesting makes a lot of sense. (Shorter words, rare words ... Completely agree, that's what goes into the considerations.) For the ergonomically feasible -- I sometimes use a 'mirroring' technique. See discussed under #8. This might, or might not make sense for other people. You start running into conflicts pretty quickly, and how to best resolve them (again, with all the considerations), is pretty tricky to automate (or to automate to make sense). How the English shorthand (not chord) dictionary was built: This dictionary is based on the list of 5,000 most frequent English words at https://en.wiktionary.org/wiki/Wiktionary:Frequency_lists/A_Frequency_Dictionary_of_Contemporary_American_English) which has 4374 entries after removing duplicates. Sequences of typed keys that trigger the shorthand recognition are separated from the expanded full words by Tab. ZipChord ignores lines without Tab, lines starting with Tab (i.e. where the first column is empty), and text in additional columns. You can change existing shorthands and add your own. To make this shorthand dictionary predictable, these default shortcuts adheres to consistent principles:
|
Beta Was this translation helpful? Give feedback.
-
In addition to English, I also type in other languages, and generally jargon-heavy text. It would be nice to have a tool to generate a chord/shorthand dictionary based on a plaintext file of sample text, or from just a word list with word frequencies included. This would allow using ZipChord in other languages quickly, and should generalize quite well to any specific type of text.
I don't have any background in theory regarding optimal shortcuts or chords, but the following might be something to consider:
I tried searching around for such a script or algorithm, but couldn't find anything satisfactory; please enlighten me if this has already been trivially solved somewhere.
Beta Was this translation helpful? Give feedback.
All reactions