You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Recently I wrote this tiny bit of code to take the Wiktionary XML dumps and create a SQLite database out of it. I enabled full-text search with sqlite-utils, and - it looks great!
Now I'm wondering whether there's a way to parse that MediaWiki text to create something human readable. I tried pandoc, but at least without writing a custom filter it leaves a lot of noise hen I try to convert to Markdown / HTML / ASCII.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Hey simonw and friends,
Recently I wrote this tiny bit of code to take the Wiktionary XML dumps and create a SQLite database out of it. I enabled full-text search with
sqlite-utils
, and - it looks great!Now I'm wondering whether there's a way to parse that MediaWiki text to create something human readable. I tried
pandoc
, but at least without writing a custom filter it leaves a lot of noise hen I try to convert to Markdown / HTML / ASCII.Does anyone here have any ideas?
Beta Was this translation helpful? Give feedback.
All reactions