Skip to content

Morpological variants of Sinhala words. Extracted from FastText 300 si

License

Notifications You must be signed in to change notification settings

brainsharks-fyp17/morphdb-si

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 

Repository files navigation

morphdb-si

Morpological variants of Sinhala words.
Extacted from a FastText-300 model released by Facebook https://fasttext.cc/.
Contains a .json
Ex:

  "යටත්විජිතයන්හි": [
    "යටත්විජිතයන්",
    "යටත්විජිතයන්ගෙද",
    "යටත්විජිතයේ",
    "යටත්විජිතයට",
    "යටත්විජිතයෙහි",
    "යටත්විජිතමය",
    "යටත්විජිතය",
    "යටත්විජිතයෙන්",
    "යටත්විජිතකරණයේ",
    "යටත්විජිතයක්වූ"
  ],
  "නිවසයෙන්": [
    "නිවසවල",
    "නිවසේමය"
  ],
  "වැස්සකට": [
    "වැස්සකදි",
    "වැස්සකදී",
    "වැස්සකටත්",
    "වැස්සක",
    "වැස්සකි",
    "වැස්සක්ද",
    "වැස්සක්ව",
    "වැස්සක්ම",
    "වැස්සකුත්"
  ],

This dataset contains 312,397 keys and their morphological forms

Releases

No releases published

Packages

No packages published