Skip to content

Neural network model that predicts the number of syllables in an English word. It shows its creation end-to-end: from data collection to evaluation of various models. One of the explored models is used in the Readgauge app.

License

Notifications You must be signed in to change notification settings

FrederickRoman/syllable-count-predictor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

syllable-count-predictor

Neural network model that predicts the number of syllables in an English word. It shows its creation end-to-end: from data collection to evaluation of various models. This is the model design followed as part of the making of the reading level scoring app Readgauge.

Readgauge logo

Screenshot from readgauge/about

Getting Started

This repo has both the data and the code to run the models. All you need to do is to meet the prerequisites.

Prerequisites

Python>=3.8.6

nltk
pandas 
numpy 
tensorflow 

Preprocessing

Syllable count dictionary creation

Run the jupyter notebook cells in train.ipynb under /preprocess/syllable_count_dict_creation

Synthetic syllable count dictionary creation (for data augmentation)

python ./ML/preprocess/data_synthesizer/data_synthesizer.py 

Training

Run the jupyter notebook cells in train.ipynb under training/feedforward or under training/blstm.

External deployment (not on this repo)

These model were trained to find one to be integrated to to the Readgauge client-side web app. It runs live here and its repository is here.

Readgauge logo Results mockup

Data source

The syllableCountDict dataset contains the syllable count of each word

It was created using nltk's built-in CMU dictionary.

The Carnegie Mellon Pronouncing Dictionary [cmudict.0.6] Copyright 1998 Carnegie Mellon University

About

Neural network model that predicts the number of syllables in an English word. It shows its creation end-to-end: from data collection to evaluation of various models. One of the explored models is used in the Readgauge app.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published