A Python wrapper for the Hidden Markov Model ToolKit

HTK is a venerable open-source modelling tool, which helped generations of linguists make state-of-the-art models of speech. Once upon a time, anyway; you have no reason not to use NLTK or hmmlearn these days.

If, like me, you're forced to use it by some artificial constraint, you'll find that it is batch-only, requires hundreds of intermediate files for most processes, often takes 10 ordered command arguments, has appalling C99 error messages, crashes if it finds or does not find newlines in specific places, and extremely dense docs. This wrapper makes using it a bit less painful.

The wrapper doesn't really reflect HTK's generality: it builds speaker models from wavs. My usecase took raw speech files from a pair of interlocutors, Labb-Cat annotations for their conversation, built models for each speaker, and then reported their overall 'accommodation' to their interlocutor over time.

Usage

Install HTK and Python3.
Get speech data, annotate it.
Point Configs.root at your files.
Run like so : python main.py statesPerHmm=3 vectorType=LPC iterations=10

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
wrapper		wrapper
LICENSE		LICENSE
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

wrapper

wrapper

LICENSE

LICENSE

README.md

README.md

main.py

main.py

Repository files navigation

A Python wrapper for the Hidden Markov Model ToolKit

Usage

About

Releases

Packages

Languages

License

g-leech/Py2HTK

Folders and files

Latest commit

History

Repository files navigation

A Python wrapper for the Hidden Markov Model ToolKit

Usage

About

Topics

Resources

License

Stars

Watchers

Forks

Languages