Python Book Analyzer

A command line tool to analyze large swathes of text data to look for symbols, total and distinct words, lexical richness, word dispersion, hapax legomena and collocation.

The project is to be completed in the following steps:

Read the text data from .txt file.
Determine the number of total words.
Determine the number of distinct words.
How about lexical richness of the text. Lexical richness is the ratio of distinct words to total words.
What are the most commonly used words.
(Maybe) Character correlation
(Maybe) Sentiment Analysis

(Maybe) is to be completed if time permits.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
text_analyzer.py		text_analyzer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Python Book Analyzer

The project is to be completed in the following steps:

About

Releases

Packages

Contributors 2

Languages

License

harsharaman/python-book-analyzer

Folders and files

Latest commit

History

Repository files navigation

Python Book Analyzer

The project is to be completed in the following steps:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages