
Fast Oriented Text Spotting with a Unified Network (FOTS)

This repository is an unofficial PyTorch implementation of the FOTS paper. Here is the FOTS model architecture:

[Figure: FOTS model architecture]
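
For orientation, here is a minimal, hedged sketch of the two branches that sit on top of the shared convolutional features: an EAST-style detection head (per-pixel text score, four side distances and a rotation angle) and a CRNN-style recognition head. RoIRotate, which crops and axis-aligns per-instance features between the two, is omitted. Module names, channel sizes and the angle parameterization are assumptions for illustration, not this repository's actual code.

```python
# Minimal sketch of the FOTS layout: shared conv features -> detection head
# -> RoIRotate (omitted) -> recognition head. Illustrative only; module
# names, channel sizes and parameterizations are assumptions, not this
# repository's actual code.
import math
import torch
import torch.nn as nn

class DetectionHead(nn.Module):
    """EAST-style head: per-pixel text score, 4 side distances, 1 rotation angle."""
    def __init__(self, in_ch=256):
        super().__init__()
        self.score = nn.Conv2d(in_ch, 1, 1)
        self.dists = nn.Conv2d(in_ch, 4, 1)   # distances to the 4 sides of the rotated box
        self.angle = nn.Conv2d(in_ch, 1, 1)   # rotation angle of the box

    def forward(self, feat):
        score = torch.sigmoid(self.score(feat))
        dists = torch.relu(self.dists(feat))                       # non-negative distances
        angle = (torch.sigmoid(self.angle(feat)) - 0.5) * math.pi  # roughly (-pi/2, pi/2)
        return score, torch.cat([dists, angle], dim=1)

class RecognitionHead(nn.Module):
    """CRNN-style recognizer over RoIRotate-cropped, height-normalized features."""
    def __init__(self, in_ch=256, num_classes=95):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(in_ch, 256, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d((1, None)))                       # collapse the height axis
        self.rnn = nn.LSTM(256, 256, bidirectional=True, batch_first=True)
        self.fc = nn.Linear(512, num_classes)                      # per-step logits for CTC

    def forward(self, roi_feat):                                   # (N, C, H, W) text crops
        seq = self.conv(roi_feat).squeeze(2).permute(0, 2, 1)      # (N, W, 256)
        out, _ = self.rnn(seq)
        return self.fc(out)                                        # (N, W, num_classes)
```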

There are a couple of existing PyTorch implementations with very slow training, because the data preprocessing / ground-truth generation step is CPU-bound and the GPU sits idle waiting for each batch of data. In this repo, preprocessing can be done separately (on the CPU) and the preprocessed data can then be used to train the model, which improves training speed significantly and saves GPU time. A sketch of this idea is given below.
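
The sketch below illustrates the idea, assuming a hypothetical make_gt_maps helper that stands in for the CPU-heavy ground-truth generation: it is run once to cache arrays to disk, and a lightweight Dataset then only loads the cached files during training. File names and keys are illustrative, not this repo's actual on-disk format.

```python
# Sketch of "preprocess once on the CPU, train later on the GPU".
# `make_gt_maps` is a hypothetical helper standing in for the CPU-heavy
# ground-truth generation; file names and keys are illustrative.
import glob
import numpy as np
import torch
from torch.utils.data import Dataset

def preprocess_to_disk(image_paths, out_dir, make_gt_maps):
    """Run ground-truth generation once and cache the results as .npz files."""
    for i, path in enumerate(image_paths):
        image, score_map, geo_map, train_mask = make_gt_maps(path)
        np.savez_compressed(f"{out_dir}/sample_{i}.npz",
                            image=image, score_map=score_map,
                            geo_map=geo_map, train_mask=train_mask)

class PreprocessedDataset(Dataset):
    """Training-time dataset that only loads cached arrays, so the GPU is not kept waiting."""
    def __init__(self, out_dir):
        self.files = sorted(glob.glob(f"{out_dir}/*.npz"))

    def __len__(self):
        return len(self.files)

    def __getitem__(self, idx):
        data = np.load(self.files[idx])
        return {key: torch.from_numpy(data[key]) for key in data.files}
```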

There are currently TWO separate branches in this repository:

  1. detection: implements text detection only. This branch is fully tested and works as expected.
  2. recognition: implements both text detection and recognition, i.e. end-to-end FOTS. This branch is currently under development.

The detection branch was trained on a very small subset (10K images) of the SynthText dataset for 25 epochs because of hardware limitations. Here are a few results from the detection branch:

These results can be improved further by training on the entire 800K-image SynthText dataset and then fine-tuning on ICDAR-2015 images.

Requirements

To install dependencies:

pip install -r requirements.txt

Training

To train the model on the SynthText dataset, first run the preprocessing script:

python FOTS-PyTorch/preprocess.py -c config.json

The sample config files with all the available options are given in the config/ dir.

After all the data is preprocessed, run the following command to start training:

python FOTS-PyTorch/train.py -c train_config.json

Hyperparameters used for training are available in the training configuration file under config/ dir.
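
For reference, the detection objective in EAST/FOTS-style models combines a classification loss on the score map with an IoU loss on the four side distances and a cosine loss on the angle. The sketch below uses a dice loss for classification; the exact loss terms and weights used by this repository's training config may differ.

```python
# Hedged sketch of an EAST/FOTS-style detection loss: dice loss on the score
# map plus IoU + angle losses on the geometry maps. Weightings and the exact
# classification loss are assumptions, not necessarily what this repo uses.
import torch

def detection_loss(score_pred, score_gt, geo_pred, geo_gt, train_mask,
                   lambda_reg=1.0, lambda_theta=10.0):
    # Dice loss on the per-pixel text/non-text score map.
    inter = torch.sum(score_pred * score_gt * train_mask)
    union = torch.sum(score_pred * train_mask) + torch.sum(score_gt * train_mask) + 1e-5
    cls_loss = 1.0 - 2.0 * inter / union

    # Geometry: 4 side distances + 1 angle channel, supervised only on text pixels.
    d1_p, d2_p, d3_p, d4_p, theta_p = torch.split(geo_pred, 1, dim=1)
    d1_g, d2_g, d3_g, d4_g, theta_g = torch.split(geo_gt, 1, dim=1)
    area_p = (d1_p + d3_p) * (d2_p + d4_p)
    area_g = (d1_g + d3_g) * (d2_g + d4_g)
    w_i = torch.min(d2_p, d2_g) + torch.min(d4_p, d4_g)
    h_i = torch.min(d1_p, d1_g) + torch.min(d3_p, d3_g)
    inter_area = w_i * h_i
    union_area = area_p + area_g - inter_area
    iou_loss = -torch.log((inter_area + 1.0) / (union_area + 1.0))
    angle_loss = 1.0 - torch.cos(theta_p - theta_g)

    mask = score_gt * train_mask
    reg_loss = torch.sum((iou_loss + lambda_theta * angle_loss) * mask) / (torch.sum(mask) + 1e-5)
    return cls_loss + lambda_reg * reg_loss
```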

Evaluation

To evaluate the trained model on any image:

python FOTS-PyTorch/eval.py -m "<model_path>" -i "<input_dir>" -o "<output_dir>"

For more information, check the demo notebooks available under notebooks/ dir.
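
As a rough illustration of what evaluation involves, the sketch below decodes a predicted score map and geometry map into rotated quadrilaterals by thresholding the score and restoring each box from the per-pixel distances and angle. It is purely illustrative (no NMS, simplified rotation handling); the repo's eval.py may differ.

```python
# Rough sketch of decoding score/geometry maps into rotated boxes at
# evaluation time. Purely illustrative; the repo's eval.py may differ.
import numpy as np

def decode_boxes(score_map, geo_map, score_thresh=0.8, scale=4):
    """score_map: (H, W); geo_map: (5, H, W) = 4 side distances + angle."""
    boxes = []
    ys, xs = np.where(score_map > score_thresh)
    for y, x in zip(ys, xs):
        top, right, bottom, left, theta = geo_map[:, y, x]
        cx, cy = x * scale, y * scale          # map feature coords back to image coords
        cos_t, sin_t = np.cos(theta), np.sin(theta)
        # Corners of an axis-aligned box around (cx, cy), then rotate by theta.
        corners = np.array([[-left, -top], [right, -top],
                            [right, bottom], [-left, bottom]], dtype=np.float32)
        rot = np.array([[cos_t, -sin_t], [sin_t, cos_t]], dtype=np.float32)
        quad = corners @ rot.T + np.array([cx, cy], dtype=np.float32)
        boxes.append((float(score_map[y, x]), quad))
    return boxes  # a real pipeline would apply (locality-aware) NMS here
```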

TODO:

  • Implement Detection Branch
  • Implement Recognition Branch
  • Validate both the Detection and Recognition Branches
  • Full training on SynthText Dataset
  • Combine "detection" and "recognition" branches and make training mode configurable

References:

  • FOTS: Fast Oriented Text Spotting with a Unified Network, Liu et al., CVPR 2018.
