cedriclmenard/irislandmarks.pytorch

This is a PyTorch implementation of the paper Real-time Pupil Tracking from Monocular Video for Digital Puppetry (https://arxiv.org/pdf/2006.11341).

This version doesn't include BatchNorm layers, so it isn't suited to fine-tuning out of the box. If you want to use the model for training, you should add these layers manually.
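
One way to do that, as a very rough sketch: walk the module tree and insert a `nn.BatchNorm2d` after every `Conv2d`. This assumes the converted network is built from plain `nn.Sequential` blocks, which may not hold for every layer in this repository, so treat it as a starting point rather than a drop-in patch.

```python
import torch.nn as nn

def add_batchnorm(module: nn.Module) -> None:
    """Insert a BatchNorm2d after every Conv2d found in nn.Sequential blocks.

    Rough sketch only: assumes the model is composed of nn.Sequential containers
    and does not recurse into nested Sequentials.
    """
    for name, child in module.named_children():
        if isinstance(child, nn.Sequential):
            layers = []
            for layer in child:
                layers.append(layer)
                if isinstance(layer, nn.Conv2d):
                    # A freshly initialized BatchNorm2d starts out close to identity,
                    # so the pretrained convolution weights remain usable.
                    layers.append(nn.BatchNorm2d(layer.out_channels))
            setattr(module, name, nn.Sequential(*layers))
        else:
            add_batchnorm(child)
```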

I made the conversion semi-manually, using a method similar to the one used for the BlazeFace and FaceMesh PyTorch implementations.

Input to the model is expected to be a cropped iris image, normalized to the range -1.0 to 1.0. The crop should be 64x64 pixels and centered at the mean of the eye contour points as given by FaceMesh.

To get the right scaling, take the 192x192 cropped face image used as input to the FaceMesh model, compute the average eye contour position for each eye, and crop a 64x64 rect centered at that average position from the 192x192 image. A rough sketch of this cropping step is shown below.
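
The array shapes and helper name below are illustrative, not part of this repository's API; it's just a sketch of the crop-and-normalize step described above.

```python
import numpy as np

def crop_eye(face_192: np.ndarray, eye_contour_xy: np.ndarray, size: int = 64) -> np.ndarray:
    """Crop a size x size eye patch from the 192x192 FaceMesh input and normalize it.

    face_192:       the 192x192 RGB crop fed to FaceMesh, shape (192, 192, 3), uint8
    eye_contour_xy: (N, 2) array of eye-contour landmark (x, y) positions in that crop
    """
    cx, cy = eye_contour_xy.mean(axis=0)  # average eye contour position
    half = size // 2
    x0 = int(round(cx)) - half
    y0 = int(round(cy)) - half
    # Clamp so the crop window stays inside the 192x192 image.
    x0 = max(0, min(x0, face_192.shape[1] - size))
    y0 = max(0, min(y0, face_192.shape[0] - size))
    patch = face_192[y0:y0 + size, x0:x0 + size]
    # Normalize uint8 [0, 255] to the [-1.0, 1.0] range the model expects.
    return patch.astype(np.float32) / 127.5 - 1.0
```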

However, the predict_on_image function normalizes the image itself, so you can pass the resized crop directly as a np.array.
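
A minimal usage sketch along those lines is below. The module/class name, weight filename, and the (eye contour, iris) return pair are assumed from the companion BlazeFace/FaceMesh ports; check the inference notebook for the exact API.

```python
import cv2
import torch

from irislandmarks import IrisLandmarks  # module and class name assumed

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
net = IrisLandmarks().to(device)
net.load_weights("irislandmarks.pth")  # weight filename assumed

# 64x64 eye crop as an HxWx3 uint8 np.array; predict_on_image normalizes it internally.
img = cv2.cvtColor(cv2.imread("eye_crop.png"), cv2.COLOR_BGR2RGB)
eye_contour, iris = net.predict_on_image(img)  # return values assumed: eye contour + iris landmarks
print(eye_contour.shape, iris.shape)
```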

See the Inference-IrisLandmarks.ipynb notebook for a usage example and the Convert-Iris.ipynb notebook for a conversion example.

All other files were just me figuring stuff out. You can take a look at the (very rough) code I used if you're trying something similar.

About

PyTorch implementation of Google's MediaPipe Iris Landmark model. The original code uses TFLite and their mediapipe workflow, which wouldn't work well with my codebase.
