
Real-time 3D Multi-person Pose Estimation Demo on jetson TX2

This repository was originally forked from Daniil-Osokin/lightweight-human-pose-estimation-3d-demo.pytorch and modified to use a TensorRT engine on a Jetson TX2 device, taking about 110 ms per frame.

The major part of this work was done by Mariia Ageeva, when she was the 🔝🚀🔥 intern at Intel.

Table of Contents

  • Requirements
  • Prerequisites
  • Pre-trained model
  • Running
  • Inference with TensorRT

Requirements

  • Python 3.5 (or above)
  • CMake 3.10 (or above)
  • C++ Compiler (g++ or MSVC)
  • OpenCV 4.0 (or above)
  • TensorRT engine

TensorRT is used for fast inference on an NVIDIA GPU (a Jetson TX2 in this case).
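
To confirm the toolchain meets these minimums, a few standard version checks (the cv2 check assumes the OpenCV Python bindings are installed):

python3 --version   # expect 3.5 or above
cmake --version     # expect 3.10 or above
python3 -c "import cv2; print(cv2.__version__)"   # expect 4.x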

Prerequisites

  1. Install requirements:
pip install -r requirements.txt
  2. Build the pose_extractor module:
python setup.py build_ext
  3. Add the build folder to PYTHONPATH (you can then verify the build as shown below):
export PYTHONPATH=pose_extractor/build/:$PYTHONPATH
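
To verify the module is importable, a quick check (assuming pose_extractor exposes extract_poses as in the upstream repository):

python -c "from pose_extractor import extract_poses; print('pose_extractor OK')"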

Pre-trained model

The pre-trained model is available on Google Drive.

Running

To run the demo, pass the path to the pre-trained checkpoint and a camera ID (or a path to a video file):

python demo.py --model human-pose-estimation-3d.pth --video 0

The camera can capture the scene from different view angles, so for correct scene visualization, please pass the camera extrinsics and focal length via the --extrinsics and --fx options respectively (a sample extrinsics file can be found in the data folder). If no camera parameters are provided, the demo uses default ones.
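
For reference, a minimal extrinsics file of the expected shape, with a 3x3 rotation R and a 3x1 translation t (placeholder identity/zero values; the field names are assumed to match the sample in the data folder):

{
    "R": [[1, 0, 0], [0, 1, 0], [0, 0, 1]],
    "t": [[0], [0], [0]]
}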

Inference with TensorRT

To run with TensorRT, the checkpoint first has to be converted to ONNX format, and the ONNX model then built into a TensorRT engine. A pre-built engine for the NVIDIA Jetson TX2 is included in models/ as human-pose-estimation-3d.trt.
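
The upstream repository ships a script for the ONNX conversion step; the invocation below assumes its path and arguments are unchanged from upstream:

python scripts/convert_to_onnx.py --checkpoint-path human-pose-estimation-3d.pth

The engine itself can then be built with the TensorRT Python API: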

import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.VERBOSE)

# The builder-level flags below (max_workspace_size, fp16_mode, ...) are the
# TensorRT 5/6-era API shipped with JetPack for the TX2; newer TensorRT
# versions move them to a separate builder config.
with trt.Builder(TRT_LOGGER) as builder, builder.create_network() as network, trt.OnnxParser(network, TRT_LOGGER) as parser:
    builder.max_workspace_size = 1 << 30  # up to 1 GiB of scratch space for tactic selection
    builder.max_batch_size = 1
    builder.fp16_mode = True  # FP16 roughly halves memory traffic and speeds up inference on the TX2
    with open('human-pose-estimation-3d.onnx', 'rb') as model:
        if not parser.parse(model.read()):  # report parser errors instead of failing silently
            for i in range(parser.num_errors):
                print(parser.get_error(i))
    engine = builder.build_cuda_engine(network)
    with open('human-pose-estimation-3d.trt', 'wb') as f:
        f.write(engine.serialize())
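
As a sanity check that the serialized engine loads on the target device, a minimal sketch using the TensorRT runtime API:

import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

# Deserialize the engine built above; a successful load confirms the file
# matches the TensorRT version and the GPU it will run on.
with open('models/human-pose-estimation-3d.trt', 'rb') as f, trt.Runtime(TRT_LOGGER) as runtime:
    engine = runtime.deserialize_cuda_engine(f.read())
    print('engine loaded:', engine is not None)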

To run the demo with TensorRT inference, pass the --use-tensorrt option and specify the device to infer on:

python demo.py --model models/human-pose-estimation-3d.trt --device GPU --use-tensorrt --video 0