Sample of serving a PyTorch model with TensorRT Inference Server.
In this sample, TensorRT and ONNX are used as the model formats.
NOTICE
TensorRT Inference Server was renamed Triton Inference Server in March 2020.
torch==1.7.1
onnx==1.6.0
onnxruntime==1.4.0
tensorrt==6.0.1.5
tensorrtserver==1.11.0
nvcr.io/nvidia/tensorrtserver:19.10-py3
cuda:10.1
cudnn:7.5.0
onnx-tensorrt:6.0
cuDNN 7.5.0 is important: I failed to build onnx-tensorrt 6.0 with cuDNN 7.6.4.
# Train the PyTorch model
python 01_train_model_with_torch.py
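01_train_model_with_torch.py trains the model. A minimal sketch of this step, assuming a small MNIST-style classifier (the real architecture and data pipeline live in the script):

```python
# Minimal sketch of the training step (architecture and data are assumptions).
import torch
import torch.nn as nn

model = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 128), nn.ReLU(), nn.Linear(128, 10))
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# Dummy batch; replace with a real DataLoader (e.g. MNIST).
x = torch.randn(32, 1, 28, 28)
y = torch.randint(0, 10, (32,))

for _ in range(10):
    optimizer.zero_grad()
    loss = criterion(model(x), y)
    loss.backward()
    optimizer.step()

torch.save(model.state_dict(), "model.pth")
```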
# Convert the PyTorch model to an ONNX model
python 02_pth_to_onnx.py
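The conversion boils down to torch.onnx.export. A hedged sketch, assuming the architecture above and the file names model.pth / model.onnx:

```python
# Sketch of the PyTorch -> ONNX conversion; file names and input shape are assumptions.
import torch
import torch.nn as nn

# Rebuild the network and load the trained weights (architecture is an assumption).
model = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 128), nn.ReLU(), nn.Linear(128, 10))
model.load_state_dict(torch.load("model.pth"))
model.eval()

# Export with fixed input/output names so later steps can refer to them.
dummy_input = torch.randn(1, 1, 28, 28)
torch.onnx.export(
    model,
    dummy_input,
    "model.onnx",
    input_names=["input"],
    output_names=["output"],
    opset_version=9,  # kept low for onnx 1.6 / TensorRT 6 compatibility (assumption)
)
```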
# Inference example using the ONNX model locally
python 03_onnxruntime_local.py
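Local ONNX inference only needs onnxruntime; a minimal sketch (tensor names follow the export sketch above):

```python
# Run the exported ONNX model locally with ONNX Runtime.
import numpy as np
import onnxruntime as ort

sess = ort.InferenceSession("model.onnx")
input_name = sess.get_inputs()[0].name
x = np.random.randn(1, 1, 28, 28).astype(np.float32)
outputs = sess.run(None, {input_name: x})  # None -> return all outputs
print(outputs[0].shape)
```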
# Convert the ONNX model to a TensorRT model
./04_onnx_to_tensorrt.sh
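The shell script drives the ONNX-to-TensorRT conversion with onnx-tensorrt 6.0. Roughly the same result can be obtained from the TensorRT 6 Python API; a hedged sketch, with file names as assumptions:

```python
# Rough equivalent of the conversion step using the TensorRT 6 Python API
# (the repository's script uses onnx-tensorrt; file names are assumptions).
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

with trt.Builder(TRT_LOGGER) as builder, \
     builder.create_network() as network, \
     trt.OnnxParser(network, TRT_LOGGER) as parser:
    builder.max_batch_size = 1
    builder.max_workspace_size = 1 << 30  # 1 GiB of workspace
    with open("model.onnx", "rb") as f:
        if not parser.parse(f.read()):
            raise RuntimeError(parser.get_error(0))
    engine = builder.build_cuda_engine(network)

with open("model.trt", "wb") as f:
    f.write(engine.serialize())
```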
# Inference example using the TensorRT model locally
python 05_tensorrt_local.py
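A hedged sketch of running the serialized engine locally, assuming an implicit-batch engine with one input and one output binding (requires pycuda):

```python
# Deserialize the engine and run one batch on the GPU
# (binding layout, shapes, and file name are assumptions).
import numpy as np
import pycuda.autoinit  # noqa: F401  creates a CUDA context
import pycuda.driver as cuda
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)
with open("model.trt", "rb") as f, trt.Runtime(TRT_LOGGER) as runtime:
    engine = runtime.deserialize_cuda_engine(f.read())

with engine.create_execution_context() as context:
    h_input = np.random.randn(1, 1, 28, 28).astype(np.float32)
    h_output = np.empty(trt.volume(engine.get_binding_shape(1)), dtype=np.float32)
    d_input = cuda.mem_alloc(h_input.nbytes)
    d_output = cuda.mem_alloc(h_output.nbytes)
    cuda.memcpy_htod(d_input, h_input)
    context.execute(batch_size=1, bindings=[int(d_input), int(d_output)])
    cuda.memcpy_dtoh(h_output, d_output)
    print(h_output)
```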
# Copy and rename models for TensorRT Inference Server
./06_prepare_model.sh
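This step builds the model repository the server expects: one directory per model, a numeric version subdirectory containing the model file, plus a config.pbtxt. A hedged Python sketch of that layout (model names, source paths, and config contents are assumptions):

```python
# Sketch of the model-repository layout TensorRT Inference Server expects
# (model names and source paths are assumptions).
import shutil
from pathlib import Path

repo = Path("model_repository")
models = [
    ("sample_onnx", "model.onnx", "model.onnx"),  # platform: onnxruntime_onnx
    ("sample_trt", "model.trt", "model.plan"),    # platform: tensorrt_plan
]
for name, src, dst in models:
    version_dir = repo / name / "1"
    version_dir.mkdir(parents=True, exist_ok=True)
    shutil.copy(src, version_dir / dst)
    # A config.pbtxt describing platform, inputs, and outputs sits next to "1/".
```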
# Run TensorRT Inference Server
./07_run_tensorrt_inference_server.sh
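Once the container is running, the v1 HTTP endpoint (port 8000 by default) can be polled before sending requests; a small sketch:

```python
# Check server readiness over the v1 HTTP API (assumes the default port 8000).
import requests

resp = requests.get("http://localhost:8000/api/health/ready")
print(resp.status_code)  # 200 once all models are loaded
```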
# Inference example using the ONNX and TensorRT models with TensorRT Inference Server
python 08_tensorrt_inferense_server_client.py
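The client script uses the tensorrtserver Python package (v1 API). A rough sketch; the model, input, and output names and shapes are assumptions and must match the config.pbtxt:

```python
# Rough sketch of the tensorrtserver (v1) Python client; names/shapes are assumptions.
import numpy as np
from tensorrtserver.api import InferContext, ProtocolType

ctx = InferContext("localhost:8000", ProtocolType.HTTP, "sample_onnx", -1)  # -1 = latest version
x = np.random.randn(1, 28, 28).astype(np.float32)  # per-item shape, without the batch dim
result = ctx.run(
    {"input": [x]},                             # one array per item in the batch
    {"output": InferContext.ResultFormat.RAW},  # return raw output tensors
    batch_size=1,
)
print(result["output"][0])
```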
onnx_model_cli helps check the structure of an ONNX model.
It is inspired by TensorFlow's saved_model_cli.
python onnx_model_cli.py show --path foo.onnx
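A minimal sketch of what such a show subcommand can do with the onnx package (not necessarily the actual implementation of onnx_model_cli.py):

```python
# Print graph inputs/outputs of an ONNX model, roughly what "show" reports.
import onnx

model = onnx.load("foo.onnx")
for label, values in (("inputs", model.graph.input), ("outputs", model.graph.output)):
    print(label)
    for v in values:
        dims = [d.dim_param or d.dim_value for d in v.type.tensor_type.shape.dim]
        print(f"  {v.name}: {dims}")
```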