v3.0.0 #19
HolyWu
announced in
Announcements
v3.0.0
#19
Replies: 1 comment
-
BenchmarkConfiguration: NVIDIA RTX 3050, driver 526.47, Windows 10 21H2, VS R60, Python 3.10.8, 1080p FP16, model 4.6 RIFE-ncnn-Vulkan
vsmlrt-ORT_CUDA
vsmlrt-TRT
vs-rife with PyTorch 1.13.0+cu117 + cuDNN 8.6.0 + TensorRT 8.5.1.7 + Torch-TensorRT 1.2.0
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
model
paramter to support v4.0~v4.6 models.ensemble
parameter to smooth predictions in areas where the estimation is uncertain.multi
parameter withfactor_num
,factor_den
,fps_num
andfps_den
for rational frame rate change.sc
andsc_threshold
parameters for scene change detection.cuda_graphs
parameter to use CUDA Graphs.fusion
parameter to enable fusion through nvFuser.device_type
parameter. No one bothers to run deep learning inference on CPU anyway.num_streams
parameter for parallel execution.fp16
parameter and now it's controlled by the format of the clip.RGBH
format uses FP16 mode andRGBS
format uses FP32 mode.trt
,trt_max_workspace_size
, andtrt_cache_path
parameters for TensorRT support.With the usage of TensorRT, it should run at least 40~50% faster than previous version or RIFE-ncnn-Vulkan implementation using FP16 mode on GPUs with Tensor Cores. For ease of installation on Windows, you can download the CUDA 7z file which contains required runtime libraries and Python wheel file. Either add the unzipped directory to your system
PATH
or copy the DLL files to a directory which already in your systemPATH
. Finallypip install
the Python wheel file.This discussion was created from the release v3.0.0.
Beta Was this translation helpful? Give feedback.
All reactions