Merlin Inference Support Matrix
This container enables you to deploy NVTabular workflows and HugeCTR or TensorFlow models to the Triton Inference Server for production.
22.xx Container Images
| Container Release | Release 22.04 | Release 22.03 | Release 22.02 | Release 22.01 |
|---|---|---|---|---|
| DGX | | | | |
| DGX System | | | | |
| Operating System | Ubuntu 20.04.4 LTS | Ubuntu 20.04.3 LTS | Ubuntu 20.04.3 LTS | Ubuntu 20.04.3 LTS |
| NVIDIA Certified Systems | | | | |
| NVIDIA Driver | NVIDIA Driver version 465.19.01 or later is required. However, if you’re running on Data Center GPUs (formerly Tesla) such as T4, you can use any of the following NVIDIA Driver versions: Note: The CUDA Driver Compatibility Package does not support all drivers. | NVIDIA Driver version 465.19.01 or later is required. However, if you’re running on Data Center GPUs (formerly Tesla) such as T4, you can use any of the following NVIDIA Driver versions: Note: The CUDA Driver Compatibility Package does not support all drivers. | NVIDIA Driver version 465.19.01 or later is required. However, if you’re running on Data Center GPUs (formerly Tesla) such as T4, you can use any of the following NVIDIA Driver versions: Note: The CUDA Driver Compatibility Package does not support all drivers. | NVIDIA Driver version 465.19.01 or later is required. However, if you’re running on Data Center GPUs (formerly Tesla) such as T4, you can use any of the following NVIDIA Driver versions: Note: The CUDA Driver Compatibility Package does not support all drivers. |
| GPU Model | | | | |
| Base Container Image | | | | |
| Container Operating System | Ubuntu 20.04.4 LTS | Ubuntu 20.04.3 LTS | Ubuntu 20.04.3 LTS | Ubuntu 20.04.3 LTS |
| Base Container | Triton version 22.03 | Triton version 22.02 | Triton version 22.01 | Triton version 21.12 |
| CUDA | 11.6.1.005 | 11.6.0.021 | 11.6.0.020 | 11.5.0.029 |
| RMM | 21.12.0 | 0+unknown | 21.12.00 | 21.10.00a+42.gae27a57 |
| cuDF | 22.2.0 | 21.12.02 | 21.12.02 | 21.10.00a+345.ge05bd4bf3c.dirty |
| cuDNN | 8.3.3.40+cuda11.5 | 8.3.2.44+cuda11.5 | 8.3.2.44+cuda11.5 | 8.3.1.22 |
| Merlin Core | 0.2.0 | v0.1.1+12.g53b1ffc | Not applicable | Not applicable |
| Merlin Models | 0.3.0 | Not applicable | Not applicable | Not applicable |
| Merlin Systems | 0.1.0 | 0+untagged.9.gccd0e15 | Not applicable | Not applicable |
| NVTabular | 1.0.0 | 0.11.0+20.gafa0e43 | 0.10.0 | 0.9.0+1.g31f9350 |
| Transformers4Rec | 0.1.7 | 0.1.6+4.ga153d6d | 0.1.4 | 0.1.4 |
| HugeCTR | Not applicable | Not applicable | Not applicable | Not applicable |
| SM | Not applicable | Not applicable | Not applicable | Not applicable |
| Triton Inference Server | 2.20.0 | 2.19.0 | 2.18.0 | 2.17.0 |
| Size | 9.45 GB | 9.19 GB | 20.62 GB | 25.93 GB |
21.xx Container Images
| Container Release | Release 21.12 | Release 21.11 | Release 21.09 |
|---|---|---|---|
| DGX | | | |
| DGX System | | | |
| Operating System | Ubuntu 20.04.3 LTS | Ubuntu 20.04.3 LTS | Ubuntu 20.04.3 LTS |
| NVIDIA Certified Systems | | | |
| NVIDIA Driver | NVIDIA Driver version 465.19.01 or later is required. However, if you’re running on Data Center GPUs (formerly Tesla) such as T4, you can use any of the following NVIDIA Driver versions: Note: The CUDA Driver Compatibility Package does not support all drivers. | NVIDIA Driver version 465.19.01 or later is required. However, if you’re running on Data Center GPUs (formerly Tesla) such as T4, you can use any of the following NVIDIA Driver versions: Note: The CUDA Driver Compatibility Package does not support all drivers. | NVIDIA Driver version 465.19.01 or later is required. However, if you’re running on Data Center GPUs (formerly Tesla) such as T4, you can use any of the following NVIDIA Driver versions: Note: The CUDA Driver Compatibility Package does not support all drivers. |
| GPU Model | | | |
| Base Container Image | | | |
| Container Operating System | Ubuntu 20.04.3 LTS | Ubuntu 20.04.3 LTS | Ubuntu 20.04.3 LTS |
| Base Container | Triton version 21.10 | Triton version 21.10 | Triton version 21.07 |
| CUDA | 11.5.0.029 | 11.4.3.001 | 11.4.0.031 |
| RMM | 21.08.02 | 21.08.02 | 0+unknown |
| cuDF | 21.08.03 | 21.08.03 | 21.08.02 |
| cuDNN | 8.3.0.96 | 8.2.4.15 | 8.2.2.26 |
| Merlin Core | Not applicable | Not applicable | Not applicable |
| Merlin Models | Not applicable | Not applicable | Not applicable |
| Merlin Systems | Not applicable | Not applicable | Not applicable |
| NVTabular | 0.8.0 | 0.7.1 | 0.7.0 |
| Transformers4Rec | 0.1.3 | 0.1.2 | 0.1.1 |
| HugeCTR | Not applicable | Not applicable | Not applicable |
| SM | Not applicable | Not applicable | Not applicable |
| Triton Inference Server | 2.16.0 | 2.15.0 | 2.12.0 |
| Size | 27.26 GB | 24.61 GB | 28.72 GB |
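
The NVIDIA Driver rows in both tables state the same minimum, 465.19.01. As a minimal sketch, a host's driver version string could be compared against that minimum as shown below. The names `MIN_DRIVER`, `parse_version`, and `driver_meets_minimum` are hypothetical helpers introduced for illustration and are not part of Merlin or Triton. The sketch does not cover the Data Center GPU exception, because the alternative driver versions are not listed here.

```python
# Hypothetical helper: compare a driver version string against the
# minimum stated in the support matrix (465.19.01).
MIN_DRIVER = "465.19.01"


def parse_version(text):
    """Split a dotted version string into a tuple of integers."""
    return tuple(int(part) for part in text.strip().split("."))


def driver_meets_minimum(installed, minimum=MIN_DRIVER):
    """Return True if the installed driver is at or above the minimum."""
    return parse_version(installed) >= parse_version(minimum)


if __name__ == "__main__":
    # Example version strings chosen for illustration only.
    for candidate in ("460.32.03", "465.19.01", "470.57.02"):
        print(candidate, driver_meets_minimum(candidate))
```

On a host with the NVIDIA driver installed, the version string can typically be read with `nvidia-smi --query-gpu=driver_version --format=csv,noheader` and passed to `driver_meets_minimum`.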