请访问深度学习框架 (DLFW) 网站以获取完整的兼容性矩阵.

发布兼容性矩阵#

  1. 容器名称:trtllm-python-py3

  2. 容器名称:vllm-python-py3

  3. ONNX Runtime 版本

容器名称:trtllm-python-py3#

Triton 发行版本

NGC 标签

Python 版本

Torch 版本

TensorRT 版本

TensorRT-LLM 版本

CUDA 版本

CUDA 驱动版本

大小

25.01

nvcr.io/nvidia/tritonserver:25.01-trtllm-python-py3

Python 3.12.3

2.6.0a0%2Becf3bae40a.nv25.1

10.8.0.43

0.17.0

12.8.0.038

570.86.10

30G

24.12

nvcr.io/nvidia/tritonserver:24.12-trtllm-python-py3

Python 3.12.3

2.6.0a0%2Bdf5bbc09d1.nv24.11

10.7.0

0.16.0

12.6.3

560.35.05

22G

24.11

nvcr.io/nvidia/tritonserver:24.11-trtllm-python-py3

Python 3.10.12

2.5.0a0%2Be000cf0ad9.nv24.10

10.6.0

0.15.0

12.6.3

555.42.06

24.8G

24.10

nvcr.io/nvidia/tritonserver:24.10-trtllm-python-py3

Python 3.10.12

2.4.0a0%2B3bcc3cddb5.nv24.7

10.4.0

0.14.0

12.5.1.007

555.42.06

23.3G

24.09

nvcr.io/nvidia/tritonserver:24.09-trtllm-python-py3

Python 3.10.12

2.4.0a0%2B3bcc3cddb5.nv24.7

10.4.0

0.13.0

12.5.1.007

555.42.06

21G

24.08

nvcr.io/nvidia/tritonserver:24.08-trtllm-python-py3

Python 3.10.12

2.4.0a0%2B3bcc3cddb5.nv24.7

10.3.0

0.12.0

12.5.1.007

555.42.06

21G

24.07

nvcr.io/nvidia/tritonserver:24.07-trtllm-python-py3

Python 3.10.12

2.4.0a0%2B07cecf4168.nv24.5

10.1.0

0.11.0

12.4.1.003

550.54.15

23G

24.06

nvcr.io/nvidia/tritonserver:24.06-trtllm-python-py3

Python 3.10.12

2.3.0a0%2B40ec155e58.nv24.3

10.0.1

0.10.0

12.4.0.041

550.54.14

31G

24.05

nvcr.io/nvidia/tritonserver:24.05-trtllm-python-py3

Python 3.10.12

2.3.0a0%2Bebedce2

10.0.1.6

0.9.0

12.3.2.001

545.23.08

34G

24.04

nvcr.io/nvidia/tritonserver:24.04-trtllm-python-py3

Python 3.10.12

2.3.0a0%2Bebedce2

9.3.0.post12.dev1

0.9.0

12.3.2.001

545.23.08

34G

容器名称:vllm-python-py3#

Triton 发行版本

NGC 标签

Python 版本

vLLM 版本

CUDA 版本

CUDA 驱动版本

大小

25.01

nvcr.io/nvidia/tritonserver:25.01-vllm-python-py3

Python 3.12.3

0.6.3.post1

12.8.0.038

570.86.10

23G

24.12

nvcr.io/nvidia/tritonserver:24.12-vllm-python-py3

Python 3.12.3

0.5.5

12.6.3.004

560.35.05

20G

24.11

nvcr.io/nvidia/tritonserver:24.11-vllm-python-py3

Python 3.12.3

0.5.5

12.6.3.001

560.35.05

22.1G

24.10

nvcr.io/nvidia/tritonserver:24.10-vllm-python-py3

Python 3.10.12

0.5.5

12.6.2.004

560.35.03

21G

24.09

nvcr.io/nvidia/tritonserver:24.09-vllm-python-py3

Python 3.10.12

0.5.3.post1

12.6.1.006

560.35.03

19G

24.08

nvcr.io/nvidia/tritonserver:24.08-vllm-python-py3

Python 3.10.12

0.5.0 post1

12.6.0.022

560.35.03

19G

24.07

nvcr.io/nvidia/tritonserver:24.07-vllm-python-py3

Python 3.10.12

0.5.0 post1

12.5.1

555.42.06

19G

24.06

nvcr.io/nvidia/tritonserver:24.06-vllm-python-py3

Python 3.10.12

0.4.3

12.5.0.23

555.42.02

18G

24.05

nvcr.io/nvidia/tritonserver:24.05-vllm-python-py3

Python 3.10.12

0.4.0 post1

12.4.1

550.54.15

18G

24.04

nvcr.io/nvidia/tritonserver:24.04-vllm-python-py3

Python 3.10.12

0.4.0 post1

12.4.1

550.54.15

17G

ONNX Runtime 版本#

Triton 发行版本

ONNX Runtime

25.01

1.20.1

24.12

1.20.1

24.11

1.19.2

24.10

1.19.2

24.09

1.19.2

24.08

1.18.1

24.07

1.18.1

24.06

1.18.0

24.05

1.18.0

24.04

1.17.3