GitHub
triton-inference-server/tensorrtllm_backend
122 forks
844 stars
Description
The Triton TensorRT-LLM Backend
Project metadata as of .