Replies: 1 comment
The TensorRT-LLM docs are for an old version, so you can't follow them. I'm trying to install tensorrt-llm 0.21.0 on Windows, and here is why I'm failing: NVIDIA has more or less given up on Windows.
I'm trying to build the wheel myself, but the most likely result is failure... I'll switch to Docker.
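For anyone else taking the Docker route, a minimal sketch of pulling and running NVIDIA's prebuilt TensorRT-LLM container (the image name and tag are my assumption here; check the current TensorRT-LLM docs for the exact one, and note that on Windows this requires Docker Desktop with WSL2 GPU support):
REM Pull NVIDIA's prebuilt TensorRT-LLM image (name/tag assumed; verify against the docs)
docker pull nvcr.io/nvidia/tensorrt-llm/release
REM Start an interactive container with GPU access
docker run --rm -it --gpus all nvcr.io/nvidia/tensorrt-llm/release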
How do I actually use TensorRT-LLM to run a model?
I have the latest update to text-generation-webui, running on Windows 11.
I converted my own model on my GPU, which is now stored as a ".engine" file.
This shows up in my Model list; I choose the "TensorRT-LLM" model loader and get the following errors:
Okay, I guess TensorRT isn't actually installed...
So I open the command line in the text-generation-webui environment by running "cmd_windows.bat".
I then follow the TensorRT-LLM installation instructions linked from text-generation-webui, which for the Windows installation are here: https://nvidia.github.io/TensorRT-LLM/installation/windows.html
Now it's a bit of a mess, because the text-generation-webui environment has Python 3.11 installed, but the TensorRT-LLM instructions say that you must have Python 3.10. I do also have 3.10 installed, but not in this environment.
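For reference, a quick way to confirm the mismatch is to check which interpreter and wheel tags pip will actually use inside that environment (run from the cmd_windows.bat prompt):
REM Show which Python the portable environment uses
python --version
REM List the wheel tags this pip accepts; a tensorrt_llm wheel must match one of them
pip debug --verbose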
I have used the full TensorRT package via a git clone of the NVIDIA project (not the LLM scripts used here) in my Windows system environment; that's how I built the LLM engine to begin with.
But I can't get this tensorrt_llm package to install for text-generation-webui.
In any case, inside of the "cmd_windows.bat" environment, running:
pip install tensorrt_llm==0.17.0.post1 --extra-index-url https://download.pytorch.org/whl/ --extra-index-url https://pypi.nvidia.com
always results in the same errors. I have tried different versions as well: tensorrt-llm==0.16.0 and tensorrt-llm==0.10.8.
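For what it's worth, here is a sketch of what installing into a separate Python 3.10 environment would look like, given the version requirement above (the env name trtllm310 is made up, conda is assumed to be available, and whether a Windows wheel actually exists for a given version is a separate question):
REM Create and activate a separate Python 3.10 environment (name is hypothetical)
conda create -n trtllm310 python=3.10
conda activate trtllm310
REM Same install command, now resolved against Python 3.10 wheel tags
pip install tensorrt_llm==0.17.0.post1 --extra-index-url https://download.pytorch.org/whl/ --extra-index-url https://pypi.nvidia.com
Note that this installs outside the webui environment, so the TensorRT-LLM loader inside text-generation-webui still wouldn't see it.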

I don't use Linux or GitHub very much, and although I can program decently, the readme saying that "[TensorRT-LLM] is supported via its own [Dockerfile]" isn't really that helpful to me. I see that the Dockerfile essentially just installs various libraries and packages. I did directly try the command from the Docker script for version 10.0, but it gives this error.
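In case it helps to spell out the Dockerfile route: the usual pattern is to build an image from that Dockerfile and then run it with GPU access (the Dockerfile path and image tag below are guesses; check the repo for the actual location):
REM Build an image from the repo's TensorRT-LLM Dockerfile (path is hypothetical)
docker build -t textgen-trtllm -f docker/TensorRT-LLM/Dockerfile .
REM Run it with GPU access and the webui's default port published
docker run --rm -it --gpus all -p 7860:7860 textgen-trtllm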
Help?
Am I doing something stupid?
Thanks!