[Performance] how to set the threads when using TRT EP #22913
Labels
ep:TensorRT (issues related to TensorRT execution provider)
performance (issues related to performance regressions)
platform:jetson (issues related to the NVIDIA Jetson platform)
Describe the issue
I see multiple threads when running ONNX Runtime with the TensorRT execution provider (TRT EP). Is this normal behavior?
From the documentation it says:
I'm using the TRT EP, although in providers I also include CPUExecutionProvider and CUDAExecutionProvider. How can I set the number of threads for the TRT EP? Thanks.
To reproduce
No code can be provided.
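Although no reproduction script is available, the setup described above might look roughly like the following sketch. It assumes the Python API with the three providers listed in priority order, and it sets the session-level thread options (intra_op_num_threads and inter_op_num_threads on SessionOptions). These options govern ONNX Runtime's own CPU thread pools; whether they change the threads created by TensorRT or CUDA themselves is not established here. The model path and the helper names (build_providers, native_thread_count, make_session) are hypothetical and only for illustration.

```python
"""Sketch: cap ONNX Runtime's CPU thread pools in a session that prefers the TRT EP."""


def build_providers():
    # Provider priority as described in the issue: TensorRT first,
    # then CUDA, then CPU as the final fallback.
    return [
        "TensorrtExecutionProvider",
        "CUDAExecutionProvider",
        "CPUExecutionProvider",
    ]


def native_thread_count():
    """Return the OS-level thread count of this process on Linux, else None."""
    try:
        with open("/proc/self/status") as status:
            for line in status:
                if line.startswith("Threads:"):
                    return int(line.split()[1])
    except OSError:
        return None
    return None


def make_session(model_path, intra_op=1, inter_op=1):
    # Imported lazily so the helpers above work without onnxruntime installed.
    import onnxruntime as ort

    options = ort.SessionOptions()
    # Threads used to parallelize work inside a single operator on CPU.
    options.intra_op_num_threads = intra_op
    # Threads used to run independent operators concurrently; this only
    # matters when the session runs in parallel execution mode.
    options.inter_op_num_threads = inter_op
    return ort.InferenceSession(
        model_path, sess_options=options, providers=build_providers()
    )


# Example usage (assumes a hypothetical model file "model.onnx"):
#   before = native_thread_count()
#   session = make_session("model.onnx", intra_op=1, inter_op=1)
#   after = native_thread_count()
#   print("threads before/after session creation:", before, after)
```

Comparing the thread count before and after session creation, with different option values, is one way to check which threads these settings actually influence on a given device.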
Urgency
No response
Platform
Other / Unknown
OS Version
JetPack=5.1.2
ONNX Runtime Installation
Released Package
ONNX Runtime Version or Commit ID
onnxruntime-gpu=1.17.0
ONNX Runtime API
Python
Architecture
ARM64
Execution Provider
TensorRT
Execution Provider Library Version
No response
Model File
No response
Is this a quantized model?
No