You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Records device as the device on which the active host thread executes the device code.
If the host thread has already initialized the CUDA runtime by calling non-device management
runtime functions or if there exists a CUDA driver context active on the host thread, then this
call returns cudaErrorSetOnActiveProcess
Note that we may need to restart Server to apply this change
Update: Records device as the device on which the active host thread executes the device code. Seems like we can not apply the active device to all threads.
Question: Should we use CUDA_VISIBLE_DEVICE?
Answer: This is the best approach that I know. We will support TensorRT-LLM and Onnx(?), so it will reduce the complexity because we don't need to change the logic.
For AMD, I think we have ROCM_VISIBLE_DEVICE. Any environment variable can be useful?
The text was updated successfully, but these errors were encountered:
v1/hardware/activate
For Nvidia card we can use below API

Note that we may need to restart Server to apply this change
Update:
Records device as the device on which the active host thread executes the device code
. Seems like we can not apply the active device to all threads.Question: Should we use
CUDA_VISIBLE_DEVICE
?Answer: This is the best approach that I know. We will support
TensorRT-LLM
andOnnx
(?), so it will reduce the complexity because we don't need to change the logic.For AMD, I think we have
ROCM_VISIBLE_DEVICE
. Any environment variable can be useful?The text was updated successfully, but these errors were encountered: