You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
When using a hipblas build, I'm always frustrated that whisper.cpp is using my CPU instead of my GPU. Unfortunately, whisper.cpp does not support ROCm or hipblas.
Describe the solution you'd like
However, my AMD GPU does not only support ROCm/hipblas, but also Vulkan. Just like whisper.cpp.
Describe alternatives you've considered
At the moment, I'm using a full Vulkan build to be compatible with whisper.cpp. However, I'd prefer to use ROCm when available, else fallback to Vulkan.
The text was updated successfully, but these errors were encountered:
Technically, whisper.cpp supports ROCm. I recently changed my hardware, but before that I was able to use it directly (without a wrapper like LocalAI) with an RX 6900XT on NixOS.
I remember it didn't work right away; I was also using a Vulkan build for some time. Meanwhile, the maintainers have been making some changes, and after that, whisper.cpp started using ROCm as intended, which was evident both in the logs and in the speed.
While searching for the cause and solution, I stumbled across a change in a build variable from LLAMA_HIPBLAS to GGML_HIPBLAS, perhaps that is the problem here as well.
At least I could only find one commit with LLAMA_HIPBLAS in this repository, but none with GGML_HIPBLAS.
Is your feature request related to a problem? Please describe.
When using a hipblas build, I'm always frustrated that whisper.cpp is using my CPU instead of my GPU. Unfortunately, whisper.cpp does not support ROCm or hipblas.
Describe the solution you'd like
However, my AMD GPU does not only support ROCm/hipblas, but also Vulkan. Just like whisper.cpp.
Describe alternatives you've considered
At the moment, I'm using a full Vulkan build to be compatible with whisper.cpp. However, I'd prefer to use ROCm when available, else fallback to Vulkan.
The text was updated successfully, but these errors were encountered: