I'm very confused and interested in an explanation as well
I just set up Whisper on my external GPU server to run the medium model with 0.5s of processing time, but the built-in intents are somewhat lacking.
What's involved in running Whisper on a computer other than the Home Assistant computer? I'm guessing it's relatively easy to install, hopefully in Docker. How do you tell HA to use that Whisper?
Also, it's a bit surprising that moving the voice recognition to a GPU on a (presumably) more powerful computer doesn't improve HA performance.
First of all: It increases performance tremendously. For comparison
So running it on a cheap, used 100€ GPU can get you accuracy and speed that Alexa, Siri and Google would have to respect. This is a game changer for me. I have already installed 3 M5Stack ATOM ECHOs in my home and more will follow soon. It's incredibly accurate and quick.
The important part is to pick the correct Docker image. The default one that's available from rhasspy doesn't have GPU support.
Now, getting it running is actually pretty easy. First go to this link and download all the files. You have to build a custom Docker image with those files. I have no idea how to do that with bare-bones Docker, as I am using Portainer for everything. In Portainer you have to do:
Next you go
That will spin up a docker-compose stack with the local custom image you just built, running faster-whisper, which is compatible with the Wyoming protocol in Home Assistant and can run on an NVIDIA GPU with CUDA acceleration.
As you can see in the docker-compose it will expose port 10300. Next:
It will automagically know that it's whisper and will be fully integrated into your system. You can now add it into your voice assistant.
If you look at the logs of your new docker container you can see every voice command that is sent to your new whisper.
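If Home Assistant can't connect to the new Whisper, it may be worth checking first that port 10300 is actually reachable from another machine on your network. Here is a minimal Python sketch for that. The function name `is_port_open` is just something I made up for illustration, and it only tests that something accepts TCP connections on the port; it does not speak the Wyoming protocol.

```python
import socket


def is_port_open(host: str, port: int = 10300, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout.

    10300 is the port exposed by the docker-compose described above.
    This is only a reachability check, not a Wyoming protocol handshake.
    """
    try:
        # create_connection resolves the host and opens a TCP connection;
        # the context manager closes the socket again right away.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts and name resolution failures.
        return False
```

Calling something like `is_port_open("192.168.1.50")` (the IP is a placeholder, use the address of your GPU server) should return `True` once the container is up. If it returns `False`, the container, the port mapping or a firewall is the likely culprit rather than Home Assistant itself.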
I finally got around to trying this. It's super easy and significantly improved response time. I will add that the last step is to configure the Voice Assistant you're using in Home Assistant to use the new entity you just added as the "Speech to Text" engine.
Thanks, @RandomLegend@lemmy.dbzer0.com!
Ah yes, that final step I forgot.
Awesome that it works for you!