You won't believe how fast it is | Raspberry Pi Speech-to-Text

Faster-than-real-time offline speech transcription on Raspberry Pi, or any other computing system, including Orange Pi, Jetson Nano and many other Linux SBCs. A quick hands-on guide, from installing the necessary packages to running the Whisper model with whisper.cpp or faster-whisper.
Whisper.cpp Python bindings repository:
github.com/AIWintermuteAI/whi...
faster-whisper:
github.com/SYSTRAN/faster-whi...
Benchmark gist:
gist.github.com/AIWintermuteA...
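
As a rough illustration of what "faster than real-time" means here, the following is a minimal sketch, not the code from the video or the benchmark gist. It assumes the faster-whisper package is installed, and it assumes a "tiny.en" model with int8 quantization on the CPU. The helper names real_time_factor and transcribe_timed are made up for this example.

```python
import time


def real_time_factor(compute_seconds: float, audio_seconds: float) -> float:
    """Processing time divided by audio duration; below 1.0 is faster than real time."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return compute_seconds / audio_seconds


def transcribe_timed(path: str, model_size: str = "tiny.en"):
    """Transcribe an audio file on the CPU and return (text, real-time factor).

    Hypothetical helper: the model size and int8 compute type are assumptions
    chosen for boards without a GPU, not settings taken from the video.
    """
    # Imported lazily so real_time_factor works without faster-whisper installed.
    from faster_whisper import WhisperModel

    model = WhisperModel(model_size, device="cpu", compute_type="int8")

    start = time.perf_counter()
    segments, info = model.transcribe(path, beam_size=1)
    # segments is a lazy generator, so decoding actually happens during iteration.
    text = " ".join(segment.text.strip() for segment in segments)
    elapsed = time.perf_counter() - start

    # info.duration is the length of the input audio in seconds.
    return text, real_time_factor(elapsed, info.duration)
```

Calling transcribe_timed on a local WAV file (the path is up to you) returns the transcript and the ratio of compute time to audio length. Model loading is left outside the timed region, so the ratio reflects transcription only. As an example of the arithmetic, 5 seconds of computation for 10 seconds of audio gives a factor of 0.5.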

Comments: 85

@Hardwareai
The follow-up video is also live on KZread - find it in my channel.
@brianmeyer107
Love this video! I rarely find myself pausing and rewinding, but here the details were coming fast enough that I became the weak link. Love this.
@C0ldSpace
I need this because I'm building a translator for my sister. There's a new person in her class who can only speak Spanish, so I'm making this.
@newtownsmells
Hey, this is incredible. Really appreciate your work.
@TomanswerAi
Very cool guide. Thank you.
@exploring-electronic
Thanks for the work done fixing the whisper.cpp python bindings! I'll check them out.
@tribelessa
Hello! Great work, I will try to test it. Your projects are interesting (for me, since the Kendryte K210).
@emanuelepapa3548
I’m using your repository. Thank you.
@markantinozzi4970
I'm going to try to install it.
@levbereggelezo
Well done! Was whisper.cpp compiled with BLAS optimizations?
@antoniorodriguez-ynyestosa5907
Hi! This is amazing! Thank you very much! Just a quick question, should it work on Windows? Because I get an error when I run "python -m build -w":
@newtownsmells
Would you consider showing how to implement live real time streaming with faster-whisper? Seems like that would be a huge way forward
@user-nf2pe4kr3n
Can the program be modified so that all recognized texts are consolidated into a single paragraph upon exiting the program?
@ptsckts6123
Hello, the same benchmark currently results in 5925.774 ms computation time on my RPi 5. Should I do anything differently? The audio file I used is 10 seconds long, the same JFK speech.
@phillipreay
How hard would it be to add a continuous background search process taking keywords from the conversation? I wanna have a screen in my office that's supporting the dialogue with more right-brain material. Of course, being able to interrupt it and to follow the source for each resource would be important.
@bens4446
I had heard about faster-whisper on other channels but thought it couldn't work on an SBC because it uses a GPU, which an SBC doesn't have. I have no idea how you did this. Thanks!
@abdullahdogan5822
hi,
@yashvishah9315
Can I use an INMP441 I2S microphone module instead of
@user-cl2og
I downloaded this on a Raspberry Pi 4 running Bookworm 64-bit and got the following error:
@danilovaz9839
Oh man, please teach me the ways. Like, for real. I saw you provide 1:1 consultancy, but I need to know if your price is per meeting or for a full project.
