You won't believe how fast it is | Raspberry Pi Speech-to-Text

Faster-than-real-time offline speech transcription on Raspberry Pi, or on any other computing system, including Orange Pi, Jetson Nano and many other Linux SBCs. A quick hands-on guide, from installing the necessary packages to running the Whisper model with whisper.cpp or faster-whisper.
Whisper.cpp Python bindings repository:
github.com/AIWintermuteAI/whi...
faster-whisper:
github.com/SYSTRAN/faster-whi...
Benchmark gist:
gist.github.com/AIWintermuteA...

Comments: 85

  • @Hardwareai

    The follow-up video is also live on KZread - find it in my channel.

  • @brianmeyer107

    Love this video! I rarely find myself pausing and rewinding, but here the details were coming fast enough that I became the weak link. Love this.

  • @C0ldSpace

    I need this because I'm building a translator for my sister. There's a new person in her class who can only speak Spanish, so I'm making this.

  • @newtownsmells

    Hey, this is incredible. Really appreciate your work.

  • @TomanswerAi

    Very cool guide. Thank you.

  • @exploring-electronic

    Thanks for the work done fixing the whisper.cpp Python bindings! I'll check them out.

  • @tribelessa

    Hello! Great work, I will try to test it. Your projects have been interesting to me since the Kendryte K210.

  • @emanuelepapa3548

    I’m using your repository. Thank you.

  • @markantinozzi4970, 21 hours ago

    I'm going to try to install it.

  • @levbereggelezo

    Well done! Was whisper.cpp compiled with BLAS optimizations?

  • @antoniorodriguez-ynyestosa5907

    Hi! This is amazing! Thank you very much! Just a quick question, should it work on Windows? Because I get an error when I run "python -m build -w":

  • @newtownsmells

    Would you consider showing how to implement live real-time streaming with faster-whisper? Seems like that would be a huge step forward.

  • @user-nf2pe4kr3n

    Can the program be modified so that all recognized texts are consolidated into a single paragraph upon exiting the program?

  • @ptsckts6123

    Hello, the same benchmark results in 5925.774 ms of computation time on my RPi 5 currently. Should I do anything differently? The audio file I used is 10 seconds long, the same JFK speech.

  • @phillipreay

    How hard would it be to add a continuous background search process that takes keywords from the conversation? I want to have a screen in my office that supports the dialogue with more right-brain material. Of course, being able to interrupt and follow the source for a resource would be important.

  • @bens4446

    I had heard about faster-whisper on other channels but thought it couldn't work on an SBC because it uses a GPU, which an SBC doesn't have. I have no idea how you did this. Thanks!

  • @abdullahdogan5822

    hi,

  • @yashvishah9315

    Can I use an INMP441 I2S microphone module instead of

  • @user-cl2og

    I downloaded this on a Raspberry Pi 4 running 64-bit Bookworm and I got the following error:

  • @danilovaz9839

    Oh man, please teach me the ways. Like, for real. I saw you provide 1:1 consultancy, but I need to know if your price is per meeting or for a full project.
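
For the benchmark figure quoted by @ptsckts6123 above (5925.774 ms of computation for a clip described as 10 seconds long), one quick way to check whether a run counts as "faster than real-time" is the real-time factor: processing time divided by audio duration. The sketch below is only that arithmetic. The function names are hypothetical, and the 10-second duration is taken from the comment rather than measured.

```python
def real_time_factor(compute_ms: float, audio_seconds: float) -> float:
    """Return processing time divided by audio duration.

    A value below 1.0 means the transcription finished faster than
    the audio would take to play back.
    """
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return (compute_ms / 1000.0) / audio_seconds


def is_faster_than_real_time(compute_ms: float, audio_seconds: float) -> bool:
    """True when the real-time factor is strictly below 1.0."""
    return real_time_factor(compute_ms, audio_seconds) < 1.0


if __name__ == "__main__":
    # Figures from the comment: 5925.774 ms for a 10-second clip (assumed duration).
    rtf = real_time_factor(5925.774, 10.0)
    print(f"real-time factor: {rtf:.3f}, speed-up: {1.0 / rtf:.2f}x")
    print("faster than real-time:", is_faster_than_real_time(5925.774, 10.0))
```

By this measure the quoted run is still under 1.0, so it is faster than real-time, although model size and thread count will change the number.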