OpenAI’s Whisper is AMAZING!

Science and Technology

I ran OpenAI’s Whisper model in a notebook and used it to transcribe and translate my voice.
Link to the notebook: colab.research.google.com/dri...
🔔 Subscribe for more stories: www.youtube.com/@underfitted?...
📚 My 3 favorite Machine Learning books:
• Deep Learning With Python, Second Edition - amzn.to/3xA3bVI
• Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow - amzn.to/3BOX3LP
• Machine Learning with PyTorch and Scikit-Learn - amzn.to/3f7dAC8
Twitter: @svpino
Disclaimer: Some of the links included in this description are affiliate links where I'll earn a small commission if you purchase something. There's no cost to you.

Comments: 42

  • @Param3021
    @Param3021 a year ago

    Model is looking great! Thank you for sharing :D

  • @underfitted
    @underfitted a year ago

    Sure thing!

  • @PritishMishra
    @PritishMishra a year ago

    The quality of your videos is GREAT! Your channel will blow up soon! Edit: May I ask which font you used at 5:10?

  • @underfitted
    @underfitted a year ago

    Thank you very much! That’s my handwriting :) I did it on an iPad.

  • @cygmoid
    @cygmoid a year ago

    That's really amazing. Nice work Santiago

  • @underfitted
    @underfitted a year ago

    Thanks!

  • a year ago

    Thank you for the instructions and the colab notebook. Surprisingly, it works for Turkish as well.

  • @underfitted
    @underfitted a year ago

    Awesome!

  • @joaodiogocosta
    @joaodiogocosta a year ago

    Please keep sharing!

  • @underfitted
    @underfitted a year ago

    Will do!

  • @dakshbhatnagar
    @dakshbhatnagar a year ago

    Daaayuuumm, that was quick and easy!

  • @underfitted
    @underfitted a year ago

    Yessir

  • @canozturk369
    @canozturk369 a year ago

    great work, great video. thanks

  • @underfitted
    @underfitted a year ago

    Thanks!

  • @jayperalta7104
    @jayperalta7104 a year ago

    This video is AWESOME. I do have a question, though: how and where do I upload my prerecorded audio file? Thanks!

  • @underfitted
    @underfitted a year ago

    You can upload it to Colab. There's a file view on the sidebar. You can drag and drop files there to upload them. Keep in mind they will be deleted when the Runtime is restarted.
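
To make the reply above concrete, here is a minimal sketch of how an uploaded file might be picked up and transcribed. It assumes the `openai-whisper` package is installed and that files dragged into the Colab sidebar land in `/content`. The helper names `find_uploaded_audio` and `transcribe_file`, the extension list, and the `base` model size are illustrative choices, not taken from the video or its notebook.

```python
from pathlib import Path

# Assumption: a handful of common audio extensions; extend as needed.
AUDIO_EXTENSIONS = {".mp3", ".wav", ".m4a", ".flac", ".ogg"}


def find_uploaded_audio(directory="/content"):
    """Return audio files found directly in `directory`, sorted by path."""
    return sorted(
        p for p in Path(directory).iterdir()
        if p.is_file() and p.suffix.lower() in AUDIO_EXTENSIONS
    )


def transcribe_file(path, model_name="base", translate=False):
    """Transcribe one audio file, or translate it to English if requested."""
    import whisper  # imported lazily so the helper above works without it

    model = whisper.load_model(model_name)
    task = "translate" if translate else "transcribe"
    result = model.transcribe(str(path), task=task)
    return result["text"]
```

In a notebook cell, something like `transcribe_file(find_uploaded_audio()[0])` would then return the text of the first uploaded file. Because uploads disappear when the runtime restarts, the file has to be uploaded again before rerunning.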

  • @jamges
    @jamges a year ago

    Can I use the Whisper API to transcribe sound errors? I’m a speech therapist, so I want to see where the user makes an error (“wabbit” instead of “rabbit”) without Whisper autocorrecting it.

  • @SergioEspallargas
    @SergioEspallargas a year ago

    Love it!

  • @underfitted
    @underfitted a year ago

    Thanks :)

  • @ricosrealm
    @ricosrealm a year ago

    Hey Santiago, this video and your Diffusion video have a stereo audio imbalance between left and right. Realized it when listening with headphones.

  • @underfitted
    @underfitted a year ago

    Yeah. Pretty crappy audio. I think I fixed it in my last few videos.

  • @ashwinshetgaonkar6329
    @ashwinshetgaonkar6329 a year ago

    thanks for sharing

  • @underfitted
    @underfitted a year ago

    Absolutely!

  • @ThomasConover
    @ThomasConover a year ago

    I have tried this AI for a few hours now and it's incredibly impressive so far. It is super slow without a GPU, though.

  • @underfitted
    @underfitted a year ago

    Yeah to both of those comments. You need a GPU if you want to do something useful with it.

  • @MyHowHowHow
    @MyHowHowHow a year ago

    You can get great performance on CPU if you use the fork called WhisperCpp.
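
For the Python package discussed in this thread, the GPU-versus-CPU choice can be sketched roughly as follows. This assumes `torch` and `openai-whisper` are installed. The helper names `pick_device` and `load_and_transcribe` are hypothetical, and disabling half precision on CPU is a common convention rather than something stated in the video.

```python
def pick_device(cuda_available):
    """Return (device, use_fp16): GPU with half precision, else CPU with fp32."""
    if cuda_available:
        return "cuda", True
    return "cpu", False


def load_and_transcribe(path, model_name="base"):
    """Load a Whisper model on the best available device and transcribe `path`."""
    import torch    # imported lazily so pick_device stays dependency-free
    import whisper

    device, use_fp16 = pick_device(torch.cuda.is_available())
    model = whisper.load_model(model_name, device=device)
    # fp16 is forwarded to Whisper's decoding options.
    return model.transcribe(str(path), fp16=use_fp16)["text"]
```

On a Colab runtime without a GPU this falls back to the CPU path, which works but is noticeably slower, as the comments above point out.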

  • @wimdenherder
    @wimdenherder a year ago

    Do you use this for your own subtitles too?

  • @underfitted
    @underfitted a year ago

    I don’t. Yet

  • @sigib911
    @sigib911 a year ago

    Amazing! So how can I use it in real time? That would be even better.

  • @avnishmishra2770
    @avnishmishra2770 a year ago

    Man, thanks for this Litt video❤

  • @underfitted
    @underfitted a year ago

    Definitely!

  • @mgreek31
    @mgreek31 a year ago

    Good 👍

  • @abandonedmuse
    @abandonedmuse a year ago

    OMG ARE YOU CUBAN TOO? I live in Miami too! Yeah! Hey, we Cubans are into everything!

  • @underfitted
    @underfitted a year ago

    Ha ha! So true!

  • @RobinGlauser
    @RobinGlauser a year ago

    Is there a way to fine-tune it? It would be awesome to be able to fine-tune the German part to also understand Swiss German :) Chuchichästli

  • @underfitted
    @underfitted a year ago

    Good question! I think there is, especially because they open sourced the model.

  • @aleksandrszagorskis3306
    @aleksandrszagorskis3306 a year ago

    Playback speed 1.5

  • @underfitted
    @underfitted a year ago

    Well, that will certainly go by quick! :)

  • @amphem
    @amphem a year ago

    Does it know French?

  • @underfitted
    @underfitted a year ago

    It does know French, yes.

  • @nickkatsaras9721
    @nickkatsaras9721 a year ago

    Nice video, but putting your hand on the camera is a bad habit. Maybe consider not doing that anymore; it's really annoying. Otherwise, great.

  • @underfitted
    @underfitted a year ago

    Do you mean covering the lens during scene transitions? I never heard this was annoying before. Do you mind elaborating a bit? And thanks very much for the feedback! It helps a ton to make better videos!
