OpenAI’s Whisper is AMAZING!

Science and Technology

I ran OpenAI’s Whisper model in a notebook and used it to transcribe and translate my voice.
Link to the notebook: colab.research.google.com/dri...
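For readers who cannot open the notebook, the two steps described above (transcribing a recording, then translating it to English) might look roughly like this with the open-source `openai-whisper` Python package. This is a sketch under assumptions, not the notebook's actual code: the function name `transcribe_and_translate`, the `base` model size, and the file name in the usage note are all illustrative.

```python
def transcribe_and_translate(audio_path: str, model_name: str = "base") -> dict:
    """Return the detected language, the transcript, and an English translation.

    Assumes `pip install -U openai-whisper` and ffmpeg are available.
    The import is done lazily so this file can be loaded without Whisper installed.
    """
    import whisper

    # Downloads the checkpoint on first use; "base" is an assumed size.
    model = whisper.load_model(model_name)

    # Default task: transcribe in the spoken language (auto-detected).
    transcription = model.transcribe(audio_path)

    # task="translate" asks Whisper to output English text instead.
    translation = model.transcribe(audio_path, task="translate")

    return {
        "language": transcription["language"],
        "transcript": transcription["text"].strip(),
        "english": translation["text"].strip(),
    }
```

Calling `transcribe_and_translate("my_voice.wav")` (a hypothetical file name) would return a dictionary with the keys `language`, `transcript`, and `english`.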
🔔 Subscribe for more stories: www.youtube.com/@underfitted?...
📚 My 3 favorite Machine Learning books:
• Deep Learning With Python, Second Edition - amzn.to/3xA3bVI
• Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow - amzn.to/3BOX3LP
• Machine Learning with PyTorch and Scikit-Learn - amzn.to/3f7dAC8
Twitter: @svpino
Disclaimer: Some of the links included in this description are affiliate links where I'll earn a small commission if you purchase something. There's no cost to you.

Comments: 42

@Param3021 1 year ago
Model is looking great! Thank you for sharing :D
@underfitted
1 year ago
Sure thing!
@PritishMishra 1 year ago
The quality of your videos is GREAT! Your channel will blow up soon! Edit: May I ask which font you used at 5:10?
@underfitted
1 year ago
Thank you very much! That’s my handwriting :) I did it on an iPad.
@cygmoid 1 year ago
That's really amazing. Nice work, Santiago
@underfitted
1 year ago
Thanks!
1 year ago
Thank you for the instructions and the Colab notebook. Surprisingly, it works for Turkish as well.
@underfitted
1 year ago
Awesome!
@joaodiogocosta 1 year ago
Please keep sharing!
@underfitted
1 year ago
Will do!
@dakshbhatnagar 1 year ago
Daaayuuumm, that was quick and easy!
@underfitted
1 year ago
Yessir
@canozturk369 1 year ago
Great work, great video. Thanks!
@underfitted
1 year ago
Thanks!
@jayperalta7104 1 year ago
This video is AWESOME. I do have a question, though: how and where do I upload my prerecorded audio file? Thanks!
@underfitted
1 year ago
You can upload it to Colab. There's a file view on the sidebar. You can drag and drop files there to upload them. Keep in mind they will be deleted when the Runtime is restarted.
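Once a file has been dropped into the sidebar, picking it up and transcribing it could look something like the sketch below. It assumes the upload lands in `/content` (Colab's usual working directory) and that the `openai-whisper` package is installed; the helper names `find_audio_files` and `transcribe_uploaded` and the list of extensions are made up for illustration.

```python
from pathlib import Path

# Assumed set of audio extensions to look for; adjust as needed.
AUDIO_EXTENSIONS = {".mp3", ".wav", ".m4a", ".flac", ".ogg"}


def find_audio_files(directory: str = "/content") -> list[Path]:
    """List audio files directly inside `directory`, sorted by path."""
    root = Path(directory)
    return sorted(
        p for p in root.iterdir()
        if p.is_file() and p.suffix.lower() in AUDIO_EXTENSIONS
    )


def transcribe_uploaded(directory: str = "/content", model_name: str = "base") -> dict[str, str]:
    """Transcribe every uploaded audio file and map file name to text."""
    import whisper  # lazy import: requires `pip install -U openai-whisper`

    model = whisper.load_model(model_name)
    return {
        p.name: model.transcribe(str(p))["text"].strip()
        for p in find_audio_files(directory)
    }
```

Because the files disappear when the runtime restarts, it may be worth saving the returned transcripts somewhere persistent before closing the session.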
@jamges 1 year ago
Can I use the Whisper API to transcribe sound errors? I’m a speech therapist, so I want to see where the user makes an error, such as “wabbit” instead of “rabbit”, without Whisper autocorrecting it.
@SergioEspallargas 1 year ago
Love it!
@underfitted
1 year ago
Thanks :)
@ricosrealm 1 year ago
Hey Santiago, this video and your Diffusion video have a stereo audio imbalance between left and right. I noticed it when listening with headphones.
@underfitted
1 year ago
Yeah. Pretty crappy audio. I think I fixed it in my last few videos.
@ashwinshetgaonkar6329 1 year ago
Thanks for sharing!
@underfitted
1 year ago
Absolutely!
@ThomasConover 1 year ago
I have been trying this AI for a few hours now and it's incredibly impressive so far. It is super slow without a GPU, though.
@underfitted
1 year ago
Yeah to both of those comments. You need a GPU if you want to do something useful with it.
@MyHowHowHow
1 year ago
You can get great performance on CPU if you use the C/C++ port called whisper.cpp.
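For anyone staying with the original Python package, the GPU point in this thread can be sketched as follows: use CUDA when it is available and fall back to the CPU (with half precision turned off) when it is not. The helper names `runtime_settings` and `transcribe_on_best_device` are hypothetical, and the `base` model size is an assumption.

```python
def runtime_settings(cuda_available: bool) -> dict:
    """Choose the device and precision for Whisper.

    Half precision (fp16) is only enabled on the GPU; on CPU we ask for fp32.
    """
    return {
        "device": "cuda" if cuda_available else "cpu",
        "fp16": cuda_available,
    }


def transcribe_on_best_device(audio_path: str, model_name: str = "base") -> str:
    """Transcribe `audio_path` on the GPU if there is one, otherwise on the CPU."""
    import torch    # lazy imports: both packages must be installed
    import whisper

    settings = runtime_settings(torch.cuda.is_available())
    model = whisper.load_model(model_name, device=settings["device"])
    result = model.transcribe(audio_path, fp16=settings["fp16"])
    return result["text"].strip()
```

In Colab, a GPU can usually be requested through the runtime type setting before running something like this.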
@wimdenherder 1 year ago
Do you use this for your own subtitles too?
@underfitted
1 year ago
I don’t. Yet
@sigib911 1 year ago
Amazing! So how can I use it in real time? That would be even better.
@avnishmishra2770 1 year ago
Man, thanks for this lit video ❤
@underfitted
1 year ago
Definitely!
@mgreek31 1 year ago
Good 👍
@abandonedmuse 1 year ago
OMG ARE YOU CUBAN TOO? I live in Miami too! Yeah! Hey, we Cubans are into everything!
@underfitted
1 year ago
Ha ha! So true!
@RobinGlauser 1 year ago
Is there a way to fine-tune it? It would be awesome to be able to fine-tune the German part to also understand Swiss German :) Chuchichästli
@underfitted
1 year ago
Good question! I think there is, especially because they open-sourced the model.
@aleksandrszagorskis3306 1 year ago
Playback speed 1.5
@underfitted
1 year ago
Well, that will certainly go by quick! :)
@amphem 1 year ago
do French know about it?
@underfitted
1 year ago
It does know French, yes.
@nickkatsaras9721 1 year ago
Nice video, but putting your hand on the camera is a bad habit. Maybe consider not doing that anymore; it's really annoying. Otherwise, great.
@underfitted
1 year ago
Do you mean covering the lens during scene transitions? I never heard this was annoying before. Do you mind elaborating a bit? And thanks very much for the feedback! It helps a ton to make better videos!