OpenAI’s Whisper is AMAZING!
Ғылым және технология
I ran OpenAI’s Whisper model in a notebook and used it to transcribe and translate my voice.
Link to the notebook: colab.research.google.com/dri...
🔔 Subscribe for more stories: www.youtube.com/@underfitted?...
📚 My 3 favorite Machine Learning books:
• Deep Learning With Python, Second Edition - amzn.to/3xA3bVI
• Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow - amzn.to/3BOX3LP
• Machine Learning with PyTorch and Scikit-Learn - amzn.to/3f7dAC8
Twitter: / svpino
Disclaimer: Some of the links included in this description are affiliate links where I'll earn a small commission if you purchase something. There's no cost to you.
Пікірлер: 42
Model is looking great! Thank you for sharing :D
@underfitted
Жыл бұрын
Sure thing!
The quality of your videos is GREAT! your channel will blow up soon! Edit: May I ask which font you have used at 5:10?
@underfitted
Жыл бұрын
Thank you very much! That’s my handwriting :) I did it on an iPad.
That's really amazing. Nice work Santiago
@underfitted
Жыл бұрын
Thanks!
Thank you for the instructions and the colab notebook. Surprisingly, it works for Turkish as well.
@underfitted
Жыл бұрын
Awesome!
Please keep sharing!
@underfitted
Жыл бұрын
Will do!
Daaayuuumm, that was quick and easy!
@underfitted
Жыл бұрын
Yessir
great work, great video. thanks
@underfitted
Жыл бұрын
Thanks!
this video is AWESOME. I do have a question though, How and where do I upload my prerecorded audio file? Thanks,
@underfitted
Жыл бұрын
You can upload it to Colab. There's a file view on the sidebar. You can drag and drop files there to upload them. Keep in mind they will be deleted when the Runtime is restarted.
Can I use the whisper api to transcribe sound errors? I’m a speech therapist so I want to see where the user makes an error “wabbit” instead of “rabbit” without whisper autocorrecting
Love it!
@underfitted
Жыл бұрын
Thanks :)
Hey Santiago, this video and your Diffusion video have a stereo audio imbalance between left and right. Realized it when listening with headphones.
@underfitted
Жыл бұрын
Yeah. Pretty crappy audio. I think I fixed it in my last few videos.
thanks for sharing
@underfitted
Жыл бұрын
Absolutely!
I have tried this AI now a few hours and its incredibly impressive so far. It is super slow without a GPU tho.
@underfitted
Жыл бұрын
Yeah to both of those comments. You need a GPU is you want to do something useful with it.
@MyHowHowHow
Жыл бұрын
You can get great performance on CPU if you use the fork called WhisperCpp.
Do you use this for your own subtitles too?
@underfitted
Жыл бұрын
I don’t. Yet
Amazing, so how can I used in realtime? that would be even great
Man, thanks for this Litt video❤
@underfitted
Жыл бұрын
Definitely!
Good 👍
OMG ARE YOU CUBAN TOO? I live in miami too! Yeah! Oye los Cubanos estamos en todo!
@underfitted
Жыл бұрын
Ha ha! Verdad que si!
Is there a way to fine tune it? It would be awesome to be able to fine tune the german part to also understand swiss german :) Chuchichästli
@underfitted
Жыл бұрын
Good question! I think there is, especially because they open sourced the model.
Playback speed 1.5
@underfitted
Жыл бұрын
Well, that will certainly go by quick! :)
do French know about it?
@underfitted
Жыл бұрын
It does know French, yes.
Nice video, but putting your hand on the camera is a bad habit.. maybe consider not doing that anymore, it's really annoying. Otherwise, great.
@underfitted
Жыл бұрын
Do you mean covering the lens during scene transitions? I never heard this was annoying before. Do you mind elaborating a bit? And thanks very much for the feedback! It helps a ton to make better videos!