OpenAI GPT-4o API Explained | Tests and Predictions

Science and Technology

👊 Become a member and get access to GitHub and Code:
/ allaboutai
🤖 Great AI Engineer Course:
scrimba.com/learn/aiengineer?...
🔥 Open GitHub Repos:
github.com/AllAboutAI-YT/easy...
📧 Join the newsletter:
www.allabtai.com/newsletter/
🌐 My website:
www.allabtai.com
Explaining the OpenAI GPT-4o API. My predictions and some tests of what I think we can expect from GPT-4o API and the multimodal model.
00:00 OpenAI GPT-4o API Intro
03:23 OpenAI GPT-4o Explained
06:47 OpenAI GPT-4o Exploration

Comments: 36

  • @elliotanderson1585
    26 days ago

    It's going to be a game changer if OpenAI can actually deliver all the functions they demonstrated.

  • @Ginto_O
    26 days ago

    I just realized that the new OpenAI model killed your voice assistant projects, just as they did last time with GPTs.

  • @acllhes
    26 days ago

    It's a pattern I've noticed and expected since GPT-4 dropped.

  • @josephtilly258
    26 days ago

    Or it can make it easier to build one with low latency. In the GPT app you don't really have long-term memory or function calling, so there is still a lot of room to make a great, personalized local assistant imo. Plus, this opens the way to omniscient LLMs; maybe in the future local LLMs will have voice and vision natively.

  • @alexanderrosulek159
    26 days ago

    @josephtilly258 Maybe, but I doubt it will be this year. I can't even run the 7B text models, and a model that adds vision and audio understanding would need to be bigger.

  • @AllAboutAI
    26 days ago

    yes haha, but hey, that's technology and why we love it

  • @Ms.Robot.
    10 days ago

    This was very educational. Your instructions were clear and concise. ❤🎉

  • @elliotnyberg9332
    26 days ago

    I wonder how they manage to handle interruptions during the voice output, like in their demo, for the API.

  • @YunusDogan-yc4lx
    26 days ago

    Hi Kris, can you also do a vision version with a camera? Maybe some helper use cases.

  • @mwkoti
    25 days ago

    What we need ASAP is an open-source alternative to GPT-4o realtime speech-to-speech (as in the demos). I'm pro open source and I want full control of the application flow, preferably offline. Has anyone tried to use XTTS streaming capabilities successfully, for example by extending the AllAboutAI examples?

  • @francycharuto
    26 days ago

    Great content!

  • @AllAboutAI
    26 days ago

    thnx :D appreciate it

  • @Ou8y2k2
    26 days ago

    4:59 It's not horribly wrong, but I'd combine the and Voice IN in one graphic in the GPT-4o Voice API Now section and Voice OUT under the LLM RESPONSE. GPT-4o is going to be a game changer for education.

  • @brianpoillucci1805
    26 days ago

    You're the best, man.

  • @AllAboutAI
    26 days ago

    thnx :D appreciate it

  • @ionutownprint4198
    20 days ago

    Does the streaming audio function help with the latency?

  • @OliNorwell
    26 days ago

    That "Performance Scores" table - nice! If that is all correct then that's pretty impressive. Though I did a screenshot test myself and it mistook a 3 for an 8 so it might not be flawless.

  • @AllAboutAI
    26 days ago

    yeah, I expect it to just improve over time until it's perfect tho

  • @alirezasheikh8797
    25 days ago

    I think you're right. Just minutes after the OpenAI presentation ended, I asked on their developer forum whether voice in / voice out would be available to developers soon. They said only to a small group of "trusted" partners. So yeah, I'm not sure when we're gonna get access to this. You gotta be in that special circle. 😅

  • @ziadnahdi4343
    23 days ago

    It would be great if you could give it visual IQ tests, since those are difficult for it, like counting how many triangles are in a picture or finding the next number in a series.

  • @squiddymute
    26 days ago

    Multimodal or multi-model? Does anyone really believe it's a single model?

  • @zhenfu4556
    4 days ago

    How can I get this code?

  • @learnwithyan
    22 days ago

    My thought is that we already have a good chance to be so-so devs and still write perfect code 🎉😅

  • @athemis1180
    24 days ago

    Hi, great video! I'm curious how much those APIs cost. On their website I found text pricing in tokens, and it is pretty cheap and understandable. However, the image or "vision" function seems to be very expensive. I calculated it, and with Full HD on low settings it is going to cost about $5 for a minute at 15 fps. That's crazy, not to mention their TTS, which costs $15 per 1M tokens, which is pretty hilarious.
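
    The per-minute estimate above can be rechecked with a short calculation. This is only a sketch under stated assumptions: the tiling rule (85 base tokens per image, plus 170 tokens per 512 px tile in high detail) and the price of $5 per 1M input tokens are the GPT-4o figures OpenAI published around launch, not numbers from this page, and the function names are made up for illustration. Prices change, so check the current pricing page before relying on the result.

```python
import math

# Assumed values, not taken from this page: OpenAI's published GPT-4o
# figures around launch. Verify against the current pricing page.
USD_PER_MILLION_INPUT_TOKENS = 5.00
BASE_TOKENS = 85        # flat cost per image; the whole cost in low detail
TOKENS_PER_TILE = 170   # cost of each 512x512 tile in high detail


def image_tokens(width: int, height: int, detail: str = "high") -> int:
    """Estimate input tokens for one image under the assumed tiling rule."""
    if detail == "low":
        return BASE_TOKENS
    # Step 1: shrink to fit inside a 2048x2048 square (never enlarge).
    scale = min(1.0, 2048 / max(width, height))
    w, h = width * scale, height * scale
    # Step 2: shrink so the shortest side is at most 768 px.
    scale = min(1.0, 768 / min(w, h))
    w, h = round(w * scale), round(h * scale)
    # Step 3: count the 512 px tiles needed to cover the image.
    tiles = math.ceil(w / 512) * math.ceil(h / 512)
    return BASE_TOKENS + TOKENS_PER_TILE * tiles


def video_cost_per_minute(width: int, height: int, fps: int,
                          detail: str = "high") -> float:
    """Estimate the USD cost of sending every frame of one minute of video."""
    frames = fps * 60
    tokens = frames * image_tokens(width, height, detail)
    return tokens / 1_000_000 * USD_PER_MILLION_INPUT_TOKENS


if __name__ == "__main__":
    for detail in ("low", "high"):
        tokens = image_tokens(1920, 1080, detail)
        cost = video_cost_per_minute(1920, 1080, fps=15, detail=detail)
        print(f"{detail}: {tokens} tokens per frame, ${cost:.2f} per minute")
```

    Under these assumptions a Full HD frame costs 85 tokens in low detail and 1105 tokens in high detail, so one minute at 15 fps comes to roughly $0.38 or $4.97. That suggests the $5 figure corresponds to high detail rather than low.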

  • @RaysAiPixelClips
    26 days ago

    I made a script just like yours, and now with GPT-4o it kinda defeats the purpose... 😅 At least we can still use it with local models.

  • @gulludiscord
    24 days ago

    Hey bro, is this model free to use? I mean, can we explore it?

  • @RICHARDSON143
    26 days ago

    ❤❤❤

  • @Gamez4eveR
    26 days ago

    GPT5 may finally be able to tell how many r's are in the word "strawberry", but 4o will suffice with its ability to write a bigint from scratch in C in just a couple minutes of telling it to try again

  • @mirek190
    26 days ago

    Really? Even Llama 3 8B answers it...
    Q: How many r's are in the word "strawberry"?
    A: The word "strawberry" has 3 R's.

  • @Gamez4eveR
    26 days ago

    @mirek190 Ask it where they are. Also, which Llama 3 8B model? Mine failed on the first attempt, both Meta and Groq.

  • @Gamez4eveR
    26 days ago

    @mirek190 Here's Llama 3 70B failing:
    Q: How many r's in strawberry
    A: There are 3 R's in the word "strawberry".
    Q: Where are they?
    A: I apologize for the mistake! There are actually 2 R's in the word "strawberry". They are consecutive, appearing as "rr" in the middle of the word.

  • @Gamez4eveR
    26 days ago

    @mirek190 Llama 3 70B on Groq also failed this.

  • @Gamez4eveR
    26 days ago

    @mirek190 And here's Claude 3 Opus:
    The three r's in "strawberry" are located as follows:
    1. st*r*awberry (after the first "t")
    2. stra*w*berry (after the "w")
    3. strawbe*r*ry (near the end, before the final "y")
    You be the judge lmao
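
    The letter-counting question in this thread is easy to settle deterministically. A minimal Python sketch (the helper name `letter_positions` is made up for illustration) that counts the r's and reports where they are:

```python
def letter_positions(word: str, letter: str) -> list[int]:
    """Return the 1-based positions of `letter` in `word`, ignoring case."""
    target = letter.lower()
    return [i for i, ch in enumerate(word.lower(), start=1) if ch == target]


if __name__ == "__main__":
    positions = letter_positions("strawberry", "r")
    print(f"{len(positions)} r's at positions {positions}")
```

    Unlike the model answers quoted above, this gives 3 r's at positions 3, 8, and 9 (counting from 1): one after "st" and a consecutive pair before the final "y".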

  • @rodrigov.9252
    26 days ago

    god the number of men who will fall in love with that voice hahaha

  • @Ou8y2k2
    26 days ago

    It will make the movie _Her_ seem like an underestimate.
