GPT-4o - LMM (Audio, Vision & Text) by OpenAI | Faster, Cheaper & Smarter than GPT-4 Turbo

Meet GPT-4o (omni), OpenAI's advanced Large Multimodal Model (LMM). This powerful AI can take in text, audio, and images, and generate text, audio, and images in response. It performs just as well as GPT-4 Turbo when handling text in English and code, and it's even better with non-English languages. Plus, it's much faster and costs 50% less to use through the API.
Blog Post: openai.com/index/hello-gpt-4o/
Follow me on X: / venelin_valkov
AI Bootcamp: www.mlexpert.io/bootcamp
Discord: / discord
Subscribe: bit.ly/venelin-subscribe
GitHub repository: github.com/curiousily/AI-Boot...
00:00 - What is GPT-4o?
03:23 - Benchmarks
05:25 - New tokenizer(s)
06:18 - Availability (+ API)
07:28 - Text Generation Evaluation
09:08 - Document Understanding Evaluation
11:55 - Conclusion
Join this channel to get access to the perks and support my work:
/ @venelin_valkov
#chatgpt #gpt4 #llm #chatbot #artificialintelligence #llama

Пікірлер: 1

  • @nedyalkovs
    @nedyalkovs14 күн бұрын

    Hi Venelin it would interesting to see the voice and audio testing similar to OpenAI Demo