Claude Opus vs. GPT4 - A Practical Review (with code examples)

A lot of data professionals want to explore freelancing, but lack the systems, tools, and guidance on how to get started. If you're curious about how we help data analysts, engineers, and scientists, beyond these videos, the link below will explain. No nonsense, no webinar and no "free training". Check it out: www.datalumina.com/data-freelancer
🔗 Link to repo
github.com/daveebbelaar/ai-experiments/tree/main
👋🏻 About Me
Hey there! I'm Dave, an AI Engineer and the founder of Datalumina, where our mission is to facilitate entrepreneurial and technological proficiency in professionals and businesses. Through my videos here on this channel, my posts on LinkedIn, and courses on Skool, I share practical strategies and tools to navigate the complexities of data, artificial intelligence, and entrepreneurship.
🛠️ My Development Workflow
kzread.info/dash/bejne/Zad93KV_pa_NebQ.html
✔️ My Project Management Tool
clickup.pxf.io/datalumina
🎓 Our Training Programs
www.skool.com/data-alchemy
www.skool.com/data-freelancer
🔗 Let's Connect
www.linkedin.com/in/daveebbelaar/
daveebbelaar
📥 Datalumina's Newsletter
www.datalumina.com/newsletter
📊 How I'm using data to track my health
join.whoop.com/datalumina
#data #ai #datalumina

Пікірлер: 56

  • @daveebbelaar
    @daveebbelaar4 ай бұрын

    What do you think of these new models? Let me know in the comments!👇🏻

  • @icandreamstream
    @icandreamstream4 ай бұрын

    Worked with Claude yesterday for a while. It is “smarter” in some unstructured language processing and seems to deal with larger contexts better, and slightly better in abstract thought, sometimes. But it’s not better at everything. It trades blows with GPT-4, so that’s a good start. We really haven’t had any viable competitors before, so this is needed.

  • @videos6505

    @videos6505

    4 ай бұрын

    GPT-5 will be released (soon).

  • @daveebbelaar

    @daveebbelaar

    4 ай бұрын

    Probably ;)

  • @Mourne84

    @Mourne84

    3 ай бұрын

    How soon?

  • @takeabreakism
    @takeabreakism4 ай бұрын

    Hey Dave, what theme and font do you use for vscode? Looks great. I am trying to get the look of Gemini code window, but can't get there quite yet.

  • @daveebbelaar

    @daveebbelaar

    4 ай бұрын

    Thanks! Check out this video: kzread.info/dash/bejne/Zad93KV_pa_NebQ.html

  • @rogierhelmus3481
    @rogierhelmus34814 ай бұрын

    @Dave: you mention that you prefer using your own wrappers instead of being dependant on langchain for instance. Does that mean that you have been able to build agent&team setups yourself? If so, I'm interested how since I was just about to dive into langchain and crew-AI. Ps: I'm beginner when it comes to Python.

  • @daveebbelaar

    @daveebbelaar

    4 ай бұрын

    Yea I mostly build the agent frameworks myself. Will consider this for a future video.

  • @burdenedbyhope
    @burdenedbyhope4 ай бұрын

    I always have good experiences with Claude. Claude 1.0 instant back then had the better reasoning and instructions following than gpt3.5 (most of the cases) and sometimes even gpt4 (in some specific cases). Unfortunately they figured out I’m in Vietnam and revoked my access 😢. Replacing gpt4 with claude 3 sonnet (via openrouter) yesterday for my game master AI and clear 40% of the issues just with the better and more consistent reasoning and instructions following.

  • @burdenedbyhope

    @burdenedbyhope

    4 ай бұрын

    and json schema works much better with claude, gpt4 sometimes still return invalid json not sure about python, but with typescript you can use typebox or zod to create the schema for prompt and use it as static type to infer in your code as well

  • @diaryofafounder
    @diaryofafounder4 ай бұрын

    Hey Dave, can I ask how you record yourself like that in Portrait mode? What gear are you using?

  • @daveebbelaar

    @daveebbelaar

    4 ай бұрын

    Sure! Ecamm, Sony ZV-E10, Sigma 16mm F1.4, Camlink 4k, Amaran 60d

  • @diaryofafounder

    @diaryofafounder

    3 ай бұрын

    Thanks! Keep up the great videos!

  • @ibrahimahmed3434
    @ibrahimahmed34344 ай бұрын

    Hello, how do you do ! first thanks for all your efforts I really appreciate , I want you to suggest me a plan to learn aws or Google cloud or the one you suggest me. Also as a data scientist should I learn deep learning and computer vision and nlp and llms (I am talking as a beginner or a junior to apply a job what are the specifications needed) thanks again

  • @jasonholdener9036
    @jasonholdener90363 ай бұрын

    Great video Dave, keep them coming!

  • @alfglobalservices
    @alfglobalservices4 ай бұрын

    Thanks for the video!

  • @daveebbelaar

    @daveebbelaar

    4 ай бұрын

    You're welcome!

  • @roberth8737
    @roberth87374 ай бұрын

    What cool tidbit regarding the "{", never read that deeply through their docs to find that - good find!

  • @daveebbelaar

    @daveebbelaar

    4 ай бұрын

    Yea that’s a neat little feature, useful for many other cases as well a certain output it desired.

  • @endoflevelboss
    @endoflevelboss4 ай бұрын

    My favorite quick question for these models... "What is the first palindromic multiple of 13" Answer should be 38 x 13 = 494. Claude Sonnet couldn't get it (e.g. 91 is the first because it reads the same forward [91] as backward [19] !? ) --- and repeated wrong answers like this over and over. ChatGPT 4 writes a Python program to solve it correctly. I would be interested to see if Opus gets it right.

  • @endoflevelboss

    @endoflevelboss

    4 ай бұрын

    Cool thanks for the info. I won't take my OpenAI flag down just yet 👍

  • @ktb1381

    @ktb1381

    4 ай бұрын

    I tried opus three times in a row and it failed each time. But then I tried it with Chad GpT4 and it failed that time too. So.... Here's the last run on opus: Okay, let's approach this step-by-step: 1) First, let's understand what a palindromic number is. A palindromic number is a number that remains the same when its digits are reversed. For example, 121, 34543, 11611, etc. 2) Now, we need to find the first multiple of 13 that is a palindrome. 3) Let's start listing out the multiples of 13: 13 x 1 = 13 13 x 2 = 26 13 x 3 = 39 13 x 4 = 52 13 x 5 = 65 13 x 6 = 78 13 x 7 = 91 13 x 8 = 104 13 x 9 = 117 13 x 10 = 130 13 x 11 = 143 4) None of these are palindromes. Let's keep going. 13 x 12 = 156 13 x 13 = 169 13 x 14 = 182 13 x 15 = 195 13 x 16 = 208 13 x 17 = 221 5) Here we go! 221 is the first palindromic multiple of 13. Therefore, the first palindromic multiple of 13 is 221.

  • @endoflevelboss

    @endoflevelboss

    4 ай бұрын

    @@ktb1381 yeah what I said, the wrong answer. And why did ChatGPT 4 give you the wrong answer when it gives me the right answer every time?

  • @ktb1381

    @ktb1381

    4 ай бұрын

    @@endoflevelboss oh sorry I thought you were interested in seeing someone try it with opus. Regarding chat GPT differences, I double checked and actually I did get it right when I tried that so same results. Although in theory LLMs can give different answers when they are prompted the same way multiple times. Although maybe not now with the Python code check who knows.

  • @superresistant0
    @superresistant04 ай бұрын

    What’s your browser?

  • @daveebbelaar

    @daveebbelaar

    4 ай бұрын

    Arc Browser

  • @BangaloreYoutube
    @BangaloreYoutube4 ай бұрын

    I was just about to go test this 😂😂 thanks for doing it for me the json output part, Funny thing KZread pointed me straight to this video and to the timestamp of the exact time My search was "Claude json output" damn.

  • @daveebbelaar

    @daveebbelaar

    4 ай бұрын

    Wow that's awesome haha

  • @PeterPan-hs5tu
    @PeterPan-hs5tu2 ай бұрын

    great to see Dave’s vid again, somehow youtube stop pushing your vid to my list 😅

  • @No_Masterpiece
    @No_Masterpiece4 ай бұрын

    When am i getting replaced as a software engineer soon?

  • @qozia1370

    @qozia1370

    3 ай бұрын

    2 years max. You are already being replaced.

  • @eugenmalatov5470
    @eugenmalatov54703 ай бұрын

    isn't it blocked in the EU?

  • @kirschdieblp4242

    @kirschdieblp4242

    3 ай бұрын

    Yes but not the API

  • @dontwannabefound
    @dontwannabefound4 ай бұрын

    The bracket thing seems so stupid - does no one try this shit before releasing it

  • @micbab-vg2mu
    @micbab-vg2mu4 ай бұрын

    Gemini Ultra is worse than GPT-4. Claude 3 Opus is the real deal. Based on my preliminary tests, it's better than GPT-4. It is my default LLM now. :)

  • @videos6505

    @videos6505

    4 ай бұрын

    Opus is very expensive.

  • @micbab-vg2mu

    @micbab-vg2mu

    4 ай бұрын

    For longer text and output I use chat Poe. When I use API I set up words output limitation - In my case quality of output is more imprortant than price. For less demanding task I think haiku model will be perfect is cheap and quality is between GPT3.5 and 4. In the future for my projects I will mix Heiku/ Sonet with Opus model for the best cost quality balance. @@videos6505

  • @daveebbelaar

    @daveebbelaar

    4 ай бұрын

    Thanks for sharing!

  • @vdeomkr70

    @vdeomkr70

    4 ай бұрын

    Until tomorrow.

  • @johnamckinley
    @johnamckinley4 ай бұрын

    Was quite disappointed with Claude and initial efforts to do parallel code gen comparison with gpt4. Both have a long way to go, and the hype factor for new Claude release for code gen is not warranted imho

  • @Dave-cg9li

    @Dave-cg9li

    4 ай бұрын

    I've tested some pdf summaries and having it teach me some concepts based on the documents, and Claude was on a completely different level there - both in retrieving the information and teaching it. The documents were short, so the context didn't play a role. Haven't yet tested it for writing code though.

  • @alexanderpopelyuk741
    @alexanderpopelyuk7413 ай бұрын

    come to see Claude review, instead got a lecture on how to get json output 😂

  • @daveebbelaar

    @daveebbelaar

    3 ай бұрын

    Structured output is important 😂

  • @lugaidster
    @lugaidsterАй бұрын

    3 minutes in and still no review.... Dude...

  • @Accuface2000
    @Accuface20003 ай бұрын

    ChatGPT-4 is overrated. Its not better than 3.5 in my honest view and looks worse. ChatGPT-4 has some stubborn qualities and laziness, it will skip one or two clear instructions in a detailed but brief prompt. It puts words in you mouth and is too lazy to complete. Claude Opus is super attentive, excellent in understanding detail and producing long answers.

  • @qozia1370
    @qozia13703 ай бұрын

    "AI Engineer" Hahahaha, he says so because he can type things into chatgpt. Do you create CNNs professionally? You do not. Do you implement ai algorithms in robots? You do not.

  • @henrydeutsch5130
    @henrydeutsch51304 ай бұрын

    Almost your entire video is about JSON format, no one except you cares about this. Nearly your entire video is irrelevant to the vast majority of your viewers, not good.

  • @daveebbelaar

    @daveebbelaar

    3 ай бұрын

    Fair point. The video unexpectably went into that direction as I was doing my research, but the main goal of this video is to provide you with a practical framework to start running and comparing your own tests with both models.

  • @dstutz

    @dstutz

    3 ай бұрын

    You can leave feedback without being a total jerk about it you know

  • @henrydeutsch5130

    @henrydeutsch5130

    3 ай бұрын

    @@dstutz ye my b I forgot creators read comments lol

  • @henrydeutsch5130

    @henrydeutsch5130

    3 ай бұрын

    @@dstutz ye my b I forgot creators read comments sometimes lol

  • @WeAskToAI
    @WeAskToAIАй бұрын

    You are an AI developer? And what you were 1 year ago? A developer? Man if this is not hype...

  • @daveebbelaar

    @daveebbelaar

    Ай бұрын

    Got a bachelor and master in AI. Started in 2013 😉

  • @jamminrebel3614
    @jamminrebel36144 ай бұрын

    top content, may the algo pick u up and carry this faar xD

  • @daveebbelaar

    @daveebbelaar

    4 ай бұрын

    📈🙏🏻