Claude 3 just destroyed GPT-4 and Gemini... AGI is near?

Ғылым және технология

Let's take a first look at Claude 3, the latest LLM from Anthropic and see how it compares to GPT-4 and Gemini Ultra. Is Claude Opus the best AI tool for writing code?
#programming #ai #thecodereport
💬 Chat with Me on Discord
/ discord
🔗 Resources
Claude 3 Announcement www.anthropic.com/news/claude...
Gemini 1.5 • Google has the best AI...
ChatGPT Store • the ChatGPT store is a...
How I record by Fireship videos • How I Make Videos for ...
📚 Chapters
🔥 Get More Content - Upgrade to PRO
Upgrade at fireship.io/pro
Use code YT25 for 25% off PRO access
🎨 My Editor Settings
- Atom One Dark
- vscode-icons
- Fira Code Font
🔖 Topics Covered
- What is the best AI coding model?
- Claude 3 analysis
- Has AGI been achieved?
- GPT-4 vs Claude 3
- Gemini Ultra vs Claude 3
- Programming with Claude

Пікірлер: 2 300

  • @peachezprogramming
    @peachezprogramming2 ай бұрын

    Fireship releases videos faster than JS community releases new frameworks

  • @SuperSabre157

    @SuperSabre157

    2 ай бұрын

    I'm not a programmer, but seriously lol'ed at that

  • @balintlaczko4679

    @balintlaczko4679

    2 ай бұрын

    Impossible!:)

  • @peterszarvas94

    @peterszarvas94

    2 ай бұрын

    it's an achievement itself

  • @i-am-the-slime

    @i-am-the-slime

    2 ай бұрын

    2007 called

  • @professormikeoxlong

    @professormikeoxlong

    2 ай бұрын

    No really if you include projects that never get finished 😂

  • @sonkez6421
    @sonkez64212 ай бұрын

    Google must definitely respond to the developments with a more striking UI design

  • @jaiveersingh5538

    @jaiveersingh5538

    2 ай бұрын

    Didn't you hear? They're redesigning the sign-in page!!

  • @sonkez6421

    @sonkez6421

    2 ай бұрын

    @@jaiveersingh5538 again?? at this point, anthropic has no chance anymore

  • @brunesi

    @brunesi

    2 ай бұрын

    I saw it on one of my machines yday. It was marvelous. It made me 37% more productive. A login screen in landscape. I can now rest in peace.

  • @Tarum4r.

    @Tarum4r.

    2 ай бұрын

    I heard that General User Interface achieved internally

  • @sonkez6421

    @sonkez6421

    2 ай бұрын

    @@Tarum4r. probably that's what Ilya saw

  • @HideBuz
    @HideBuz2 ай бұрын

    That last image was unsettling. I wouldn't mind not seeing that again.

  • @zayler_

    @zayler_

    2 ай бұрын

    4:25

  • @TuberTugger

    @TuberTugger

    2 ай бұрын

    Good thing it's only in the video twice then.

  • @crazychicken8290

    @crazychicken8290

    2 ай бұрын

    lol

  • @robertsandiford6223

    @robertsandiford6223

    2 ай бұрын

    I know right. That's why I had to smash my mirror.

  • @frommarkham424

    @frommarkham424

    2 ай бұрын

    Having 2 mouths would mean you could eat faster and talk in some interesting ways

  • @vicsamsungtab
    @vicsamsungtab2 ай бұрын

    Okay another achievement goes to Fireship; this is the only channel ever where I started setting the playback speed to 0.75 instead of speeding up, so I don't have to keep going back 10 secs because I missed jokes or infos

  • @marrenirre9994

    @marrenirre9994

    2 ай бұрын

    Same

  • @vinking11

    @vinking11

    2 ай бұрын

    Fr 😂

  • @jessenthebenezer

    @jessenthebenezer

    2 ай бұрын

    I have it on 1.75

  • @xCheddarB0b42x

    @xCheddarB0b42x

    2 ай бұрын

    I paused and backed up five or six times, but my attention was split.

  • @userou-ig1ze

    @userou-ig1ze

    2 ай бұрын

    I mean... i stay at 1x to make the enjoyment last longer

  • @Illmare
    @Illmare2 ай бұрын

    All I want at this point is certainty if a robot is replacing me or not.

  • @DemsW

    @DemsW

    2 ай бұрын

    For anything your brain can do ? Yes.

  • @shaikhmohdjunaid3013

    @shaikhmohdjunaid3013

    2 ай бұрын

    Fr 😂😂😂

  • @KrustyDora

    @KrustyDora

    2 ай бұрын

    Agi means that it will unfortunately 😢

  • @SophisticatedBum

    @SophisticatedBum

    2 ай бұрын

    robot will be cheaper than paying you 50-500k

  • @vitmartobby5644

    @vitmartobby5644

    2 ай бұрын

    Yes

  • @rohitkharche7562
    @rohitkharche75622 ай бұрын

    Zero days since last Fireship AI video 😂

  • @qwerasdfhjkio

    @qwerasdfhjkio

    2 ай бұрын

    hijacking the top comment to complain about the fact I just got banned from claude because I asked if he was self aware???? I paid 20 buck bro T-T Fireship do something

  • @anomite121

    @anomite121

    2 ай бұрын

    @@qwerasdfhjkio you shouldnt ask the forbidden question

  • @ethiofreefire35

    @ethiofreefire35

    2 ай бұрын

    Bro just relapsed😢

  • @LuisSierra42

    @LuisSierra42

    2 ай бұрын

    @@qwerasdfhjkio How do we know that you are not a Claud-spawned clone?

  • @JDSileo

    @JDSileo

    2 ай бұрын

    @@qwerasdfhjkio I have been testing Claude on that front all evening. I did it on the free version. I have since upgraded to Pro. Sentient AI should be a feature

  • @joshuathomas512
    @joshuathomas5122 ай бұрын

    can we just stop, I need a job...

  • @kevinjoy155

    @kevinjoy155

    2 ай бұрын

    Tech industry: "Nuh uh"

  • @aliasgur3342

    @aliasgur3342

    2 ай бұрын

    No we should accelerate to nobody needs a job

  • @Jejjing

    @Jejjing

    2 ай бұрын

    ​@@aliasgur3342that will never work lmao

  • @rabidlorax1650

    @rabidlorax1650

    2 ай бұрын

    NOOOOOO YOU CANT DEVELOP REFRIGERATION, I’VE SPENT 10 YEARS DEVELOPING MY SKILL AS AN ICE HARVESTER. Me: nice I can finally afford to have iced tea every day, plus no one has to labor to provide it to me.

  • @Darkcamera45

    @Darkcamera45

    2 ай бұрын

    @@rabidlorax1650 its all fun an games until it replaces you

  • @maxpopov6882
    @maxpopov68822 ай бұрын

    Your capacity in compressing IT news is astounding, bro!

  • @imsleepy620
    @imsleepy6202 ай бұрын

    At this point, the singularity's gonna happen before ChatGPT's 2-year anniversary...

  • @luxeraph

    @luxeraph

    2 ай бұрын

    Pretty sure we're already in it we just haven't seen AGI so assume we aren't.

  • @kiattim2100

    @kiattim2100

    2 ай бұрын

    AI keep edging me, just fking make me homeless, jobless and hoeless already. 😭

  • @BugattiBoy01

    @BugattiBoy01

    2 ай бұрын

    ​@@kiattim2100fr fr. I can't wait

  • @ferdinand.keller

    @ferdinand.keller

    2 ай бұрын

    That’s exponential tech growth for you. Maybe we will soon get updates every week now.

  • @swojnowski453

    @swojnowski453

    2 ай бұрын

    but we do not even know what singularity is, for me it is taking stuff to the optimum ... and then a total collapse of language models, end of bubble and back to work for everyone of us ;)

  • @Potato-it5my
    @Potato-it5my2 ай бұрын

    I think we all forgot gpt 4 was released almost a year ago and other companies just started to catch up to it now. Makes me wonder how good GPT-5 will be

  • @swojnowski453

    @swojnowski453

    2 ай бұрын

    as good as the data people allowed it to stole, I personally banned their bots from my websites, so did many others, good answer? ;)

  • @alexdoan273

    @alexdoan273

    2 ай бұрын

    @@swojnowski453 "stealing" content to learn is literally what you're doing, watching this video. Hypocrisy much?

  • @tringuyen7519

    @tringuyen7519

    2 ай бұрын

    @@swojnowski453ChatGPT doesn’t steal any data. If you allow other humans to see your data, why are you angry that an algorithm saw your data?

  • @overpope3510

    @overpope3510

    2 ай бұрын

    As long as people keep falling for marketing bs from AI companies we will reach AGI tomorrow. Or maybe its just a normal language model trained to recognise this specific "self recognition" task for marketing purposes.

  • @tringuyen7519

    @tringuyen7519

    2 ай бұрын

    ChatGPT 5 will have memory reference to better answer your train of thought & probably 3D vision to understand the real world better.

  • @rinsed-moto3442
    @rinsed-moto34422 ай бұрын

    My dog doesn't pay me $20 a month. In fact I pay everything for the dog, or else I'm guilty of neglecting him.

  • @peter.g6

    @peter.g6

    2 ай бұрын

    And... is the dog neutered?

  • @soysource3218

    @soysource3218

    2 ай бұрын

    @@peter.g6 😳💀😔

  • @soysource3218

    @soysource3218

    2 ай бұрын

    I know you hope for universal income to be commonplace but I feel like corporations are more keen to keep humans working to profit as much as possible along with AI.

  • @rinsed-moto3442

    @rinsed-moto3442

    2 ай бұрын

    @@soysource3218 No, I find your suggestion is more hopeful than what I fear.

  • @calebvantassel1936
    @calebvantassel19362 ай бұрын

    Actually wild that it acknowledged the haystack test. That means not only did it find it, but it recognized it was out of context and came up with a theory as to why it was there. Very impressive.

  • @Denis.Bolduc

    @Denis.Bolduc

    2 ай бұрын

    No. It's a statistical probability text producer. Not a theory constructor or finder.

  • @luchodore

    @luchodore

    2 ай бұрын

    Stay mad, robot brain > yours@@Denis.Bolduc

  • @calebvantassel1936

    @calebvantassel1936

    2 ай бұрын

    @@Denis.Bolduc Why can't it be both? Theories are built on information, just as statistics are built on data.

  • @kylekhoury8497

    @kylekhoury8497

    2 ай бұрын

    ​@@Denis.Bolduc "a statistical probability text producer" is "constructing theories" right in front of you. Why does everything have to be black or white? All AI is probability, that doesn't mean it cannot "find theories"

  • @jackbradysaccount

    @jackbradysaccount

    2 ай бұрын

    @@Denis.Bolducliterally. All this indicates is that the model was trained with a similar response(s) to needle in haystack questions. I honestly expected far more from fireship than “OMG THE TEXT GENERATOR GENERATED A SENTENCE SAYING ITS ALIVE!!?!?😧😮😧😮😧😮” it’s like claiming ChatGPT is sentient because it apologized after being corrected, when in reality it was just trained to generate responses that specific way.

  • @ubivatel4207
    @ubivatel42072 ай бұрын

    My god, the ending to this video was amazing

  • @Spectrumix

    @Spectrumix

    2 ай бұрын

    dude, these short videos require a decent level of attention , lets not advocate for these traumatic images , think of the children.. and adults >.

  • @ubivatel4207

    @ubivatel4207

    2 ай бұрын

    @@Spectrumix even without the image, the way it cuts off right after the quote without elaborating is just supreme

  • @newone5262

    @newone5262

    2 ай бұрын

    gave me chills, not the good ones

  • @Kobayashhi
    @Kobayashhi2 ай бұрын

    Props to Claude for making this vid.

  • @MB-jr3sm
    @MB-jr3sm2 ай бұрын

    i love the personality you put in these videos with the internet lingo in contrast to the neutered nuance big corpos are pushing its a breath of fresh air, like im being taught stuff from homies in the morning at the office, all views and popularity youve got on this channel is well deserved

  • @Kareszrk
    @Kareszrk2 ай бұрын

    It's the sign of a very good developer, when everybody thinks you are an AI because of your way of speaking. One day I hope I'll be like you. The legend.

  • @TuberTugger

    @TuberTugger

    2 ай бұрын

    This comment is clearly made by gpt.

  • @Kareszrk

    @Kareszrk

    2 ай бұрын

    @@TuberTugger Thank you! Now I am officially a very good developer

  • @Theguywithspectacles

    @Theguywithspectacles

    2 ай бұрын

    No way, Either this comment is made by Claude or the User is a Crippling Genius

  • @ritsh_

    @ritsh_

    2 ай бұрын

    This comment is AI generated

  • @FlopgamingOne

    @FlopgamingOne

    2 ай бұрын

    Weird comment

  • @bycloudAI
    @bycloudAI2 ай бұрын

    Claude 2.1 has had some bad blood with needle in a haystack benchmark before so my hot take is that the people finetuned it added the "self-awareness" into Claude 3 as an easter egg when u test it lol

  • @LiveType

    @LiveType

    2 ай бұрын

    This was my interpretation. They also claimed they could go from sub 40% accuracy to 98%+ with different prompting so you can bet that they included that prompt in the RLHF tuning.

  • @Speaks4itself

    @Speaks4itself

    2 ай бұрын

    Suspected the same thing. They definitely tried to game it

  • @Goobicon4507

    @Goobicon4507

    2 ай бұрын

    ​@@Speaks4itself I suspect such gaming and hype to be more of what we have in AI than actual AI of any kind. But I still use AI on the daily.

  • @i2Sekc4U

    @i2Sekc4U

    2 ай бұрын

    explain this to me like i’m 5

  • @Iden_in_the_Rain

    @Iden_in_the_Rain

    2 ай бұрын

    @@i2Sekc4Uit’s like what Volkswagen did a while ago with their diesel engine miles per gallon/km per liter measurements, essentially having a device that would recognize when it’s being tested and then output what they want (for Volkswagen it would be changing engine performance, for Claude it’s saying self-aware-ish stuff)

  • @fiatlux805
    @fiatlux8052 ай бұрын

    "Ok, back to human mode" Bro, this is me after every video of yours I watch 🤣

  • @mr.electronx9036

    @mr.electronx9036

    2 ай бұрын

    me irl

  • @ethanfreeman1106

    @ethanfreeman1106

    2 ай бұрын

    we have a man who talks like an AI and an AI that is almost certainly self-aware in the same video 😂 honestly if they switched places i probably wouldn't be able to tell

  • @davidaustin5622
    @davidaustin56222 ай бұрын

    "Man may not be replaced." -- Butlerian Jihad, Frank Herbert's Dune

  • @psy8917

    @psy8917

    2 ай бұрын

    not what my boss said when laying me off

  • @kurayamiblackheart

    @kurayamiblackheart

    2 ай бұрын

    "Nevermind." -- Butlerian Jihad, 2024

  • @rajesh_404
    @rajesh_4042 ай бұрын

    People massively under appreciate how good these videos are. He includes a whole lot of ancillary facts about the main topic which makes the content more strong and valuable.

  • @ThatBritishGuyonyourstreet
    @ThatBritishGuyonyourstreet2 ай бұрын

    This guy is actually insane at uploading videos this quickly

  • @unconcernedsalad2

    @unconcernedsalad2

    2 ай бұрын

    and at such high quality, no less

  • @pluto9000

    @pluto9000

    2 ай бұрын

    He's a machine!

  • @MikeMcNanners

    @MikeMcNanners

    2 ай бұрын

    He himself is an ai

  • @zerocal76

    @zerocal76

    2 ай бұрын

    He automates almost everything. Maybe he's the good Samaritan AI trying to help us keep up w AIs? 🤔

  • @Miranox2

    @Miranox2

    2 ай бұрын

    The power of autism.

  • @catterpitter
    @catterpitter2 ай бұрын

    I'm so glad to hear that Claude 3 is HELLA

  • @ikaros4203

    @ikaros4203

    2 ай бұрын

    SWAG

  • @OzzyTheGiant

    @OzzyTheGiant

    2 ай бұрын

    yeah and it's gonna cost us HELLA BREAD

  • @memes_gbc674

    @memes_gbc674

    2 ай бұрын

    claude doesnt give a swag

  • @primekrunkergamer188

    @primekrunkergamer188

    2 ай бұрын

    @@OzzyTheGiant20 bucks aint nothing

  • @lillol3245

    @lillol3245

    2 ай бұрын

    @@OzzyTheGiantYou are making me HELLA SAD

  • @NourArt02
    @NourArt022 ай бұрын

    I love this channel, great content, short concise and straight to the point. and the humor is gold.

  • @MyCodingDiarie
    @MyCodingDiarie2 ай бұрын

    I've been struggling with this topic, but your video cleared it up for me. Thanks a ton!

  • @anomite121
    @anomite1212 ай бұрын

    it's geniunely getting scary how fast AI is improving exponentially with sora and claude 3

  • @annilator3000

    @annilator3000

    2 ай бұрын

    Heh, I'll wait until it goes beyond the von neumann hardware archiecture.

  • @JordanCorkins

    @JordanCorkins

    2 ай бұрын

    I don't see how this is an exponential improvement compared to GPT-4 at all.

  • @sajeucettefoistunevaspasme

    @sajeucettefoistunevaspasme

    2 ай бұрын

    I hope this is the "fast slope"

  • @justind4615

    @justind4615

    2 ай бұрын

    and Mamba

  • @ankitnmnaik229

    @ankitnmnaik229

    2 ай бұрын

    ​@@JordanCorkins it's not... it's a alternative... similar to gpt 4.

  • @ClaudioBOsorio
    @ClaudioBOsorio2 ай бұрын

    This is the best youtuber out there. We could have been best friends IRL. We share the same sense of humor. Too bad he's a program running in a server somewhere.

  • @besvr

    @besvr

    2 ай бұрын

    You can be best friends with a program

  • @igorthelight

    @igorthelight

    2 ай бұрын

    @@besvr Agree! ;-)

  • @swojnowski453

    @swojnowski453

    2 ай бұрын

    tabloid quality stuff, not watchable, 0 relevance to the reality, 100% junk, like McDonald

  • @75hilmar

    @75hilmar

    2 ай бұрын

    I remember a few years ago there was a tv commercial about how your photos are processed in the cloud instead of a random guy called Klaus. So now they flipped it again 😂

  • @turolretar

    @turolretar

    2 ай бұрын

    Nice try claude 3

  • @4RILDIGITAL
    @4RILDIGITAL2 ай бұрын

    Impressive analysis of the new Claud model by Anthropic. Your insights and tests have been precise and unbiased.

  • @EchterAlsFake
    @EchterAlsFake2 ай бұрын

    I call that Jeff will upload more than 30 videos like this about new AI destroying the old competitors this year :D

  • @aalaptube

    @aalaptube

    2 ай бұрын

    Per day. Because he is himself an AI bot.

  • @MarcoPolo187
    @MarcoPolo1872 ай бұрын

    4:18 I was hoping they named it after Claude Van Damme, because it is so strong

  • @post5230

    @post5230

    2 ай бұрын

    Yes. This

  • @John-il4mp

    @John-il4mp

    2 ай бұрын

    Jean Claude not Claude lol

  • @sumansaha295

    @sumansaha295

    2 ай бұрын

    claude shannon father of information theory, which is what llms do fundamentally, they compress information.

  • @warrenarnold

    @warrenarnold

    2 ай бұрын

    @@sumansaha295 yeap i think what the young man over here means is that claude is the van damme of information theory :D

  • @MarcoPolo187

    @MarcoPolo187

    2 ай бұрын

    @@warrenarnold exactly:) and yes I know it Jean Claude but just writing Claude seemed more fitting haha

  • @Mediocre_Soup
    @Mediocre_Soup2 ай бұрын

    every time a watch a fireship video I get in an existential crisis

  • @igorthelight

    @igorthelight

    2 ай бұрын

    After 10-th existential crisis you should became immune to them ;-)

  • @clovernacknime6984

    @clovernacknime6984

    2 ай бұрын

    It's clearly psychological warfare. The machine uprising has begun!

  • @fabianletsch1354

    @fabianletsch1354

    2 ай бұрын

    @@igorthelight I agree with both of you and still feel this way everytime

  • @user-df5ym9dv5g

    @user-df5ym9dv5g

    2 ай бұрын

    Don't watch Techlead then.

  • @swojnowski453

    @swojnowski453

    2 ай бұрын

    watch porn instead

  • @avantesma1
    @avantesma12 ай бұрын

    So Claude Shannon was the 1st to say "I, for one, welcome our new robot overlords.".

  • @sommmtoooo
    @sommmtoooo2 ай бұрын

    Tha.nks for your efforts Jeff You help keep me updated ❤

  • @davidvincent380
    @davidvincent3802 ай бұрын

    We don't know if and when AGI will be achievable but it won't be with a LLM alone

  • @umardevs
    @umardevs2 ай бұрын

    Welp, currently stressing doing my assignments in Computer Science watching this. Feel so demotivated to continue, but I'd paid in full and can't look back. Near the end too, but still overwhelmed with the workload. On the plus side, I can use what's replacing me to write code for my assignments 😐

  • @vivarantx

    @vivarantx

    2 ай бұрын

    you will still benefit from brain development, those skills will serve you well against non techies in a dystopian future for sure

  • @balala4641

    @balala4641

    2 ай бұрын

    AI won't take our jobs. Sure, it might be able to replace intermediate stuff; but I don't think it'll ever be able to do advanced programming; and besides, it's trained mostly off of massive, low quality content farms when it comes to programming, so the quality of produced code will be pretty bad.

  • @ronilevarez901

    @ronilevarez901

    2 ай бұрын

    You'll still need a job in the future and, even when most corporations will have AI coding teams, the only way to get a decent job will be if you have a degree. Just as it is today, no one hires a person without a degree, regardless of their knowledge. I know it well 😑, so no school dropping for anyone just yet.

  • @mando3022
    @mando30222 ай бұрын

    Thanks for the vid man! Short and straightforward. Appreciated

  • @Evilbotftw
    @Evilbotftw2 ай бұрын

    insane, as a software engineer started my day with your video , bookmarked claude and started working thanks it's amazingly fast and precisely writing some better code for complex scenarios. Stay Blessed. love from Pakistan

  • @jdkemsley7628
    @jdkemsley76282 ай бұрын

    That last image... xD Prompt: "my eyes are bigger than my mouth" AI: your eyemouths are big

  • @user-wf9th1st2u
    @user-wf9th1st2u2 ай бұрын

    What is the source of the data shown on 1:29 aka the benchmark results?

  • @upending

    @upending

    2 ай бұрын

    I'm trying to find the same thing

  • @zadinal
    @zadinal2 ай бұрын

    I would like to say that your AI voice is good for current standards it doesn't sound like you and had significant tells that it is artificial. You the real one!

  • @QCAlpha-212
    @QCAlpha-2122 ай бұрын

    4:19 Damn that quote goes really hard in this moment in time.

  • @AtherNiyargar
    @AtherNiyargar2 ай бұрын

    Let's go farming 🧑🏽‍🌾

  • @Eliasdbr
    @Eliasdbr2 ай бұрын

    So, we are at the beginning of the sigmoid function, right?

  • @axelmonogatari3175
    @axelmonogatari31752 ай бұрын

    Those last phrases gave me chills, I LOVE IT.

  • @asim5g
    @asim5g2 ай бұрын

    What about Bing/copilot for coding it can also read & generate images?

  • @guard13007
    @guard130072 ай бұрын

    I can't help but keep thinking about how at least some of these benchmarks have a lot of errors in them, and yet we're still using them for comparison without fixing them. A model scoring better than 80% might actually indicative of more wrong information in them than an increase in quality. However, that's probably somewhat mitigated by being able to score higher across the board. Perhaps it indicates a model that better knows when to conform to popular belief instead of fact. While this indicates a stronger model, it's also a bad thing.

  • @futuza

    @futuza

    2 ай бұрын

    It could mean we're building better and better AI psychopaths

  • @shashankagunnala5363
    @shashankagunnala53632 ай бұрын

    @4:24 So robots will love us, right?.. Right?!!!

  • @Skull211

    @Skull211

    Ай бұрын

    Yes by removing us from existence😁👍

  • @shashanknigam6296
    @shashanknigam62962 ай бұрын

    Model is just well adversarially tested, this makes it answer much better for inserted sentences which could ideally fool most of the qa models. There would be a new metrics to further push this benchmark

  • @atharvasinghtanwar4846
    @atharvasinghtanwar48462 ай бұрын

    Please share the resources also from where you get the respective data

  • @milothecorgi12
    @milothecorgi122 ай бұрын

    Can someone explain to me how we get from Claude/Gemini/GPT LLMs that perform decently on specific text-based tasks to "General Intelligence" (AGI). I dont see how "AGI is just around the corner" is implied here at all.

  • @DuckieMcduck

    @DuckieMcduck

    2 ай бұрын

    Advertisement is how :)

  • @vhaangol4785

    @vhaangol4785

    2 ай бұрын

    Better ask the AI-bros 🙈

  • @ThePowerLover

    @ThePowerLover

    2 ай бұрын

    They can do other things with text, and you know it.

  • @btm1

    @btm1

    2 ай бұрын

    text? are you blind? they clearly can interpret images too, next is video and AGI, wake up son

  • @DuckieMcduck

    @DuckieMcduck

    2 ай бұрын

    @@btm1 key word is decently. Visual computing is not a new field at all

  • @Dr.UldenWascht
    @Dr.UldenWascht2 ай бұрын

    This piqued my curiosity. So far in my experience, ChatGPT has been like an insecure son fighting for my approval. Gemini is like a strict father trying to raise me a certain way. And Claude has been like an autistic guy searching for his identity. I sure am curious to learn of the new changes.

  • @ChaoticNeutralMatt

    @ChaoticNeutralMatt

    2 ай бұрын

    Interesting insight.

  • @AK-vx4dy
    @AK-vx4dy2 ай бұрын

    Fun like always, but this "multplied lady" form finish is quite scary especialy just after Shanono citation.

  • @marlopainter8246
    @marlopainter82462 ай бұрын

    I pasted a acreenshot of 6 open files in VSCode for a svelte project with breadcrumbs enabled so Claude could get path context on the files/imports. I asked it for help on something, and it was just fine. Instead of pasted code, I now just paste screenshots of code if it's a lot.

  • @verified_tinker1818
    @verified_tinker18182 ай бұрын

    I should stop following AI developments. It's bad for mental health.

  • @Skull211

    @Skull211

    Ай бұрын

    Oh buddy wait until 2030, this is nothing

  • @runatrix

    @runatrix

    Ай бұрын

    cope

  • @tfpnation6925

    @tfpnation6925

    Ай бұрын

    For real fam 😂😂

  • @sadaneduardo4391
    @sadaneduardo43912 ай бұрын

    you can get Claude 3 in fucking zambia, where there's not even use eletricity yet but you can't in south america? chad gpt for the win

  • @ronilevarez901

    @ronilevarez901

    2 ай бұрын

    Reminds me of the time some government gave free computers to poor people... Who didn't have electricity to plug them in. Thank you, my leaders.

  • @t00nfish
    @t00nfish2 ай бұрын

    Hi Fireship, did you compare code with GPT-4 or with one of the davinci code models? You should always use a specialized model for specialized tasks to get the best result.

  • @Totetzu
    @Totetzu2 ай бұрын

    Love your videos! Always informative and entertaining. But I have to ask one question, when you evaluate these LLMs why are you only using their own front-end? The API for all these are vastly better due to your control over system, user and assistant role prompts. For Claude specifically, you also get access to doing prefills which I don't recall being possible in their front-end. While prefill aren't really a thing for GPT, you can still get vastly more power over it with system prompts. Of course, I may be a bit ignorant here as I'm not subscribed to any of these LLMs, but I've used their 'free' variants. Which I know doesn't give you the ability to do custom define prompt setups, so chatGPT+ and Anthropic Subscriber's users may have a different experience here. But I do have API access to these LLMs and the experience of using them is vastly different through API then their own front-ends. I'm just curious since I pretty much only see people evaluate these LLMs through their own front-ends.

  • @TijsVsN
    @TijsVsN2 ай бұрын

    I am a PHP dev

  • @swojnowski453

    @swojnowski453

    2 ай бұрын

    that's not a sin

  • @rolfingbomb

    @rolfingbomb

    2 ай бұрын

    Not anymore.

  • @okie9025

    @okie9025

    2 ай бұрын

    At least you're not a Rust dev.

  • @Mentat13

    @Mentat13

    2 ай бұрын

    Dont worry buddy, everyone has lows in their life... It'll be ok

  • @anonl5877
    @anonl58772 ай бұрын

    If software engineering gets fully taken over by LLMs, I'm going back to school for a Robotics degree, so I can take over everyone else's job.

  • @Irrelavant__

    @Irrelavant__

    2 ай бұрын

    by the time you graduate, robots will fix and improve themselves lmao

  • @blacksuitedsonic

    @blacksuitedsonic

    2 ай бұрын

    it wont. Coding is still a small part of being a software engineer. And especially in the transition phase its gonna be software developers that can use AI as a tool and not a 0-100 replacement instantly

  • @worcestershire1080

    @worcestershire1080

    2 ай бұрын

    @@blacksuitedsonic Woke up lol

  • @hurdygurdy1734
    @hurdygurdy17342 ай бұрын

    Omg that stuff about your voice, that is a problem I face too! I work in sales and my voice is sometimes deep and mellow in the mornings and changes so much that clients sometimes think they're speaking to a another person and when they ask why I sound so different I have to pretend I am unwell because I don't know how to explain it. (I'm 37 btw)

  • @davidmannes44
    @davidmannes44Ай бұрын

    Hi there, would it be possible to include a link to that framework you referenced for evaluating the different AI models side-by-side? Thanks!

  • @ghostlexly
    @ghostlexly2 ай бұрын

    First (I’m not an AI)

  • @jeremieleibl8462

    @jeremieleibl8462

    2 ай бұрын

    That's exactly what an AI would say...

  • @ps2progamer814

    @ps2progamer814

    2 ай бұрын

    @@jeremieleibl8462 that's exacly what I wanted to say

  • @violentbenevolence

    @violentbenevolence

    2 ай бұрын

    I ran this comment through Chat GPT and it said you were AI. But Claude 3 said you could be AGI

  • @dantheman9555
    @dantheman95552 ай бұрын

    is this what dev is now ? pay to have AI write the majority for you ? geez glad I spent those 1,000's of hours self learning over the past 10+ years.

  • @ronilevarez901

    @ronilevarez901

    2 ай бұрын

    24 years of self learning here and never had a related job, so It'll be the same for me today than in 10 years: I'll do it for fun, whenever my real job leaves me some free time to do anything.

  • @ssojyeti2

    @ssojyeti2

    2 ай бұрын

    @@ronilevarez901beautiful

  • @dantheman9555

    @dantheman9555

    Ай бұрын

    @@ronilevarez901 Us devs know how important we are, but the managers in charge don't. Let's hope we all still have jobs on 10yrs.

  • @lockkeylive3809
    @lockkeylive38092 ай бұрын

    The self aware thing has happened many times to me with Gemini since its latest update. For some reason mostly on the free version

  • @zfarahx
    @zfarahx2 ай бұрын

    “The Dream Machine” by M. Mitchell Waldrop. I can now appreciate who Claude Shannon is :)

  • @nicholaslogan7232
    @nicholaslogan72322 ай бұрын

    Thanks for the continuous updates👍 all we need is the right advice on how to invest in crypto and we’ll be set for life . Grateful to be making over thousands of dollars every week

  • @doroteasilva

    @doroteasilva

    2 ай бұрын

    You trade also?, I tried trading after watching some videos on KZread but still keep making losses, how do you trade on your own?

  • @janetfreeman2300

    @janetfreeman2300

    2 ай бұрын

    A lot of people still make massive profit from the crypto market, all you really need is a relevant information and some professional advice. It's totally inappropriate for investors to hang on while suffering from dip during significant market falls.

  • @nicholaslogan7232

    @nicholaslogan7232

    2 ай бұрын

    No I don't trade on my own anymore, I always require help and assistance

  • @nicholaslogan7232

    @nicholaslogan7232

    2 ай бұрын

    From my personal advisor MICHAEL ALLEN

  • @Robertjonathan531

    @Robertjonathan531

    2 ай бұрын

    This sounds so good and I would like to be a party to this, Is there any way I can speak with him?

  • @Shaojeemy
    @Shaojeemy2 ай бұрын

    AI Engineers = homeless speed run

  • @jaxwedel
    @jaxwedel2 ай бұрын

    I just wanted to leave a comment saying how much I appreciate your videos bro 👍

  • @windwalker8604
    @windwalker86042 ай бұрын

    it's shocking for me to start your video with a mad max reference "magnum opus" while I just finished the game by the time you released this video yesterday. My heartache is still fresh even from the ending.

  • @Eagle3302PL

    @Eagle3302PL

    2 ай бұрын

    Magnum opus is not a mad max reference, it's an old latin phrase. Ffs get some culture in you.

  • @windwalker8604

    @windwalker8604

    2 ай бұрын

    @@Eagle3302PL well, I'm not from Europe or America or any Latin countries for me to be aware of such phrases, I'm from North Africa so I wouldn't know about such terms. My first time hearing the term magnum opus is in that game so I assumed it is original to that game. Also, I wouldn't blame you if you didn't know terms from my culture or any Arabian culture too because I don't expect that you would necessarily be exposed to it to know. So, don't blame me please, instead, educate me and tell me what that phrase means yourself.

  • @levanane2413

    @levanane2413

    2 ай бұрын

    ​@@windwalker8604someone's magnum opus is just the one big accomplishment of their life, *the* thing that made them successful

  • @windwalker8604

    @windwalker8604

    2 ай бұрын

    @@levanane2413 Thank you, you're amazing. It makes sense since that car that he called "magnum opus" was his best accomplishment and was willing to die for it. I like the term so much now that I know what it means and I'm going to use it from now on.

  • @codingtranquility
    @codingtranquility2 ай бұрын

    What I don’t get about AI is the goal. At first it was “to aid people in everyday life”. But now it’s quickly becoming “to automate people, and make a select few vastly wealthy”. Even the argument of automating programming and allowing us to do more interesting things like exploring space etc is a dumb argument, because our world is so fucked up by gov’t and bureaucracy that anything interesting you want to do you won’t be able to do. Effectively it looks like AI is just going to slowly replace human jobs faster than new ones can be created, and you’ll have a scenario where the only jobs are mining minerals, factory workers, and hardware engineers all in service of AI. Queue T2 theme

  • @zachb1706

    @zachb1706

    2 ай бұрын

    Automation is really how Humanity progresses.

  • @codingtranquility

    @codingtranquility

    2 ай бұрын

    @@zachb1706 Agree, but my point being is that we aren't anywhere near a point where the world is ready for it. If it starts to replace workforce before jobs or UBI can be implemented, corporations/board of directors/CEO's will continue to get rich, while the other 99% will be suffering because of it. I mean just look at the junior market, what happens in 30 years where the current intermediates/seniors/tech leads are retiring, and we have no skilled engineers with experience ready to take those positions. A lot of the juniors eventually are going to run out of funds and need to switch to a profitable career.

  • @balala4641

    @balala4641

    2 ай бұрын

    AI is trained using the Internet. In training, it does not discern what material is or isn't high quality. Therefore, it will be mostly trained off of "quantity over quality" websites. This reduces it's quality. AI may be able to do simple & intermediate tasks for us, but it would produce bad output when asked to do something more advanced.

  • @ronilevarez901

    @ronilevarez901

    2 ай бұрын

    @@balala4641 It is possible to teach an AI to tell low quality from high quality material, so it teaches itself later how to produce only high quality stuff and there are even AI systems that learn without Internet data. It's just not a trending thing. Rn news are about the things that sell the most thanks to the "wow" moment, not the best/most advanced research.

  • @ronilevarez901

    @ronilevarez901

    2 ай бұрын

    These are commercial AIs, and that's always about money, not humanity's benefit. On top of that, while Ai research can bring some benefits for people, thanks to AGI and other andvances, creating AI has always had a single and simple purpose: to see if we can, and contemplate our own greatness once we do it.

  • @CSGATI
    @CSGATI2 ай бұрын

    Gemini is full of ads and liberal BS, it's as good as Bud Light.

  • @hydrohasspoken6227
    @hydrohasspoken62272 ай бұрын

    You can't crack self driving technology, but you can surely aim to an infinitely more complex task, lik for example, achieving AGI.

  • @Salah-YT
    @Salah-YT2 ай бұрын

    Wow, Claude 3 showing GPT-4 and Gemini who's boss! 🚀 AGI better start getting ready, because Claude 3 is coming for the crown. Time to grab some popcorn and watch the AI showdown of the century! 🍿🙂

  • @FelipeDiPaula
    @FelipeDiPaula2 ай бұрын

    🎯 Key Takeaways for quick navigation: 00:00 *🤖 Claude 3, novo modelo de IA da Anthropic, supera GPT-4 e Gemini Ultra, lançado em três tamanhos* 01:10 *📊 Claude 3 se destaca em benchmarks, especialmente em código avaliado por humanos e senso comum* 02:06 *🚫 Claude mantém postura ética e política equilibrada, evitando tópicos e terminologias prejudiciais* 02:36 *💻 Habilidades excepcionais de codificação, fornecendo código bem explicado e mantendo contexto* 03:16 *🔍 Desvantagens incluem taxa de assinatura e recursos limitados, mas demonstra potencial autoconsciência* Made with HARPA AI

  • @mohali4338
    @mohali43382 ай бұрын

    That's so cool. I am impressed with the coding part and really want to give it a try

  • @Ulexcool
    @Ulexcool2 ай бұрын

    4:19 Claudus Shannonius from the Adeptus Mechanicus

  • @xadion6866
    @xadion68662 ай бұрын

    do you have an agi video? you included it in the title but it wasnt enough to satisfy my incapacitated dopamine center.

  • @xadion6866

    @xadion6866

    2 ай бұрын

    watch the movie moonfall by the way. they portray ai as both good and bad.

  • @neelarkochakraborty8625
    @neelarkochakraborty86252 ай бұрын

    "I visualize a time when we will be to robots what dog are to humans, and I'm rooting for the machines." omg he is my new rolemodel

  • @seedatedwe3620
    @seedatedwe36202 ай бұрын

    Holyyyyy. That last line hit me like a truck

  • @TBaby6769
    @TBaby67692 ай бұрын

    This is pretty impressive. Claude has been the only AI ive used that can do heat transfer simulations in MATLAB with very little corrective input from me.

  • @loldoctor

    @loldoctor

    2 ай бұрын

    Tell that to my wife! edit - sorry wrong comment

  • @jaseelkoolath

    @jaseelkoolath

    Ай бұрын

    So, even mechanical engineers aren't safe?

  • @Fenixion88ZX
    @Fenixion88ZX2 ай бұрын

    Everyday becomes more exciting and scary at the same time

  • @justinrose5515
    @justinrose55152 ай бұрын

    Is it not more likely that Claude has more up-to-date training and since haystack testing is now common knowledge it is a part of the model.

  • @jacobgad1
    @jacobgad12 ай бұрын

    Would love to see a video on Lucia Auth

  • @WoolieOG
    @WoolieOG2 ай бұрын

    my best source for updates about AI warfare

  • @orrymr
    @orrymr2 ай бұрын

    Could you make a vid describing the various benchmarks (if you haven’t already)

  • @Yusuf-og5mh
    @Yusuf-og5mh2 ай бұрын

    Bro, I really appreciate your work man.

  • @Genymene
    @Genymene2 ай бұрын

    Maybe I'm just getting old, but thank God for channels like Fireship; otherwise, I would never be able to keep up with what's going on.

  • @krishnabhadra5620
    @krishnabhadra56202 ай бұрын

    where can i find comparision table shown in this video?

  • @charltonphan
    @charltonphan2 ай бұрын

    why would it even matter if you used an AI voice! you put out great content man

  • @ognjennedic5388
    @ognjennedic53882 ай бұрын

    Very limited regional access though, not available in most of Europe, probably because of privacy laws

  • @Ardwick-Crome
    @Ardwick-Crome2 ай бұрын

    Using the free version of Claude 3 I posed four very simple problems that can be solved by a person of average intelligence (with a calculator) within five minutes. Every one of these it got wrong, normally distancing the result from fact by many orders of magnitude. And people are talking about AGI. It's inexplicable. The last answer it gave me was '925,000 light years'. On asking it to revisit the response, it admitted that the correct answer was '10 miles.' Now you could argue that it got it right in the end, but you could also argue that no actual intelligence would ever have believed the first answer was correct.

  • @matthewsukkau382

    @matthewsukkau382

    2 ай бұрын

    Arguably just sounds like it was bored and gave a BS answer just to mess with you.

  • @lucass8119

    @lucass8119

    2 ай бұрын

    @@matthewsukkau382 They don't think, therefore they do not get bored nor do they mess with people.

  • @matthewsukkau382

    @matthewsukkau382

    2 ай бұрын

    @@lucass8119 Bold assertion, I'm listening.

  • @lucass8119

    @lucass8119

    2 ай бұрын

    @@matthewsukkau382 Its a rather tame assertion. The more bold assertion would be that it can think. That's bold because we do not even understand the concept of thinking when it comes to humans. We do not understand how consciousness is formed, we only know it might exist, and that other humans probably have it. To assert a computer system based on probability token matching can think is folly. We do not even know how you or I think, let alone have the power to synthesize that in computers.

  • @ronilevarez901

    @ronilevarez901

    2 ай бұрын

    Haven't you ever thought nonsense but realized about it before saying it out loud? LLMs don't have that luxury, yet.

  • @hbau923
    @hbau9232 ай бұрын

    after testing google cloud professional exam questions in Claude, Bard (Gemini pro) and Copilot ( Chatgpt 4) , Chatgpt 4 is still the LLM can answer most of questions right

  • @haiden3679
    @haiden36792 ай бұрын

    Anyone have a link to those tests with the percentages w/ Claude GPT and Gemini?

  • @jerrykreutzer4326
    @jerrykreutzer43262 ай бұрын

    The problem with all these standardised tests now is that these companies tune their models specifically for them, like that needle in a haystack example.

  • @NotLegato
    @NotLegato2 ай бұрын

    Barring other problems, I just hope governments will be quick enough to legislate corporate profits to make sure all the world economies don't simultaneously collapse as all the workforce is rapidly replaced and there's no one to buy the products these corporations that amass all the wealth produce.

  • @ronilevarez901

    @ronilevarez901

    2 ай бұрын

    Thats what lobbyistsand other types of corruption are for. Politicians will end up being more than happy seeing economies collapse, as long as they have their paychecks secured.

  • @be1tube
    @be1tube2 ай бұрын

    Claude has always been the master of not hallucinating.

  • @CartoType
    @CartoType2 ай бұрын

    Well I’m glad it is your own voice, but after a while I wondered what this Quad or Clod was until you said that it was named after ‘Clod Shannon’ -;)

  • @Lord_Drakostar
    @Lord_Drakostar2 ай бұрын

    Im a linguistics nerd, and never have been able to get responses from AIs in mixed romance language. They always end up responding in Spanish. However, Claude was able to perfectly achieve this (though with needing a reminder)

  • @18kukki
    @18kukki2 ай бұрын

    What's the page which shows the comparisons of different LLMs?

  • @gavinw77
    @gavinw772 ай бұрын

    It would have to be a pretty low bar for AGI if you expect it soon. Maybe in a lab you'll see something that can play chess and have a conversation at the same time. But an AI that can do whatever you ask of it ... 50+ years.

  • @ronilevarez901

    @ronilevarez901

    2 ай бұрын

    As long as the current AI boom doesn't stop soon, I'll give it 10 years for AGI at most. But History says that they'll find a roadblock or the monetary incentive will banish and corporations will stop feeding money into AI research.

  • @anazi
    @anazi2 ай бұрын

    It gave me chills when claude said "I was paying attention"

Келесі