[ML News] Groq, Gemma, Sora, Gemini, and Air Canada's chatbot troubles

Ғылым және технология

Your dose of ML News!
OUTLINE:
0:00 - Intro
0:20 - Gemma & Gemini
3:40 - Groq
6:30 - Nvidia EOS Supercomputer
7:15 - Gpulist.ai
8:20 - Demis Hassabis on scale
10:10 - Hardware wars
12:05 - Sora
15:10 - Gemini 1.5 Pro & Long Context
18:45 - Air Canada must pay for chatbot mistake
23:30 - Giant Rat Balls
26:25 - Various News
References:
blog.google/technology/develo...
altryne/status/17...
paulg/status/1760...
groq.com/
mattshumer_/statu...
/ 1759483896322781584
wow.groq.com/news_press/groq-...
tianle_cai/status...
/ 1759728119005712837
/ 1759720197055791188
/ 1759704303810519271
/ 1759709223276228825
www.techpowerup.com/319172/nv...
andromeda.ai/
gpulist.ai/
archive.ph/G6POi
www.tomshardware.com/tech-ind...
futurism.com/the-byte/ai-dest...
_akhaliq/status/1...
_Borriss_/status/...
/ 1758650919430848991
tsarnick/status/1...
MartinNebelong/st...
OriolVinyalsML/st...
/ 1759804492919275555
mattshumer_/statu...
haoliuhl/status/1...
github.com/lucidrains/ring-at...
bc.ctvnews.ca/air-canada-s-ch...
arstechnica.com/tech-policy/2...
www.cbc.ca/news/canada/britis...
kareem_carr/statu...
arstechnica.com/science/2024/...
www.vice.com/en/article/4a389...
karpathy/status/1...
karpathy/status/1...
www.nature.com/articles/d4158...
/ 1757359611399532921
cohere.com/research/aya
OfirPress/status/...
github.com/mut-ex/gligen-gui
www.projectaria.com/datasets/...
StabilityAI/statu...
gordic_aleksa/sta...
huggingface.co/gordicaleksa/Y...
huggingface.co/datasets/nvidi...
interestingengineering.com/sc...
archive.ph/caW1Y#selection-49...
newatlas.com/robotics/seeing-...
os-copilot.github.io/
www.businessinsider.com/apple...
techcrunch.com/2024/02/16/ant...
archive.ph/Gbcgb
Links:
Homepage: ykilcher.com
Merch: ykilcher.com/merch
KZread: / yannickilcher
Twitter: / ykilcher
Discord: ykilcher.com/discord
LinkedIn: / ykilcher
If you want to support me, the best thing to do is to share out the content :)
If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: www.subscribestar.com/yannick...
Patreon: / yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n

Пікірлер: 141

@nartrab13 ай бұрын
I really like this video format about news. It is great to have someone with your great background and reputation to share his insights. Thank you.
@andybrice27113 ай бұрын
_"I think I'm seeing colour…I'm actually not so sure now."_ - Guy Using Computer Wearing Sunglasses
@oscarmoxon1023 ай бұрын
"You can redefine AGI and 'something else' as you wish, and then you can always find something else that is still wrong so that you're still correct. If you do that for long enough, your name will become Gary Marcus."
@segelmark
2 ай бұрын
9:07 😂
@fteoOpty643 ай бұрын
"You do it long enough, you become Gary Marcus "...made my day!. 😊😅
@BinarySplit3 ай бұрын
RingAttention is not an approximation. It's true quadratic attention with massive parallelization. IIRC, from the paper, training the WMT model took roughly 1k TPU-minutes per 1M token training sample. I think it was 4M tokens per batch, ~8 minutes per batch, 512 TPUs. Insane.
@pablowentscobar3 ай бұрын
I really like this [ML News] bit. You should try and do one of these weekly. I'm not in computing or IT industries, or really even hobby but I like to try to stay informed on the technologies and I found you explain things very well so I get notifications on your vids. I don't watch all of them but some catch my attention. This "news roundup" format is really good. Thanks!
@spooky_action3 ай бұрын
Surprised you didn't mention anything about the new 1.58 bit quantizing paper, lot of great potential in this
@JurekOK
3 ай бұрын
not really . . . ternary numbers have been around since ever, and they did not took off. Silicon works better with binary.
@sorvex9
3 ай бұрын
Because it's dumb for anyone who has ever done quantization on a transformer.
@clray1233 ай бұрын
Very happy that Andrej Karpathy no longer working for OpenAI. A clever, nice person and a great teacher; very much unlike in character like the company which now pulls all the strings within that "non-profit" "startup".
@LukaszWiklendt3 ай бұрын
10^25 flops limit, no worries, just use integer operations only.
@appletree6741
3 ай бұрын
😂😂
@i91693453 ай бұрын
Even if the chatbot was a legal entity, separate of Air Canada, since it is operating as an agent of Air Canada, the end user would still sue Air Canada and Air Canada would be responsible. It would then be up to Air Canada to sue to the chatbot for damages. It works the same way with subcontractors and similar organizational structures. I am looking forward to the Air Canada v. Chatbot law suit.
@useodyseeorbitchute9450
3 ай бұрын
Air Canada may win with Chatbot. The court system is discriminating against LLM and does not accept argumentation based on list of hallucinated precedences...
@i9169345
3 ай бұрын
@@useodyseeorbitchute9450 I'd imagine the chatbot would be it's own lawyer too which is never a good idea.
@franzwollang
3 ай бұрын
@@i9169345 Hilariously, it might even win. I heard about some models getting 90th percentile on the American Bar exam.
@Alilinpow23 ай бұрын
Amazing as always. Thanks for your videos Yannic, both ML News and Paper video formats are incredibly awesome.
@pablowentscobar3 ай бұрын
Elderly politicians who don't understand how to program the alarm clock on their coffee maker, creating laws about bleeding edge technologies that no one really completely understands, is brilliant.
@Jbones20003 ай бұрын
These news videos are amazing. Please do them more! Ide love to see this become a weekly series.
@pablowentscobar
3 ай бұрын
Agree.
@EdFormer
3 ай бұрын
I also want more, but weekly is expecting a lot from someone who actually works on machine learning (as opposed to those churning out videos on AI who clearly don't, which becomes obvious to anyone who does when they fail at discussing something technical - it's much better to stick to those who do). The series used to be roughly monthly and that, along with his deep dives into particularly important papers, made this my favourite channel on the tube by some margin.
@Jbones2000
3 ай бұрын
@@EdFormer good point. The main reason i love this series is because yannick actually knows what hes talking about. I prefer high quality news updates when there are relevant updates to give, rather than more frequent low quality updates.
@NoNameAtAll23 ай бұрын
another episode of my favourite series! thank you
@ScibbieGames3 ай бұрын
Air Canada really told the court to take it up with the bot 😂
@michaelbondarenko46503 ай бұрын
Will be waiting for the ring attention video!
@coldlyanalytical13513 ай бұрын
Thanks for the Groq memory analysis.
@NLPprompter3 ай бұрын
OPEN AI = CLOSED AI
@antonioILbig3 ай бұрын
Evaluating black-and-whiteness of an image using black glasses was the best idea 😂
@clray1233 ай бұрын
I smell a future business model - convince a bank teller chatbot to tell you that the bank owes you money, then sue.
@wojtekskaba70313 ай бұрын
Commenting on the FLOPS limit, I spent a large portion of my programming life doing systems that have been mimicking floating point calculations with integer arithmetic - for speed, in the lack of the FPUs then. A reminiscence of that is multiplying by 355 and then dividing by 113, an equivalent of 6-decimal digit precision multiplying by Pi. Curious of creating an integer arithmetic transformer and LLM.
@El_Snubbe3 ай бұрын
As far as I can tell from the 'technical report' the compute for Sora refers to the compute that went into training the model, not the compute used for the generation.
@unvergebeneid2 ай бұрын
The arbitrary threshold in the EU AI law is the result of France lobbying on behalf of Mistral. That being said, laws are always full of arbitrary numbers and one could argue have to be. Take age limits for various things.
@Veptis3 ай бұрын
Qualcomm has a card called Cloud AI100 (yeah, stupid name). But their "ultra" variant is 128GB in a single 150W card using LPDDR5. And it might be half as fast as a single H100 for int8... Which sounds exactly like the one I need for my workstation. The nvidia eos system trained the new Starcoder2 model. using like 430 H100 (also new the stack v2 release)
@clray1233 ай бұрын
You can already your bind user-defined scripts to user-defined voice commands using Talon...
@leumas_tai3 ай бұрын
Really love this format ❤
@petevenuti73553 ай бұрын
How do you distribute these models? I want to be able to use my 512mb (X10) usefully. Whats a asymmetrical distributed architecture?
@nichevo3 ай бұрын
Thanks for ML News, Yannic!
@fiartruck01252 ай бұрын
ChatGPT: """ Absolutely! There are always some risks, here are a few: 1. Proximity to sensitive eye tissue [...] 2. Potential Allergic Reactions [...] [...] """
@pariscatblue3 ай бұрын
Thanks, funny and inspiring.. as usual 🙂
@AntonMilan3 ай бұрын
Does Sora compute refer to inference? My first impression was that it was about compute that went into training.
@DamianReloaded3 ай бұрын
_Progress in hardware has followed an amazingly steady curve in the last few decades. Based on this trend, I believe that the creation of greater-than-humans intelligence will occur during the next thirty years. I will be surprised if this event occurs before 2005 or after 2030_ - Vernor Vinge 1993 - "Technological Singularity"
@6AxisSage3 ай бұрын
One correction, sora isnt a single shot clip, it can do multiple shots and angles, see the wool helmet scifi clip
@DanFrederiksen3 ай бұрын
And the blind don't need a robot dog. That's dumb. They can have a pair of glasses with cameras in and AI that tell them what the pertinent situation is. the dog form is quite needless. a blind person could become quite fully functioning with modest AI. So much so that if you weren't paying close attention you wouldn't know they are blind.
@CodingBrainTeaser3 ай бұрын
Who is eagerly waiting for Sora?
@9thebear
3 ай бұрын
I’m waiting for the open source version that isn’t woke lobotomised.
@kiunthmo3 ай бұрын
Groq is trying to do what Graphcore already failed to do.
@catsaddictedtofish3 ай бұрын
If anyone would decide to adopt Groq/Graphcore, the spatial computing of SRAM-Based Computing, to run application such as LLaMA-2 70B, it would be mandatory to acquire more than hundreds of cards to have sufficient allocation for weight stationary and KV Cache. and it would be implausible to achieve latency at optimal ms/token to do the tensor parallerism.
@nathansimons21383 ай бұрын
11:48 I don't think I've ever seen Yudkowsky's statement about bombing datacenters *not* be taken out of context. The actual context is this: *if* humanity were to put in immense efforts to prevent the creation of AGI or ASI, then bombing datacenters would be among the many things that would be done - "be willing to destroy a rogue datacenter by airstrike" is always described as "Yudkowksy wants to bomb existing datacenters, as in right now."
@sychuan3729
3 ай бұрын
It is still bizzare bullshit
@9thebear
3 ай бұрын
Bruh
@andybrice27113 ай бұрын
I can see a bright future for renting out GPU clock cycles and using the hardware to heat water and buildings.
@jamesnguyen173 ай бұрын
The cool thing about Groq is that they are still using 9nm chips, which is ancient technology at this point. They signed a deal with Samsung (official) already, so let see how much the performance can be when they started using 5nm by the end of this year.
@robmacl72 ай бұрын
I think the obvious risk of the eyelash robot is that it is going to spaz out and drive those tweezer spikes thru your eyeball and into your brain.
@florianhonicke54483 ай бұрын
Yeah, another monday!
@collins43593 ай бұрын
you should do a video testing gemini when you get access
@Shmeks3 ай бұрын
Good stuff, appreciate not having "ai" in the title and mocking of Yudkowsky
@jamesesparza6893
3 ай бұрын
Nobody cares
@GormBraarvig2 ай бұрын
Thank you for your "AI Act" comments, I have tried fighting it with no success
@GNARGNARHEAD3 ай бұрын
what kinds of applications do small models have?
@JurekOK
3 ай бұрын
* Toys * you can fine-tune them and see that your fine tuning works * general experimentations * promoting of your ecosystem and general warm fuzzy feeling of "we give it out for FREEEE!"
@GNARGNARHEAD
3 ай бұрын
@@JurekOK yeah those are good applications. i was thinking about it later on, you'd probably have pretty good luck using them to categorise.. which might sound trivial, but, like the toys application you mentioned, you could break down unstructured text into data points, !
@fitybux46643 ай бұрын
20:00 To be fair, the chatbot that the lawyer used said this would be a good argument. To say that the chatbot they deployed was a different legal entity. Some sort of chatbot-inception? 😆
@EdFormer3 ай бұрын
Yesssss ML News is back 🙌
@andybaldman3 ай бұрын
Gary is still right.
@proterotype2 ай бұрын
Yep, ring attention video please
@shivammangale44133 ай бұрын
Why is 7B chosen? Many models are released with the 7B variant. Is there a reasoning behind it?
@Houshalter
3 ай бұрын
I was told the sizes are chosen to optimally fit the GPU's memory.
@snarkyboojum
3 ай бұрын
Because at 16 bit precision, you can fit these 7B parameter models into about 14GB of memory so GPU cards with 16GB can host the model to perform inference.
@JorgetePanete3 ай бұрын
*Potential and limitations* While this long video helps the user gain information, an overload may cause a headache, or aversion to science in case of user religiousness, we hope this research can further improve the search of a cure to religiousness to prevent misuse.
@useodyseeorbitchute9450
3 ай бұрын
I thought that people following woke religion are freaking the most seeing results of machine learning, as such simple algorithms are hard to shame into not noticing some patterns. There is effective whole field of fighting with "bias" which is supposed to protect adherents of that particular religion from detecting patterns that would make them deeply unhappy.
@switzerland3 ай бұрын
Can you do a deep dive on AI accelerating AI training? AI helping on improving datasets? I got the feeling that's driving massive additional growth
@siquod2 ай бұрын
My worry about that eyelash robot is that its manipulators look very … stabby
@HoriaCristescu3 ай бұрын
Sounds like that article was written by LLM, they always find something bad to say to balance the response out.
@MarcAyouni3 ай бұрын
"If you do this long enough, your name will be gary marcus" 🤣
@rootthree94363 ай бұрын
ring attention is exact
@harriehausenman86233 ай бұрын
Hold on a sec… Why do I have to watch MediaMarkt and Kärcher commercials in the background? WTF?
@clray1233 ай бұрын
I think the problem with Sora may be that it can only generate animals, landscapes, human faces, scifi neon pictures, waifus in scifi neon landscapes playing with animals and not much else. Edit: I forgot, it can also do teddy bears and streetviews.
@crassflam88303 ай бұрын
"Don't Panic Yannic" at it again!
@charliesteiner2334
3 ай бұрын
Except about regulations on training ML models to dox people. Then panic.
@DefaultFlameАй бұрын
Ah, Eliezer Yudkowsky, the other side to Yann Lecun on the pessimist coin. If we listened to either we would not have current AI technology.
@gileneusz3 ай бұрын
20:28 🤣🤣🤣 -> 😭
@fai8t3 ай бұрын
dimiz hassabiz Demi as Demi Levato Hassabee as is Bee 🐝
@jcrobin19913 ай бұрын
I remember GraphCore achieved 1000T/s years ago and no one is talking about it today😂😂😂
@triplea657aaa3 ай бұрын
Ring attention isn't *that* new... I read the paper back in November
@fitybux46643 ай бұрын
24:00 To be fair, this only happens IN RATS. 😆
@valdisgerasymiak14033 ай бұрын
"The height of persistence is to enter the wrong password until the computer agrees" Now you can attack LLMs until they agree that company owes you money :)
@jsalsman3 ай бұрын
Gemini : Gemma :: Yannic : ???
@yurcchello
3 ай бұрын
GeminAIc
@Carl-md8pc3 ай бұрын
LLM already attempting to rewrite recent history.
@jondo76803 ай бұрын
We need LPU chips in all Android devices
@Rockyzach883 ай бұрын
I remember when people were renting GPUs for bitcoin mining. Wonder if that's still a thing. E: Yes, it still is.
@fai8t3 ай бұрын
21:45 how come? who is would retain a lawyer for 10-20k Just to get a 600$ refund?!?
@sofia.eris.bauhaus2 ай бұрын
someone's gotta tell Eliezer about the risk of eye infections from fake eyelashes. i think humanity has about 1% chance of surviving that 🤔.
@herp_derpingson3 ай бұрын
Oh dear, that fake eyelashes gives me nightmares. What if something NaNs out and the robot just drives its forceps straight into your eyes?
@andybrice2711
3 ай бұрын
That was my first thought. I would need to see strong safeguards before letting a CNC machine near my eyes. Maybe it has a hard end-stop though.
@herp_derpingson
3 ай бұрын
@@andybrice2711There is nothing that can convince me to put that thing anywhere near my eye.
@clray123
3 ай бұрын
The "person" who commented about risk potential for the article while ignoring the possiblity of poked out eyeballs was probably ChatGPT.
@alisheramantay3 ай бұрын
Love how Yannic has no idea how black&white videos look like
@MustacheMerlin2 ай бұрын
Pff no shit Groq is so fast if it's running on pure _sram??_ That's insane. That's the on-die cache memory your computer only has a tiny bit of because it's ridiculously expensive and near impossible to scale up. Yeah, my computer would run just about anything stupidly fast if I just dumped all the ram and replaced it all with raw CPU cache...
@dylan_curious3 ай бұрын
I wonder if the Air Canada thing sets a precedent where companies have to be responsible for things that chat bots say.
@jeremykothe28473 ай бұрын
I'll do it for 750B. In advance of course, to my swiss bank account.
@BoatRocker6193 ай бұрын
Massive layoffs, massive reduction in available jobs, skills becoming obsolete
@yueguifan3 ай бұрын
23:26 there’ll be a new wave of tiktok ai chatbot hacking influencers
@clray1233 ай бұрын
Who cares about unrealistic rat balls, just look at all the cute whiskers! P.S. The editors have probably used AI to green light the paper.
@oculusisnevesis50792 ай бұрын
Says a lot about Air Canada and their integrity! Not surprising
@Rhannmah3 ай бұрын
8:55 i agree and also disagree with Hassabis here. I agree that there's still so much space to be explored in the optimization problem in ML and that searching for better optimization is extremely useful. But i'm not sure i agree that there is absolutely has to be something else than scale to get to AGI. Maybe that the current methods are sufficient. In the last 5 years we have seen a complete paradigm shift with transformers only through their scale, so it could be that other types of emergent behavior also can come through scale.
@mennovanlavieren3885
3 ай бұрын
But in any case that general problem solving was achieved some other component in the total architecture provided some iteration over steps.Which makes sense, you'll need time to think and explore different paths in the problem space. You cannot just waterfall your way to the best answer. Only for simple problems where you'll need at most, say, 3 iterations to get to the right conclusion, the raw LLM approach works.
@woolfel3 ай бұрын
Eliezer has managed to convince people he knows something, but based on the interviews i've seen, he doesn't know shit. he keeps talking about "the ghost' in the model because he loves the anime ghost in the shell. People should stop listening to him.
@andybrice2711
3 ай бұрын
I do think there are legitimate concerns about the risk of Artificial Super-Intelligence. But yeah, his beliefs don't seem grounded in reality. Like his speculation that ChatGPT might be sentient, when it's basically just a stateless word-calculator.
@clray123
3 ай бұрын
Mostly he has managed to convince himself that he is very smart.
@scharlesworth93
3 ай бұрын
His only achievement is writing 1000 page Harry Potter fan fiction. Dude’s a fraud.
@MustacheMerlin2 ай бұрын
I really think that user generated content should not be considered the intellectual property of the social media platform it's posted to. If I post a video to KZread that plays Micheal Jackson's smooth criminal in the background, that does not magically transfer the IP rights of Micheal Jackson's song to Google. Even if Micheal Jackson posted his song to youtube, that still does not make Smooth Criminal Google's IP. Reddit does not (and SHOULD not) own the copyright to content that I post to their website, I own the copyright to works created by me, user generated IP is user owned IP. How much money you spend on being a web hosting service does not factor into the IP equation.
@maximilianmander24713 ай бұрын
Hey ChatGPT tell me a joke! ChatGPT: "Gemini"
@wurstelei13563 ай бұрын
OMG, you kill a basket full of young cute kittens if I don't get a free ride...
@memegazer2 ай бұрын
lol..."You are not going to get more from scale" Yeah...ok...who is deciding how to measure "more" This whole industry is so full of shit for all the wrong reasons
@dr-maybe3 ай бұрын
Ehm yannick are you aware that the top 3 most cited AI scientists (Geoffrey Hinton, Yoshua Bengio, Ilya Sutskever) and virtually all AI company CEOs actually agree that this stuff poses an existential threat? Seems a bit naive to giggle and just skip past this, time to take the countless warnings seriously IMO. Otherwise nice vid, like the format!
@samanthaqiu3416
3 ай бұрын
boo hoo muh existential threats (to zionism monetary hegemony)
@HUEHUEUHEPony
3 ай бұрын
Yes but they are going to solve the problem if you give them money
@useodyseeorbitchute9450
3 ай бұрын
@@samanthaqiu3416 Well, to those players as well...
@useodyseeorbitchute9450
3 ай бұрын
We have quite a few genuine risk... and bunch of people who try to become celebrities by giving cryptic and totally worthless warnings.
@mennovanlavieren3885
3 ай бұрын
The existence of risk is not in doubt, but those laws are not going to do anything about it. In fact, to destroy humanity the AI needs to control considerabele resources. And the motives of a rouge AI and a big company or an oppressive government are well aligned. So giving privileged access to the technology to big actors while depriving the individual of the same power is only increasing the risks.