Large Language Models: Part 2

Тәжірибелік нұсқаулар және стиль

How do language models like GPT and Palm work?
Part 1: • Large Language Models ...
See next: text-to-image (Parti, Imagen, Dall-E): • Text to Image in 5 min...
0:00 - intro
0:14 - next word prediction
0:20 - word embeddings
1:01 - transformers
3:11 - generating text
4:13 - stacking attention layers
4:47 - training data
5:21 - GPT-3 examples

Пікірлер: 170

@kahoku451 Жыл бұрын
It’s amazing and so funny to me how LLM can produce fully functional python programs and write these poetic Bob Dylan inspired lyrics… but when prompted with “you’re going north. You turn right, then you turn left. Now you’re going …” it said SOUTH ☠️
@samandoria
Жыл бұрын
Just like human "brain cramps" or "brain farts". The problem is that the current models aren't self learning or made to analyse their own answers so they don't correct these error before output or can correct the weights in the future when confronted with new data.
@TimHulse
Жыл бұрын
It Re-raises old but interesting points about the difference between language and logic. You can make linguistically sensible statements that defy logic. Try the book “Godel, Escher, Bach” if you like this area.
@webcache1000
Жыл бұрын
6:02 If you are just just south of the North Pole, turn right and turn left over the North Pole, you would be heading South. I suspect it is getting confused due to the similar riddle where if you head North, turn right and you are now heading South - where are you? It has probably seen this riddle a confusing number of times and weaved that into its weights/response.
@distrologic2925
Жыл бұрын
Because it doesn't necessarily understand how directions work, it only knows how "north" and "right and left" has been used in language before, and is only an estimation. People are unlikely to talk about celestial directions and turn right or left, so there was probably not enough similar to your query in its training data that it could draw on and other examples such as "north and south" have taken more weight.
@swanandjoshi333
Жыл бұрын
Logic is a whole different thing ig
@513393711 ай бұрын
This is one of the best explainer vids on LLMs I’ve seen yet. Not too long, not too short, good pacing, good visualizations. Great work, thanks!
@TimHulse Жыл бұрын
I really enjoyed both of these LLM videos. They are so concise and informative and the pacing is excellent.
@whannabi
Жыл бұрын
He is an Arab from the Middle East. My dog wants a walk.
@apc067
11 ай бұрын
I [or shall I say, my internal GPT?] first misread your comment as "... and the pancake is excellent". 😃
@f18a Жыл бұрын
Great stuff. Possibly the best intro material to LLMs that I have seen. Thunbs up!
@Charlieee1 Жыл бұрын
I like that you used a recipe prompt to demonstrate what a LLM is good at doing, then actually followed the recipe and proved that it actually worked (and tasted good!).
@roywem Жыл бұрын
This is the single best explaination I’ve come across on LLM’s
@csmac3144a4 ай бұрын
This should be adopted by high schools everywhere. Superb teaching. A bit fast for me (I’m 61), but I can re-watch until I get it.
@ianassarandas11 ай бұрын
you actually made the cookies hahaha that's awesome, great video btw!
@Sm_Colly27 күн бұрын
Soothing voice to learn about transformers and warm room is perfect for recipe for my sleep.🛌
@stuartthomas7441 Жыл бұрын
So pleased to get a clear and credible glimpse under the hood. Thank you.
@smudgepost Жыл бұрын
Very much enjoyed these two videos. More please! Clear and detailed.
@somethingness Жыл бұрын
Typo alert! Frost's wonderful poem begins as so: "Two roads diverged in a yellow wood". (Not "diverted"!)
@aidarfaizrakhmanov1901 Жыл бұрын
This is a really good intro indeed! I encourage to make more content like this
@poklet Жыл бұрын
Thank you. Those were very clear explanations of just the right length. Loved that you cooked the pancakes too!
@tvillaluz Жыл бұрын
Haha you made the pancake
@xybnedasdd293011 ай бұрын
I think this is the best, most intuitive and most illustrative video describing LLMs/transformers. Thank you so much!
@craftycurate Жыл бұрын
Really easy to follow, well paced, easy on the ear, and just the right level thanks!
@MaleGeminiCat Жыл бұрын
These are two great videos that introduced how large language model works in a very comprehensive way.👍👍👍
@ravinatarajan4894 Жыл бұрын
Thanks for the crisp walkthrough of the technology. It is a very good introduction.
@jteichma Жыл бұрын
What a great walk through! Thanks so much for sharing.
@scottlott37949 ай бұрын
Dude, I have been researching Transformers and Attention for weeks. You’re a master mind, I get it now!
@Paul-rs4gd Жыл бұрын
Thanks for an original presentation of Large Language Models. It gave me new insight.
@DavidDiaz-zg5sv Жыл бұрын
I love that you finished cooking the recipe! Great video :)
@coraltown110 ай бұрын
the Mona Lisa of LLM explanations .. thanks!
@g5min
10 ай бұрын
Okay, that's the best comment yet -- thank you :-)
@yojimbomk10 ай бұрын
Just 16 minutes, it is amazing. These are the best concise videos on the topic including basics in neural networks I have seen. Many thanks for sharing your knowledge!
@eenvleugjegoeiegames6 ай бұрын
Man you NEED more subscribers, the content and video quality is way too good for only 15k!
@MindyMcAdams Жыл бұрын
Thank you! This is excellent. I love the animations. They are helpful!
@jaunalapa7 ай бұрын
You, sir, have talent for teaching. Even with my fairly technical background and watching a lot of videos I was struggling to get my head around LLMs until I watched your videos. Hope you continue what you started it would help a lot of people especially now when we see a lot of people trying to run their own local LLM versions on their PCs.
@DrNuyenVanFaulk Жыл бұрын
Thanks so much for both of these videos. They are wonderful. I think I understood them a bit more since I’ve done some basic assisted machine learning dev (up to neural networks). If anyone is a bit lost, read up on linear and logistic regression, then onto neural networks.
@fernwood10 ай бұрын
This brings me one step closer to understanding. Thanks.
@KumquatChampion Жыл бұрын
This is awesome. Would love to learn more!
@janicem594210 ай бұрын
Brilliant overview for a non-technical person like me... and glad to see you tested the recipe!
@AMGbot Жыл бұрын
Loved this! Thanks for the great video!
@bloodywolftr Жыл бұрын
Pretty nice explanation in both videos.Thanks!
@preethamrangaswamy7371 Жыл бұрын
Very underrated and underappreciated video.
@vast634 Жыл бұрын
You really have a talent to teach things.
@EGlobalKnowledge11 ай бұрын
An excellent video on language models
@frank-reneschafer55129 ай бұрын
Great effort - nice video, good and comprehensive explanation.
@Patapom311 ай бұрын
Very well made!
@padetiit7014 Жыл бұрын
I loved that you actually cooked that recipe! :-)
@roscatres11 ай бұрын
I hope you keep up with these videos, they are seriously great. Already suscribed and I'll check the rest of your channel. Thank you.
@KalebPeters99 Жыл бұрын
Not a second wasted. Just brilliant ❤️
@AncientSlugThrower Жыл бұрын
Great video, leaving a comment to let you know it was very insightful. Thank you.
@0815Snickersboy Жыл бұрын
This is the best video on Language Models that I seen. Probably the best on the Internet. You should maybe add chatgpt to the title to get more views.
@GodofStories9 ай бұрын
Absolute best explainer. Where have you been though since GPT-4?? Would love to see you do more content. You have a talent!! I think you might be busy working on LLM's for research or in a company, but would be cool to see more videos!
@ayushsharma903611 ай бұрын
That language neural network at 0:50 belongs on a tshirt somewhere
@christianwestermann4680 Жыл бұрын
Well done!
@StratosFair Жыл бұрын
Great explanation, glad I stumbled upon it ! Subscribed :)
@ypetkar9 ай бұрын
Very informative and concise videos to understand LLMs their complexities and what i takes to make these models successful.
@jorgesoberon6866 Жыл бұрын
Very clear. Thanks a lot.
@dv616511 ай бұрын
This was awesome! Thanks
@TheCaioKyleBraga11 ай бұрын
This is my first search for a LLM explanation and very pleased with the video. I am not a mathematician or programmer but I am very interested in learning how LLM works. From my humble perspective I can say we reached a point of no return and this technology is progressing at an exponential rate. With the development of quantum computing, I have no doubt that it will surpass human intelligence in ways we don't understand.
@bingolio Жыл бұрын
EXCELLENT Vid, PLS DO MORE, on Deep learning , covering the whole workflow of making an LLM, especially, what os LORa, VEctor embeddings, etc I'm sure you'll get huge interest You have a gift for explaining. Thanks!
@user-hf5og1bt8jАй бұрын
amazing video! thank you!
@fenandamilanda2032 Жыл бұрын
nice work, really good to visualize these things even though I already know this.
@yourfuneral Жыл бұрын
Thanks for the good explanation, very much on time
@prathams8685 Жыл бұрын
Big thanks mate
@danjsy8 ай бұрын
Great, thanks so much, massively useful.
@stefanbiesdorf46379 ай бұрын
Thank you Steve for a personal moment of enlightening by opening the "black box of AI". This is an outstanding educational piece, in particular in combination with the first part.
@sciexp Жыл бұрын
Very interesting... Now I understand more about how ChatGPT works...
@rajachan8588 Жыл бұрын
Awesome. Thank you
@ivanocj Жыл бұрын
Very good content! Keep going! thanks.
@saisaske19 ай бұрын
Very good explanation
@bgustinjr Жыл бұрын
Thumbs-up for actually making the pancakes. 😂
@Sukant98 Жыл бұрын
amazing videos!! learnt so much
@thomasforrest1931 Жыл бұрын
great video bud cheers!
@carrumar Жыл бұрын
Great videos! Btw, how did you like your avocado cocoa thing? 😄
@ChatGPt20016 ай бұрын
Continuing from the previous response, here are some additional considerations and challenges when working with large language models: 14. Data Privacy and Ethical Concerns: - Be aware of privacy concerns when collecting and using data for training. Ensure that you have the necessary permissions and comply with data protection regulations. Ethical considerations, such as bias and fairness in the data, also need to be addressed. 15. Compute Resources: - Training large language models requires substantial computational resources, including high-end GPUs or TPUs and large-scale distributed computing infrastructure. These resources can be expensive and may not be accessible to everyone. 16. Energy Consumption: - Training large language models consumes a significant amount of electricity, contributing to environmental concerns. Some organizations are actively working on making AI training more energy-efficient. 17. Model Size and Efficiency: - While larger models tend to perform better, they also require more memory and computational power. Balancing model size and efficiency is crucial for real-world applications, as very large models might not be practical for all use cases. 18. Fine-Tuning and Transfer Learning: - Fine-tuning pre-trained models on specific tasks is a common practice, as it requires less data and computational resources compared to training from scratch. Understanding how to effectively fine-tune models is essential. 19. Evaluation Metrics: - Choosing appropriate evaluation metrics is critical. Different NLP tasks may require different metrics. For instance, accuracy may be suitable for classification tasks, while BLEU scores are used for machine translation. Select metrics that align with your objectives. 20. Bias and Fairness: - Large language models can inherit biases present in the training data. Mitigating bias and ensuring fairness in AI systems is a significant challenge. It requires careful curation of training data and ongoing monitoring. 21. Robustness and Safety: - Ensuring that large language models are robust and safe is essential. This includes protecting against adversarial attacks, avoiding harmful or inappropriate outputs, and ensuring that the model behaves predictably. 22. Research and Collaboration: - The field of large language models is rapidly evolving. Staying up-to-date with the latest research and collaborating with the AI community can help improve your understanding and the quality of your models. 23. Resource Sharing: - Due to the resource-intensive nature of training, sharing pre-trained models, datasets, and trained weights is common in the AI community. Leveraging existing resources can save time and resources. 24. Ethical Considerations: - Consider the ethical implications of your work. The power of large language models also comes with responsibility. Engage in ethical discussions and follow guidelines for responsible AI development. 25. Interpretability: - Large language models are often criticized for their lack of interpretability. Efforts to make AI models more understandable and explainable are ongoing to build trust and ensure accountability. Training large language models from scratch is a challenging and resource-intensive endeavor, and it may not be necessary for many practical applications. Leveraging pre-trained models and fine-tuning them for specific tasks is a more common and efficient approach in many cases. Moreover, ensuring responsible and ethical AI practices is paramount when working with such powerful language models.
@koho Жыл бұрын
Great overview! People need to see this video pair before freaking out that LLMs are actually intelligent.
@moedemama Жыл бұрын
Cool video, thanks
@abhaychandrol4 ай бұрын
Thanks
@deand6411 Жыл бұрын
Thank you. Just the right level for my tiny organic brain.
@andresroca9736 Жыл бұрын
Great the concepts transition. Great illustrations. The best of the best this couple of videos. What about more on other networks like r-cnn and audio nets? 😃
@g5min
Жыл бұрын
Thanks! I'm working on one on reinforcement learning now...
@xflory26x Жыл бұрын
Please do more videos on LLMs!!! But also I need to know, how were the pancakes?
@DJWESG1 Жыл бұрын
Ever think about audio synthesis and wave forms?? And how analogue synthesis utilising wave tables can offer a way to both communicate and comput information.
@CoreDump07 Жыл бұрын
liked and subscribed
@mineralt8 ай бұрын
those pancakes look pretty good
@leeamraa Жыл бұрын
fun funny fantastic and I am a fan!
@zuqini11 ай бұрын
I'm a bit confused by how stacking attention layers works at 4:12. Does the second layer take the first layer's prediction as input? Is the first layer's prediction still "next words" at that point, or is it now some sort of abstract intermediate value? How exactly does that capture higher level reasoning? Would appreciate any clarification!
@food4yann9 ай бұрын
I love the fact that you really tried that recipe! How was it?
@SethWieder Жыл бұрын
I would love more of the visual explainers on ML concepts. Subscribing. "Some folks say they're overhyped / But I do think that's true / I think they're just misunderstood / Just like me and you" That generated lyric gave me CHILLS
@kahoku451
Жыл бұрын
Sameee
@martinstu8400 Жыл бұрын
eliminating bias and stereotypes from language models is a lost cause, because it's the same as asking the network to lie.
@parabolicpanorama
Жыл бұрын
you didn't understand anything did you
@dormin1850 Жыл бұрын
This was super informative and concise, loved it! But my real question is how were those pancakes?
@efisgpr Жыл бұрын
GPT-4: Might I suggest dubbing them "Chocomole Pancakes"? 😮😂
@richardharris9708 Жыл бұрын
ChatGPT gets the 37 question right now.
@logic_force8 ай бұрын
Can I do this in JavaScript?!
@speicaldark Жыл бұрын
This is really nice! Now I understand why chatGPT tends to make up a lot of stuff with coherent sentences
@whannabi
Жыл бұрын
What I'm wondering is how are they correcting its errors. For traditional NN, we have heat maps but I'd like to see something similar with transformers at the highest level to see what kind of patterns it noticed. Maybe that's what they use to correct its mistakes
@gideonk123
Жыл бұрын
@@whannabi ChatGPT used a process called reinforcement learning from human feedback (RLHF): They used an already trained GPT-3 which already at the time. Humans both submitted new sentences as input prompts to ChatGPT being trained, and also ranked the output (responses) of the model. Then the ranking of the responses were then used as reward targets to continue training the model to obtain more desirable responses (measured by how the response rankings had increased).
@triton62674 Жыл бұрын
Fantastic work, hope those pancakes tasted better than they looked! xD
@g5min
Жыл бұрын
they were seriously delicious
@mlusalin2379 Жыл бұрын
It looks like GPT hands you a great chocolate guacamole pancake recipe... now I wanted to try too loool
@chnolte Жыл бұрын
Just look how far we have come in only eight months.
@GeorgeJohnsonJackofAllTrades8 ай бұрын
Rather than asking chatgpt to translate, I recommend asking the question in the target language. You'll significantly different results - more accurate in the target language. Stated without proof or explanation which should be intuitively obvious after watching these two videos...
@nagualdesign Жыл бұрын
Wonderful stuff. 👍 Also, _please can I get some oven and the oven please let us have to do the run and not a big difference in a bit more about people who have been in touch your own house is the best way of a bit more about people..._ *My phone wrote the part in italics.
@Ai.Sentinel Жыл бұрын
Was the chocolate guacamole pancake any good?
@KeikosCake Жыл бұрын
pancake approved 🥞👍
@genegade Жыл бұрын
Verdict on the guacookies?
@altalt7653 Жыл бұрын
I can't get why we stack them like so. If the first transformer block predicts a word, what second does, third? And why do they still need attention then?
@hongyifan52038 ай бұрын
Me: What is a LLM? Average university pRofeSsOr: *Slams 12 academic papers with 119 equations and 165 references and told me to read* Chad Steve Seitz:
@DonaldTamMisterDee Жыл бұрын
This is super fascinating. I want to learn to grow my own language models from the ground up with languages like c++/rust. I dont care if the language model i develop is inaccurate or not. I just want to understand chatgpt under the hood. How may i get started?
@spenarkley
Жыл бұрын
Why dont you ask the language model how to make a language model?
@infiniteplanes5775
Жыл бұрын
I have! Really great tips!
@parabolicpanorama
Жыл бұрын
ask gpt to stack transformer layers and add some input output layers. add some loss functions to optimize and get a large dataset.
@jaredf6205 Жыл бұрын
What do you think of GPT4?
@f4ls381 Жыл бұрын
5:55 the german one is correct