Large Language Models: Part 2

Тәжірибелік нұсқаулар және стиль

How do language models like GPT and Palm work?
Part 1: • Large Language Models ...
See next: text-to-image (Parti, Imagen, Dall-E): • Text to Image in 5 min...
0:00 - intro
0:14 - next word prediction
0:20 - word embeddings
1:01 - transformers
3:11 - generating text
4:13 - stacking attention layers
4:47 - training data
5:21 - GPT-3 examples

Пікірлер: 170

  • @kahoku451
    @kahoku451 Жыл бұрын

    It’s amazing and so funny to me how LLM can produce fully functional python programs and write these poetic Bob Dylan inspired lyrics… but when prompted with “you’re going north. You turn right, then you turn left. Now you’re going …” it said SOUTH ☠️

  • @samandoria

    @samandoria

    Жыл бұрын

    Just like human "brain cramps" or "brain farts". The problem is that the current models aren't self learning or made to analyse their own answers so they don't correct these error before output or can correct the weights in the future when confronted with new data.

  • @TimHulse

    @TimHulse

    Жыл бұрын

    It Re-raises old but interesting points about the difference between language and logic. You can make linguistically sensible statements that defy logic. Try the book “Godel, Escher, Bach” if you like this area.

  • @webcache1000

    @webcache1000

    Жыл бұрын

    6:02 If you are just just south of the North Pole, turn right and turn left over the North Pole, you would be heading South. I suspect it is getting confused due to the similar riddle where if you head North, turn right and you are now heading South - where are you? It has probably seen this riddle a confusing number of times and weaved that into its weights/response.

  • @distrologic2925

    @distrologic2925

    Жыл бұрын

    Because it doesn't necessarily understand how directions work, it only knows how "north" and "right and left" has been used in language before, and is only an estimation. People are unlikely to talk about celestial directions and turn right or left, so there was probably not enough similar to your query in its training data that it could draw on and other examples such as "north and south" have taken more weight.

  • @swanandjoshi333

    @swanandjoshi333

    Жыл бұрын

    Logic is a whole different thing ig

  • @5133937
    @513393711 ай бұрын

    This is one of the best explainer vids on LLMs I’ve seen yet. Not too long, not too short, good pacing, good visualizations. Great work, thanks!

  • @TimHulse
    @TimHulse Жыл бұрын

    I really enjoyed both of these LLM videos. They are so concise and informative and the pacing is excellent.

  • @whannabi

    @whannabi

    Жыл бұрын

    He is an Arab from the Middle East. My dog wants a walk.

  • @apc067

    @apc067

    11 ай бұрын

    I [or shall I say, my internal GPT?] first misread your comment as "... and the pancake is excellent". 😃

  • @f18a
    @f18a Жыл бұрын

    Great stuff. Possibly the best intro material to LLMs that I have seen. Thunbs up!

  • @Charlieee1
    @Charlieee1 Жыл бұрын

    I like that you used a recipe prompt to demonstrate what a LLM is good at doing, then actually followed the recipe and proved that it actually worked (and tasted good!).

  • @roywem
    @roywem Жыл бұрын

    This is the single best explaination I’ve come across on LLM’s

  • @csmac3144a
    @csmac3144a4 ай бұрын

    This should be adopted by high schools everywhere. Superb teaching. A bit fast for me (I’m 61), but I can re-watch until I get it.

  • @ianassarandas
    @ianassarandas11 ай бұрын

    you actually made the cookies hahaha that's awesome, great video btw!

  • @Sm_Colly
    @Sm_Colly27 күн бұрын

    Soothing voice to learn about transformers and warm room is perfect for recipe for my sleep.🛌

  • @stuartthomas7441
    @stuartthomas7441 Жыл бұрын

    So pleased to get a clear and credible glimpse under the hood. Thank you.

  • @smudgepost
    @smudgepost Жыл бұрын

    Very much enjoyed these two videos. More please! Clear and detailed.

  • @somethingness
    @somethingness Жыл бұрын

    Typo alert! Frost's wonderful poem begins as so: "Two roads diverged in a yellow wood". (Not "diverted"!)

  • @aidarfaizrakhmanov1901
    @aidarfaizrakhmanov1901 Жыл бұрын

    This is a really good intro indeed! I encourage to make more content like this

  • @poklet
    @poklet Жыл бұрын

    Thank you. Those were very clear explanations of just the right length. Loved that you cooked the pancakes too!

  • @tvillaluz
    @tvillaluz Жыл бұрын

    Haha you made the pancake

  • @xybnedasdd2930
    @xybnedasdd293011 ай бұрын

    I think this is the best, most intuitive and most illustrative video describing LLMs/transformers. Thank you so much!

  • @craftycurate
    @craftycurate Жыл бұрын

    Really easy to follow, well paced, easy on the ear, and just the right level thanks!

  • @MaleGeminiCat
    @MaleGeminiCat Жыл бұрын

    These are two great videos that introduced how large language model works in a very comprehensive way.👍👍👍

  • @ravinatarajan4894
    @ravinatarajan4894 Жыл бұрын

    Thanks for the crisp walkthrough of the technology. It is a very good introduction.

  • @jteichma
    @jteichma Жыл бұрын

    What a great walk through! Thanks so much for sharing.

  • @scottlott3794
    @scottlott37949 ай бұрын

    Dude, I have been researching Transformers and Attention for weeks. You’re a master mind, I get it now!

  • @Paul-rs4gd
    @Paul-rs4gd Жыл бұрын

    Thanks for an original presentation of Large Language Models. It gave me new insight.

  • @DavidDiaz-zg5sv
    @DavidDiaz-zg5sv Жыл бұрын

    I love that you finished cooking the recipe! Great video :)

  • @coraltown1
    @coraltown110 ай бұрын

    the Mona Lisa of LLM explanations .. thanks!

  • @g5min

    @g5min

    10 ай бұрын

    Okay, that's the best comment yet -- thank you :-)

  • @yojimbomk
    @yojimbomk10 ай бұрын

    Just 16 minutes, it is amazing. These are the best concise videos on the topic including basics in neural networks I have seen. Many thanks for sharing your knowledge!

  • @eenvleugjegoeiegames
    @eenvleugjegoeiegames6 ай бұрын

    Man you NEED more subscribers, the content and video quality is way too good for only 15k!

  • @MindyMcAdams
    @MindyMcAdams Жыл бұрын

    Thank you! This is excellent. I love the animations. They are helpful!

  • @jaunalapa
    @jaunalapa7 ай бұрын

    You, sir, have talent for teaching. Even with my fairly technical background and watching a lot of videos I was struggling to get my head around LLMs until I watched your videos. Hope you continue what you started it would help a lot of people especially now when we see a lot of people trying to run their own local LLM versions on their PCs.

  • @DrNuyenVanFaulk
    @DrNuyenVanFaulk Жыл бұрын

    Thanks so much for both of these videos. They are wonderful. I think I understood them a bit more since I’ve done some basic assisted machine learning dev (up to neural networks). If anyone is a bit lost, read up on linear and logistic regression, then onto neural networks.

  • @fernwood
    @fernwood10 ай бұрын

    This brings me one step closer to understanding. Thanks.

  • @KumquatChampion
    @KumquatChampion Жыл бұрын

    This is awesome. Would love to learn more!

  • @janicem5942
    @janicem594210 ай бұрын

    Brilliant overview for a non-technical person like me... and glad to see you tested the recipe!

  • @AMGbot
    @AMGbot Жыл бұрын

    Loved this! Thanks for the great video!

  • @bloodywolftr
    @bloodywolftr Жыл бұрын

    Pretty nice explanation in both videos.Thanks!

  • @preethamrangaswamy7371
    @preethamrangaswamy7371 Жыл бұрын

    Very underrated and underappreciated video.

  • @vast634
    @vast634 Жыл бұрын

    You really have a talent to teach things.

  • @EGlobalKnowledge
    @EGlobalKnowledge11 ай бұрын

    An excellent video on language models

  • @frank-reneschafer5512
    @frank-reneschafer55129 ай бұрын

    Great effort - nice video, good and comprehensive explanation.

  • @Patapom3
    @Patapom311 ай бұрын

    Very well made!

  • @padetiit7014
    @padetiit7014 Жыл бұрын

    I loved that you actually cooked that recipe! :-)

  • @roscatres
    @roscatres11 ай бұрын

    I hope you keep up with these videos, they are seriously great. Already suscribed and I'll check the rest of your channel. Thank you.

  • @KalebPeters99
    @KalebPeters99 Жыл бұрын

    Not a second wasted. Just brilliant ❤️

  • @AncientSlugThrower
    @AncientSlugThrower Жыл бұрын

    Great video, leaving a comment to let you know it was very insightful. Thank you.

  • @0815Snickersboy
    @0815Snickersboy Жыл бұрын

    This is the best video on Language Models that I seen. Probably the best on the Internet. You should maybe add chatgpt to the title to get more views.

  • @GodofStories
    @GodofStories9 ай бұрын

    Absolute best explainer. Where have you been though since GPT-4?? Would love to see you do more content. You have a talent!! I think you might be busy working on LLM's for research or in a company, but would be cool to see more videos!

  • @ayushsharma9036
    @ayushsharma903611 ай бұрын

    That language neural network at 0:50 belongs on a tshirt somewhere

  • @christianwestermann4680
    @christianwestermann4680 Жыл бұрын

    Well done!

  • @StratosFair
    @StratosFair Жыл бұрын

    Great explanation, glad I stumbled upon it ! Subscribed :)

  • @ypetkar
    @ypetkar9 ай бұрын

    Very informative and concise videos to understand LLMs their complexities and what i takes to make these models successful.

  • @jorgesoberon6866
    @jorgesoberon6866 Жыл бұрын

    Very clear. Thanks a lot.

  • @dv6165
    @dv616511 ай бұрын

    This was awesome! Thanks

  • @TheCaioKyleBraga
    @TheCaioKyleBraga11 ай бұрын

    This is my first search for a LLM explanation and very pleased with the video. I am not a mathematician or programmer but I am very interested in learning how LLM works. From my humble perspective I can say we reached a point of no return and this technology is progressing at an exponential rate. With the development of quantum computing, I have no doubt that it will surpass human intelligence in ways we don't understand.

  • @bingolio
    @bingolio Жыл бұрын

    EXCELLENT Vid, PLS DO MORE, on Deep learning , covering the whole workflow of making an LLM, especially, what os LORa, VEctor embeddings, etc I'm sure you'll get huge interest You have a gift for explaining. Thanks!

  • @user-hf5og1bt8j
    @user-hf5og1bt8jАй бұрын

    amazing video! thank you!

  • @fenandamilanda2032
    @fenandamilanda2032 Жыл бұрын

    nice work, really good to visualize these things even though I already know this.

  • @yourfuneral
    @yourfuneral Жыл бұрын

    Thanks for the good explanation, very much on time

  • @prathams8685
    @prathams8685 Жыл бұрын

    Big thanks mate

  • @danjsy
    @danjsy8 ай бұрын

    Great, thanks so much, massively useful.

  • @stefanbiesdorf4637
    @stefanbiesdorf46379 ай бұрын

    Thank you Steve for a personal moment of enlightening by opening the "black box of AI". This is an outstanding educational piece, in particular in combination with the first part.

  • @sciexp
    @sciexp Жыл бұрын

    Very interesting... Now I understand more about how ChatGPT works...

  • @rajachan8588
    @rajachan8588 Жыл бұрын

    Awesome. Thank you

  • @ivanocj
    @ivanocj Жыл бұрын

    Very good content! Keep going! thanks.

  • @saisaske1
    @saisaske19 ай бұрын

    Very good explanation

  • @bgustinjr
    @bgustinjr Жыл бұрын

    Thumbs-up for actually making the pancakes. 😂

  • @Sukant98
    @Sukant98 Жыл бұрын

    amazing videos!! learnt so much

  • @thomasforrest1931
    @thomasforrest1931 Жыл бұрын

    great video bud cheers!

  • @carrumar
    @carrumar Жыл бұрын

    Great videos! Btw, how did you like your avocado cocoa thing? 😄

  • @ChatGPt2001
    @ChatGPt20016 ай бұрын

    Continuing from the previous response, here are some additional considerations and challenges when working with large language models: 14. Data Privacy and Ethical Concerns: - Be aware of privacy concerns when collecting and using data for training. Ensure that you have the necessary permissions and comply with data protection regulations. Ethical considerations, such as bias and fairness in the data, also need to be addressed. 15. Compute Resources: - Training large language models requires substantial computational resources, including high-end GPUs or TPUs and large-scale distributed computing infrastructure. These resources can be expensive and may not be accessible to everyone. 16. Energy Consumption: - Training large language models consumes a significant amount of electricity, contributing to environmental concerns. Some organizations are actively working on making AI training more energy-efficient. 17. Model Size and Efficiency: - While larger models tend to perform better, they also require more memory and computational power. Balancing model size and efficiency is crucial for real-world applications, as very large models might not be practical for all use cases. 18. Fine-Tuning and Transfer Learning: - Fine-tuning pre-trained models on specific tasks is a common practice, as it requires less data and computational resources compared to training from scratch. Understanding how to effectively fine-tune models is essential. 19. Evaluation Metrics: - Choosing appropriate evaluation metrics is critical. Different NLP tasks may require different metrics. For instance, accuracy may be suitable for classification tasks, while BLEU scores are used for machine translation. Select metrics that align with your objectives. 20. Bias and Fairness: - Large language models can inherit biases present in the training data. Mitigating bias and ensuring fairness in AI systems is a significant challenge. It requires careful curation of training data and ongoing monitoring. 21. Robustness and Safety: - Ensuring that large language models are robust and safe is essential. This includes protecting against adversarial attacks, avoiding harmful or inappropriate outputs, and ensuring that the model behaves predictably. 22. Research and Collaboration: - The field of large language models is rapidly evolving. Staying up-to-date with the latest research and collaborating with the AI community can help improve your understanding and the quality of your models. 23. Resource Sharing: - Due to the resource-intensive nature of training, sharing pre-trained models, datasets, and trained weights is common in the AI community. Leveraging existing resources can save time and resources. 24. Ethical Considerations: - Consider the ethical implications of your work. The power of large language models also comes with responsibility. Engage in ethical discussions and follow guidelines for responsible AI development. 25. Interpretability: - Large language models are often criticized for their lack of interpretability. Efforts to make AI models more understandable and explainable are ongoing to build trust and ensure accountability. Training large language models from scratch is a challenging and resource-intensive endeavor, and it may not be necessary for many practical applications. Leveraging pre-trained models and fine-tuning them for specific tasks is a more common and efficient approach in many cases. Moreover, ensuring responsible and ethical AI practices is paramount when working with such powerful language models.

  • @koho
    @koho Жыл бұрын

    Great overview! People need to see this video pair before freaking out that LLMs are actually intelligent.

  • @moedemama
    @moedemama Жыл бұрын

    Cool video, thanks

  • @abhaychandrol
    @abhaychandrol4 ай бұрын

    Thanks

  • @deand6411
    @deand6411 Жыл бұрын

    Thank you. Just the right level for my tiny organic brain.

  • @andresroca9736
    @andresroca9736 Жыл бұрын

    Great the concepts transition. Great illustrations. The best of the best this couple of videos. What about more on other networks like r-cnn and audio nets? 😃

  • @g5min

    @g5min

    Жыл бұрын

    Thanks! I'm working on one on reinforcement learning now...

  • @xflory26x
    @xflory26x Жыл бұрын

    Please do more videos on LLMs!!! But also I need to know, how were the pancakes?

  • @DJWESG1
    @DJWESG1 Жыл бұрын

    Ever think about audio synthesis and wave forms?? And how analogue synthesis utilising wave tables can offer a way to both communicate and comput information.

  • @CoreDump07
    @CoreDump07 Жыл бұрын

    liked and subscribed

  • @mineralt
    @mineralt8 ай бұрын

    those pancakes look pretty good

  • @leeamraa
    @leeamraa Жыл бұрын

    fun funny fantastic and I am a fan!

  • @zuqini
    @zuqini11 ай бұрын

    I'm a bit confused by how stacking attention layers works at 4:12. Does the second layer take the first layer's prediction as input? Is the first layer's prediction still "next words" at that point, or is it now some sort of abstract intermediate value? How exactly does that capture higher level reasoning? Would appreciate any clarification!

  • @food4yann
    @food4yann9 ай бұрын

    I love the fact that you really tried that recipe! How was it?

  • @SethWieder
    @SethWieder Жыл бұрын

    I would love more of the visual explainers on ML concepts. Subscribing. "Some folks say they're overhyped / But I do think that's true / I think they're just misunderstood / Just like me and you" That generated lyric gave me CHILLS

  • @kahoku451

    @kahoku451

    Жыл бұрын

    Sameee

  • @martinstu8400
    @martinstu8400 Жыл бұрын

    eliminating bias and stereotypes from language models is a lost cause, because it's the same as asking the network to lie.

  • @parabolicpanorama

    @parabolicpanorama

    Жыл бұрын

    you didn't understand anything did you

  • @dormin1850
    @dormin1850 Жыл бұрын

    This was super informative and concise, loved it! But my real question is how were those pancakes?

  • @efisgpr
    @efisgpr Жыл бұрын

    GPT-4: Might I suggest dubbing them "Chocomole Pancakes"? 😮😂

  • @richardharris9708
    @richardharris9708 Жыл бұрын

    ChatGPT gets the 37 question right now.

  • @logic_force
    @logic_force8 ай бұрын

    Can I do this in JavaScript?!

  • @speicaldark
    @speicaldark Жыл бұрын

    This is really nice! Now I understand why chatGPT tends to make up a lot of stuff with coherent sentences

  • @whannabi

    @whannabi

    Жыл бұрын

    What I'm wondering is how are they correcting its errors. For traditional NN, we have heat maps but I'd like to see something similar with transformers at the highest level to see what kind of patterns it noticed. Maybe that's what they use to correct its mistakes

  • @gideonk123

    @gideonk123

    Жыл бұрын

    @@whannabi ChatGPT used a process called reinforcement learning from human feedback (RLHF): They used an already trained GPT-3 which already at the time. Humans both submitted new sentences as input prompts to ChatGPT being trained, and also ranked the output (responses) of the model. Then the ranking of the responses were then used as reward targets to continue training the model to obtain more desirable responses (measured by how the response rankings had increased).

  • @triton62674
    @triton62674 Жыл бұрын

    Fantastic work, hope those pancakes tasted better than they looked! xD

  • @g5min

    @g5min

    Жыл бұрын

    they were seriously delicious

  • @mlusalin2379
    @mlusalin2379 Жыл бұрын

    It looks like GPT hands you a great chocolate guacamole pancake recipe... now I wanted to try too loool

  • @chnolte
    @chnolte Жыл бұрын

    Just look how far we have come in only eight months.

  • @GeorgeJohnsonJackofAllTrades
    @GeorgeJohnsonJackofAllTrades8 ай бұрын

    Rather than asking chatgpt to translate, I recommend asking the question in the target language. You'll significantly different results - more accurate in the target language. Stated without proof or explanation which should be intuitively obvious after watching these two videos...

  • @nagualdesign
    @nagualdesign Жыл бұрын

    Wonderful stuff. 👍 Also, _please can I get some oven and the oven please let us have to do the run and not a big difference in a bit more about people who have been in touch your own house is the best way of a bit more about people..._ *My phone wrote the part in italics.

  • @Ai.Sentinel
    @Ai.Sentinel Жыл бұрын

    Was the chocolate guacamole pancake any good?

  • @KeikosCake
    @KeikosCake Жыл бұрын

    pancake approved 🥞👍

  • @genegade
    @genegade Жыл бұрын

    Verdict on the guacookies?

  • @altalt7653
    @altalt7653 Жыл бұрын

    I can't get why we stack them like so. If the first transformer block predicts a word, what second does, third? And why do they still need attention then?

  • @hongyifan5203
    @hongyifan52038 ай бұрын

    Me: What is a LLM? Average university pRofeSsOr: *Slams 12 academic papers with 119 equations and 165 references and told me to read* Chad Steve Seitz:

  • @DonaldTamMisterDee
    @DonaldTamMisterDee Жыл бұрын

    This is super fascinating. I want to learn to grow my own language models from the ground up with languages like c++/rust. I dont care if the language model i develop is inaccurate or not. I just want to understand chatgpt under the hood. How may i get started?

  • @spenarkley

    @spenarkley

    Жыл бұрын

    Why dont you ask the language model how to make a language model?

  • @infiniteplanes5775

    @infiniteplanes5775

    Жыл бұрын

    I have! Really great tips!

  • @parabolicpanorama

    @parabolicpanorama

    Жыл бұрын

    ask gpt to stack transformer layers and add some input output layers. add some loss functions to optimize and get a large dataset.

  • @jaredf6205
    @jaredf6205 Жыл бұрын

    What do you think of GPT4?

  • @f4ls381
    @f4ls381 Жыл бұрын

    5:55 the german one is correct

Келесі