Vectoring Words (Word Embeddings) - Computerphile

How do you represent a word in AI? Rob Miles reveals how words can be represented as multi-dimensional vectors - with some unexpected results.
08:06 - Yes, it's a rubber egg :)
Unicorn AI:
EXTRA BITS: More Word ...
AI YouTube Comments: ...
More from Rob Miles: bit.ly/Rob_Miles_KZread
Thanks to Nottingham Hackspace for providing the filming location: bit.ly/notthack
This video was filmed and edited by Sean Riley.
Computer Science at the University of Nottingham: bit.ly/nottscomputer
Computerphile is a sister project to Brady Haran's Numberphile. More at www.bradyharan.com

Comments: 402

  • @VladVladislav790
    4 years ago

    "Not in this data set" is my new favorite comeback oneliner

  • @MrAmgadHasan
    1 year ago

    It's similar to "not in this timeline" that we hear a lot in time-travel scifi

  • @Jason-wm5qe
    1 year ago

    😂

  • @wohdinhel
    4 years ago

    “What does the fox say?” “Don’t they go ‘ring ding ding’?” “Not in this dataset”

  • 4 years ago

    Train the same algorithm on songs instead of news articles and I figure you could get some really interesting results. Songs work on feelings, and that should change the connections between the words - I bet the technology could also be used to tell a lot about the perspective people take on things.

  • @argenteus8314
    4 years ago

    @ Songs also use specific rhythmic structures; assuming most of your data was popular music, I bet that there'd be a strong bias for word sequences that can fit nicely into a 4/4 time signature, and maybe even some consistent rhyming structures.

  • @killedbyLife
    4 years ago

    @ Train it with only lyrics from Manowar!

  • @ruben307
    4 years ago

    @ I wonder how strongly rhymes would show up in that dataset.

  • 4 years ago

    @@killedbyLife That's odd - I listen to Manowar regularly. Nice pick. 😉

  • @kurodashinkei
    4 years ago

    Tomorrow's headline: "Science proves fox says 'Phoebe'"

  • @bookslug2919
    4 years ago

    Fox News

  • @xario2007
    4 years ago

    Okay, that was amazing. "London + Japan - England = Tokyo"
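
For anyone who wants to try this arithmetic themselves, here is a minimal sketch using gensim's word2vec interface. The specific pretrained model (the Google News vectors from gensim's downloader) is an assumption - the video doesn't say which vectors were loaded - and the exact results will depend on the vectors used.

    import gensim.downloader as api

    # Pretrained vectors via gensim's downloader; this model name is an assumption (~1.6 GB download).
    model = api.load("word2vec-google-news-300")

    # "London" + "Japan" - "England"; the top hits should be Tokyo-like, but output depends on the vectors.
    print(model.most_similar(positive=["London", "Japan"], negative=["England"], topn=3))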

  • @yshwgth
    4 years ago

    That needs to be a web site

  • @cheaterman49
    4 years ago

    More impressed by Santa + pig - oink = "ho ho ho"

  • @VoxAcies
    4 years ago

    This blew my mind. Doing math with meaning is amazing.

  • @erikbrendel3217
    4 years ago

    you mean Toyko!

  • @Dojan5
    4 years ago

    I was actually expecting New York when they added America. As a child I always thought New York was the capital of the U.S., I was at least around eight when I learned that it wasn't. Similarly, when people talk of Australia's cities, Canberra is rarely spoken of, but Sydney comes up a lot.

  • @bluecobra95
    4 years ago

    'fox' + 'says' = 'Phoebe' may be from newspapers quoting English actress Phoebe Fox

  • @skepticmoderate5790
    4 years ago

    Wow what a pull.

  • @rainbowevil
    3 years ago

    It was given ‘oink’ minus ‘pig’ plus ‘fox’ though, not fox + says. So we’d expect to see the same results as for cow & cat etc. of it “understanding” that we’re looking at the noises that the animals make. Obviously it’s not understanding, just an encoding of how those words appear near each other, but we end up with something remarkably similar to understanding.

  • @adamsvoboda7717
    4 years ago

    Meanwhile in 2030: "human" + "oink oink" - "pig" = "pls let me go skynet"

  • @Chayat0freak
    4 years ago

    I did this for my final project in my BSc. It's amazing. I found cider - apples + grapes = wine. My project attempted to use these relationships to build simulated societies and stories.

  • @Games-mw1wd
    4 years ago

    would you be willing to share a link? This seems really interesting.

  • @TOASTEngineer
    4 years ago

    Yeah, that sounds right up my alley - how well did it work?

  • @ZoranRavic
    4 years ago

    Dammit Dean, you can't bait people with this kind of a project idea and not tell us how it went

  • @KnakuanaRka
    4 years ago

    You want to give some info as to how that went?

  • @blasttrash
    1 year ago

    You are lying, you did not do it. If you did, then paste the source (paper or code). - Cunningham

  • @Cr42yguy
    4 years ago

    EXTRA BITS NEEDED!

  • @kamalmanzukie
    3 years ago

    grow up

  • @Alche_mist
    4 years ago

    Fun points: A lot of the Word2vec concepts come from Tomáš Mikolov, a Czech scientist at Google. The Czech part is kinda important here - Czech, as a Slavic language, is highly inflected: you have a lot of different forms for a single word, depending on its surroundings in a sentence. In some interview I read (it was in Czech and in a paid online newspaper, so I can't give a link), he mentioned that this inspired him a lot - you can see the words clustering by their grammatical properties when running on a Czech dataset, and it's easier to reason about such changes when a significant portion of them is exposed visibly in the language itself (and learned as a child in school, because some basic parts of it are needed in order to write correctly).

  • @JDesrosiers
    1 year ago

    very interesting

  • @afriedrich1452
    1 year ago

    I keep wondering if I was the one who gave the inventor of Word2vec the idea of vectoring words 15 years ago. Probably not.

  • @notthedroidsyourelookingfo4026
    1 year ago

    Now I wonder what would've happened if it had been a Chinese speaker, since Chinese doesn't have that at all!

  • @GuinessOriginal
    1 year ago

    Wonder how this works with Japanese? Their token spaces must be much bigger and more complex

  • @newbie8051
    1 year ago

    Technically you can share the link to the newspaper

  • @rich1051414
    4 years ago

    This thing would ace the analogy section of the SAT. Apple is to tree as grape is to ______. model.most_similar_cosmul(positive=['tree', 'grape'], negative=['apple']) = "vine"

  • @buzz092
    4 years ago

    Always love to see Rob Miles here!

  • @RobertMilesAI
    4 years ago

  • @yondaime500
    4 years ago

    Even when the video doesn't have that "AAAHH" quality to it.

  • @veggiet2009
    4 years ago

    Foxes do chitter! But primarily they say "Phoebe"

  • @panda4247
    4 years ago

    I like this guy and his long sentences. It's nice to see somebody who can muster a coherent sentence of that length. So, if you run this (it's absurdly simple, right), but if you run this on a large enough data set and give it enough compute to actually perform really well, it ends up giving you for each word a vector (that's of length however many units you have in your hidden layer), for which the nearby-ness of those vectors expresses something meaningful about how similar the contexts are that those words appear in, and our assumption is that words that appear in similar contexts are similar words.
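
As a rough illustration of what that long sentence describes - training on a corpus and getting one vector per word out of the hidden layer - here is a minimal sketch using gensim's Word2Vec. The toy corpus is a stand-in; as the video stresses, the vectors only become meaningful with a very large dataset.

    from gensim.models import Word2Vec

    # Toy corpus: a list of tokenized sentences (a real run needs millions of them).
    sentences = [
        ["the", "quick", "brown", "fox", "jumps", "over", "the", "lazy", "dog"],
        ["the", "cat", "sat", "on", "the", "mat"],
    ]

    # vector_size is the width of the hidden layer, i.e. the length of each word vector.
    model = Word2Vec(sentences, vector_size=300, window=5, min_count=1, workers=4)

    print(model.wv["fox"].shape)         # (300,) - one learned vector per word
    print(model.wv.most_similar("fox"))  # nearby words in the embedding space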

  • @thesecondislander
    1 year ago

    His neural network has a very large context, evidently ;)

  • @MrAmgadHasan
    1 year ago

    Imagine a conversation between him and D Trump.

  • @muddi900
    4 years ago

    'What does it mean for two words to be similar?' That is a philosophy lesson I am not ready for bro

  • @williamromero-auila7129
    4 years ago

    Breau

  • @_adi_dev_
    4 years ago

    How dare you assume my word's meaning, don't you know it's the current era

  • @cerebralm
    4 years ago

    that's kind of the great thing about computer science... you can take philosophical waffling and actually TEST it

  • @youteubakount4449
    4 years ago

    I'm not your bro, pal

  • @carlosemiliano00
    4 years ago

    @@cerebralm "Computer science is the continuation of logic by other means"

  • @nemanjajerinic6141
    6 months ago

    Today, vector databases are a revolution for AI models. This man was way ahead of his time.

  • @wolfbd5950
    4 years ago

    This was weirdly fascinating to me. I'm generally interested by most of the Computerphile videos, but this one really snagged something in my brain. I've got this odd combination of satisfaction and "Wait, really? That works?! Oh, wow!"

  • @alexisxander817
    3 years ago

    I am in love with this man's explanation! makes it so intuitive. I have a special respect for folks who can make a complex piece of science/math/computer_science into an abstract piece of art. RESPECT!

  • @nidavis
    1 year ago

    "it's the friends you make along the way" lol

  • @sgttomas
    1 year ago

    I was just thinking this and came to the comments…. Yup. Mr Miles is terrific. 🎉

  • @webgpu
    1 year ago

    "complex" ? 🙂

  • @Commiehunter12
    9 months ago

    He's Twerp. He's afraid to talk about X Y and XX Chromosomes and how we express them in language. shame on you

  • @subject8332
    6 months ago

    @@Commiehunter12 No, he just didn't want to trigger the priesthood in a video about word embeddings but looks like he wasn't careful enough.

  • @kal9001
    4 years ago

    Rather than biggest city, it seems obvious it would be the most written about city, which may or may not be the same thing.

  • @packered
    4 years ago

    Yeah, I was going to say most famous cities. Still a very cool relationship

  • @oldvlognewtricks
    4 years ago

    Would be interested by the opposite approach: ‘Washington D.C. - America + Australia = Canberra’

  • @Okradoma
    4 years ago

    Toby Same here... I'm surprised they didn't run that.

  • @tolep
    3 years ago

    Stock markets

  • @LeoStaley
    4 years ago

    I'm a simple man. I see Rob Miles, I click.

  • @koerel
    4 years ago

    I could listen to him all day!

  • @Alkis05
    3 years ago

    This is basically node embedding from graph neural networks. Each sentence you use to train it can be seen as a random walk in the graph that relates each word to the others. The number of words in the sentence can be seen as how far you walk from the node.

    Besides "word-vector arithmetic", one interesting thing would be to use this data to generate a graph of all the words and how they relate to each other. Then you could do network analysis with it: see, for example, how many clusters of words there are and figure out what their labels are. Or label a few of them and let the graph try to predict the rest.

    Another interesting thing would be to try to embed sentences based on the embeddings of words. For that you would take a sentence and train a function that maps points in the word space to points in the sentence space, by aggregating the word points somehow. That way you could compare sentences that are close together, and then you could do sentence-vector arithmetic. This actually sounds like a cool project; I think I'm gonna give it a try.
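
A quick sketch of the simplest version of that idea: building a sentence vector by aggregating word vectors, here just averaging them, which is a common baseline rather than the only (or best) approach. The pretrained model name below is an assumption.

    import numpy as np
    import gensim.downloader as api

    wv = api.load("glove-wiki-gigaword-100")  # any pretrained vectors would do; this name is an assumption

    def sentence_vector(tokens):
        # Average the vectors of the in-vocabulary words; zeros if none are known.
        vecs = [wv[t] for t in tokens if t in wv]
        return np.mean(vecs, axis=0) if vecs else np.zeros(wv.vector_size)

    a = sentence_vector("the cat sat on the mat".split())
    b = sentence_vector("a dog lay on the rug".split())
    cosine = np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))
    print(f"cosine similarity: {cosine:.3f}")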

  • @jamesjonnes
    10 months ago

    How did it go?

  • @PerMortensen
    4 years ago

    Wow, that is mindblowing.

  • @joshuar3702
    4 years ago

    I'm a man of simple tastes. I see Rob Miles, I press the like button.

  • @Verrisin
    4 years ago

    floats: some of the real numbers - Best description and explanation ever! - It encompasses all the problems and everything....

  • @RobertMilesAI
    3 years ago

    "A tastefully curated selection of the real numbers"

  • @SeanSuggs
    1 month ago

    Rob Miles and Computerphile, thank you... IDK why YouTube gave this gem back to me today (probably for my incessant searching for the latest LLM news these days) but I am grateful to you even more now than I was 4 yrs ago... Thank you

  • @WondrousHello
    1 year ago

    This has suddenly become massively relevant 😅

  • @Verrisin
    4 years ago

    Man... when AI realizes we can only imagine 3 dimensions, it will be so puzzled how we can do anything at all...

  • @overloader7900
    3 years ago

    Actually 2 spatial visual dimensions with projection... Then we have time, sounds, smells...

  • @Democracy_Manifest
    9 months ago

    The amount of neurons is more important than the experienced dimensions.

  • @MakkusuOtaku
    4 years ago

    Word embedding is my favorite pastime.

  • @arsnakehert
    1 year ago

    Love how you guys are just having fun with the model by the end

  • @b33thr33kay
    1 year ago

    You really have a way with words, Rob. Please never stop what you do. ❤️

  • @Sk4lli
    4 years ago

    This was soooo interesting to me. I never dug deeper into how these networks work, but this gave me so many "Oh! That's how it is!" moments. When I watched the video about GPT-2 and he said that all the connections are just statistics, I noted that as interesting and "makes sense" but didn't really get it. With this video it clicked! So many interesting things, so thanks a lot for that. I love these videos. And seeing the math that can be done with these vectors is amazing! Wish I could like this more than once.

  • @crystalsoulslayer
    11 months ago

    It makes so much more sense to represent words numerically rather than as collections of characters. That may be the way we write them, but the characters are just loose hints at pronunciation, which the model probably doesn't care about for meaning. And what would happen if a language model that relied on characters tried to learn a language that doesn't use that system of writing? Fascinating stuff.

  • @tridunghuynh5573
    3 years ago

    I love the way he's discussing complicated topics. Thank you very much

  • @superjugy
    4 years ago

    OMG that ending. Love Robert's videos!

  • @SpaceChicken
    1 year ago

    Phenomenal talk. Surprisingly compelling given the density of the topic. I really do hope they let this man out of prison one day.

  • @RazorbackPT
    4 years ago

    I would suspect that this has to be very similar to how our own brains interpret language, but then again evolution has a tendency to go about solving problems in very strange and inefficient ways.

  • @maxid87
    4 years ago

    Do you have examples? I am really curious - so far I always assumed nature does it the most efficient way possible.

  • @wkingston1248
    4 years ago

    @@maxid87 mammals have a nerve that goes from the brain to the throat, but due to changes in mammals it always goes under a vein in the heart and then back up to the throat. This is so extreme that in a giraffe the nerve is like 9 feet long or something. In general, evolution does a bad job at removing unnecessary features.

  • @Bellenchia
    4 years ago

    Clever Hans

  • @maxid87
    4 years ago

    @@wkingston1248 how do you know that this is inefficient? Might seem like that at first glance but maybe there is some deeper reason for it? Are there actual papers on this topic that answer the question?

  • @cmilkau
    4 years ago

    I doubt there is a lot of evolution at play in human language processing. It seems reasonable to assume that association (cat~dog) and decomposition (Tokyo = japanese + city) play an important role.

  • @mynamesnotsteve
    3 years ago

    I'm surprised that there's been no mention of Rob's cufflinks in the comments for well over a year after upload

  • @abdullahyahya2471
    9 months ago

    Mind blown, Thanks for the easy explanation. So calm and composed.

  • @channagirijagadish1201
    1 year ago

    Very well done. I love the explanation. He obviously has deep insight to explain it so very well. Thanks.

  • @cheeyuanng853
    1 year ago

    This has gotta be one of the best intuitive explanations of word2vec.

  • @lonephantom09
    4 years ago

    Beautifully simple explanation! Resplendent!

  • @tapanbasak1453
    6 months ago

    This page blows my mind. It takes you through the journey of thinking.

  • @helifalic
    4 years ago

    This blew my mind. Simply wonderful!

  • @giraffebutt
    4 years ago

    What’s with that room? Is this Prisonphiles?

  • @MichaelErskine
    4 years ago

    It's Nottinghack - but true it's a bit prison-like

  • @Sanders4069
    2 months ago

    So glad they allow this prisoner a conjugal visit to discuss these topics!

  • @Razzha
    4 years ago

    Mind blown, thank you very much for this explanation!

  • @kamandshayegan4824
    7 months ago

    I am amazed and in love with his explanations. I just understand it clearly, you know.

  • @patricke1362
    3 months ago

    super nice style of speaking, voice and phrasing. Good work !

  • @StevenVanHorn
    4 years ago

    I'm realllly curious about the basis vectors in this. What's the closest few words to etc..

  • @Guztav1337
    4 years ago

    That. Now I'm really curious.

  • @yugioh8810
    4 years ago

    I don't think that such a representation captures distance information at all to begin with. The *closest* word would have a distance of 1 (Hamming distance in this case; I claim that each flipped bit counts as 1 Hamming distance), but it might not be a word at all. Whereas in a vector-encoded representation, since the words are mapped to a *vector space*, the closeness or farness of two vectors is conveyed in that representation. Information representation is a fabulous topic; I don't think I understand it yet. Information theory may help us understand information and information representation.

  • @Guztav1337
    4 years ago

    @worthy null , wtf are you on about? Nobody said anything about Hamming distance. He asked: what few words are the closest to the basis vectors [in euclidean distance] in that vector space.

  • @LEZAKKAZ
    4 years ago

    I see where you're going with your analogy, but embeddings generally don't work like that. At first all the words are given a random vector, and then those vectors change throughout the training process. So the words you're looking for would be meaningless in this case. If you're looking for the centroid words (words that appear in the center of the embeddings), those would be words that have very broad contexts, such as "the".

  • @StevenVanHorn
    4 years ago

    @Gerben van Straaten something that might be cute would be defining some human-meaningful basis vectors, then rotating/scaling the points to fit them and seeing what the remaining bases are. You're definitely right that they would not be human-meaningful out of the box though
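
For anyone who wants to poke at the basis-vector question directly, gensim's similar_by_vector lets you ask which words lie closest to an arbitrary direction, including a raw coordinate axis. A minimal sketch follows (the pretrained model name is an assumption); as noted above, individual dimensions aren't trained to be human-meaningful, so expect fairly arbitrary-looking lists.

    import numpy as np
    import gensim.downloader as api

    wv = api.load("glove-wiki-gigaword-100")  # an assumption; any pretrained vectors would do

    # Unit vector along dimension 0 of the embedding space.
    basis = np.zeros(wv.vector_size, dtype=np.float32)
    basis[0] = 1.0

    # Words whose vectors point most nearly along that axis.
    print(wv.similar_by_vector(basis, topn=5))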

  • @vic2734
    4 years ago

    Beautiful concept. Thanks for sharing!

  • @tommyhuffman7499
    1 year ago

    This is by far the best video I've seen on Machine Learning. So cool!!!

  • @helmutzollner5496
    1 year ago

    Very interesting. Would like to see more about these word vectors and how to use them.

  • @kenkiarie
    4 years ago

    This is very impressive. This is actually amazing.

  • @peabnuts123
    4 years ago

    16:20 Rob loves it, he's so excited by it 😄

  • @redjr242
    4 years ago

    This is fascinating! Might we be able to represent language in the abstract as a vector space? Furthermore, might similar but slightly different words in different languages be represented by similar but slightly different vectors in this vector space?

  • @rishabhmahajan6607
    3 years ago

    Brilliantly explained! Thank you for this video

  • @taneliharkonen2463
    1 year ago

    Mind blown... Able to do arithmetic on the meaning of words... I did not see that one coming :o A killer explanation on the subject thanks!! :D

  • @Nagria2112
    4 years ago

    Rob Miles is back :D

  • @danielroder830
    4 years ago

    You could make a game with that, some kind of scrabble with random words: add and subtract words to get other words. Maybe with the goal of getting long words or specific words, or getting the shortest or longest distance from a specific word.

  • @SanderBuruma
    4 years ago

    absolutely fascinating

  • @Galakyllz
    4 years ago

    Amazing video! I appreciate every minute of your effort, really. Think back, wondering "Will anyone notice this? Fine, I'll do it." Yes, and thank you.

  • @MrSigmaSharp
    4 years ago

    Oh yes, an explanation and a concrete example

  • @JamieDodgerification
    4 years ago

    Would it be possible for Rob to share his colab notebook / code with us so we can play around with the model for ourselves? :D

  • @jeffreymiller2801
    4 years ago

    I'm pretty sure it's just the standard model that comes with gensim

  • @steefvanwinkel
    4 years ago

    See bdot02's comment above
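
For anyone wanting to recreate the playground from the video, here is a minimal sketch along those lines: gensim's downloader can list and fetch several pretrained vector sets. The video doesn't state exactly which model was used, so the choice below is an assumption, and the query results will vary with the vectors.

    import gensim.downloader as api

    # See which pretrained vector sets gensim can fetch.
    print(list(api.info()["models"].keys()))

    model = api.load("word2vec-google-news-300")  # an assumption; pick any model from the list

    # Recreate the kind of query from the end of the video.
    print(model.most_similar(positive=["Santa", "oink"], negative=["pig"], topn=5))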

  • @Pasan34
    1 year ago

    Put 'politician' instead of Santa in the last example. Buy you a beer if it comes out as 'bleating'.

  • @alisalloum629
    2 years ago

    Damn, that's the most enjoyable and informative video I've seen in a while

  • @bruhe_moment
    4 years ago

    Very cool! I didn't know we could do word association to this degree.

  • @theshuman100
    4 years ago

    word embeddings are the friends we make along the way

  • @DrD0000M
    4 years ago

    3rd result for dog is "bark incessantly." Even AI knows dogs are annoying mutants. Fun fact: Wolves don't bark, well, almost never.

  • @Dawn-hd5xx
    4 years ago

    Wild cats also don't meow. Even feral "domestic" (as in the species) cats don't meow, it's only towards humans that they do.

  • @NicknotNak
    4 years ago

    Duddino Gatto they mew. But they outgrow it pretty quickly. Humans don't babble like babies when we grow up, but if that was the only thing our feline overlords responded to, we would.

  • @Noxeus1996
    4 years ago

    This video really deserves more views.

  • @rafaelzarategalvez6728
    4 years ago

    It'd have been nice to hear about the research craze around more sophisticated approaches to NLP. It's hard to keep up with the amount of publications lately related to achieving "state-of-the-art" models using GLUE's benchmark.

  • @user-cj2rm3nz7b
    4 months ago

    Wonderful explanation

  • @debayanpal8107
    2 months ago

    best explanation about word embedding

  • @dzlcrd9519
    4 years ago

    Awesome explaining

  • @distrologic2925
    4 years ago

    I love that I have been thinking about modelling natural language for some time now, and this video basically confirms the direction I was heading. I had never heard of word embedding, but it's exactly what I was looking for. Thank you Computerphile and YouTube!

  • @UserName________
    9 months ago

    How far we've come only 3 years later.

  • @simonfitch1120
    4 years ago

    That was fascinating - thanks!

  • @datasciyinfo5133
    1 year ago

    Thanks for a great explanation of word embeddings. Sometimes I need a review. I think I understand it, then after looking at the abstract, n-dimensional embedding space in ChatGPT and Variational Autoencoders, I forget about the basic word embeddings. At least it’s a simple 300-number vector per word, that describes most of the highest frequency neighboring words.

  • @michaelcharlesthearchangel
    1 year ago

    Me too. I loved the review after looking how GPT4 and its code/autoencoder-set looks under the hood. I also had to investigate the keywords being used like "token" when we think about multi vector signifiers and the polysemiology of glyphic memorization made by these massive AI databases. Parameters for terms, words went from 300 to 300,000 to 300,000,000 to 1.5 trillion to ♾ infinite. Meaning: Pinecone and those who've reached infinite parameters have created the portal to a true self-learning operating system, self-aware AI.

  • @jackpisso1761
    4 years ago

    That's just... amazing!

  • @TrevorOFarrell
    4 years ago

    Nice thinkpad rob! I'm using the same version of x1 carbon with the touch bar as my daily machine. Great taste.

  • @endogeneticgenetics
    1 year ago

    Would love sample code in cases like this where there’s a Jupyter notebook already laying about!

  • @Veptis
    4 years ago

    It's more than slightly surprising that you can explain this concept in 17 minutes, instead of needing a semester full of lectures.

  • @Rockyzach88
    1 year ago

    Yeah but did you "learn" or just "understand while listening". Those are not the same things. Although, they may complement each other nicely in some cases.

  • @robinw77
    4 years ago

    When the room started getting brighter and brighter, I thought the rapture was happening xD

  • @edoardoschnell
    4 years ago

    This is über amazing. I wonder if you could use that to predict cache hits and misses

  • @nonchip
    4 years ago

    3:00 pretty sure that graphic should've been just 2 points on the same line, given what he said a few sentences before that.

  • @panda4247
    4 years ago

    Yep, if the mapping of images is just taking the value of each pixel and making an N-dimensional vector (where N is the number of pixels), then the picture with more brightness would be on the same line (if solid black pixels were still solid black, depending on the brightness filter applied).

  • @maksdejna5486
    1 year ago

    Really nice explanation :)

  • @shourabhpayal1198
    2 years ago

    Great explanation

  • @phasm42
    4 years ago

    Very informative!

  • @dialecticalmonist3405
    1 year ago

    Santa giving you "ho ho ho" was both terrifying and humorous at the same time. Wow.

  • @WylliamJudd
    4 years ago

    Wow, that is really impressive!

  • @Gargamelle
    3 years ago

    If you train 2 networks on different languages, I guess the latent spaces would be similar. And the differences could be really relevant to how we think differently due to using different languages.

  • @wazzzuuupkiwi
    4 years ago

    This is amazing

  • @4.0.4
    4 years ago

    I never thought the difference between "man" and "woman" could be so intertwined in a language! I can imagine it would make some people uneasy.

  • @RafaelCouto
    4 years ago

    Plz more AI videos, they are awesome!

  • @worldaviation4k
    4 years ago

    Is the diagram with angles and arrows going off in all directions just for us to visualise it, rather than how computers are looking at it? I didn't think they'd be calculating degrees. I thought it would be more about numbers for how close the match is, like 0-100.

  • @phasm42
    4 years ago

    The weights would be per-connection and independent of the input, so is the vector composed of the activation of each hidden layer node for a given input?

  • @jellyboy00
    4 years ago

    I think this looks more like an auto-encoder than GAN

  • @cmilkau
    4 years ago

    It's structurally the same as an AE, but it's trained differently. An AE has to reconstruct its input, but this network needs to find closely related, but different words.

  • @matiasbarrios7983
    4 years ago

    This is awesome

  • @112Nelo
    4 years ago

    What kind of news data set has the info that Santa says HOHOHO?

  • @RedwoodRhiadra
    4 years ago

    One that includes all the "human interest" fluff that newspapers publish around the holidays...

  • @blakef.8566
    3 years ago

    You mean HO_HO_HO

  • @youssefezzeddine923
    1 month ago

    This is one of the coolest things I've seen in a while. Just thinking: how small a neighbourhood of one word/vector should we take? Or how does the implementation of context affect the choice of optimal neighbourhoods?

  • @youssefezzeddine923
    1 month ago

    And contexts themselves vary from a person to another depending on how they experienced life. So it would be interesting to see also a set of optimal contexts and that would affect the whole thing.

  • @arsilvyfish11
    1 year ago

    Can you share the above Colab notebook? It would be really great for a quick reference with the vid.

  • @dixie_rekd9601
    1 year ago

    I kinda overlooked the importance of this video when it was released 3 yrs ago. Now it's basically the explanation of how ChatGPT does its thing... but with more data.