(GPT-2) Language Models are Unsupervised Multitask Learners | Paper Explained

Ғылым және технология

Here’s another video from my GPT series where I analyze the GPT-2(Language Models are Unsupervised Multitasks Learners) paper. I took a closer look at data gathering process, results and safety concerns that prevented the initial public release of the model.
Paper:
d4mucfpksywv.cloudfront.net/b...
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
Links:
huggingface.co/datasets
openai.com/blog/better-langua...
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
Connect with me on:
Linkedin - / maciej-balawejder-rt8015
GitHub - github.com/maciejbalawejder
Medium - / maciejbalawejder
Buy Me a Coffee - [www.buymeacoffee.com/mbalawejder](www.buymeacoffee.com/mbalawejder)
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
Timestamps:
0:00 Introduction
0:30 GPT-1 Recap
1:18 Abstract
2:20 Dataset
3:30 Byte Pair Encoding
6:30 Architecture
7:25 Results
7:50 Lambada
8:40 CBT
10:25 Winograd Schema Challenge
11:14 CoQA
11:35 Summarization
12:30 Translation
13:13 Question Answering
13:38 Conclusions
15:00 Safety Concerns

Пікірлер: 7

  • @sg1192k
    @sg1192k4 ай бұрын

    Thanks for the video man!

  • @KS-df1cp
    @KS-df1cp3 ай бұрын

    If you listen to his GPT1 presentation then this one will make more sense. Thanks Maciej, very well explained. :)

  • @hubbankhan3309
    @hubbankhan3309 Жыл бұрын

    super helpful elaboration, thanks man, respect!

  • @randomthoughts7838
    @randomthoughts7838 Жыл бұрын

    Superb explaination, please make similar explaination for gpt3 and bert

  • @vanongle9648
    @vanongle96489 ай бұрын

    Thank you for explained Video ! I think the fact that open AI does not provide open source code is partly because of information security and technology competition .

  • @user-jo8ix2rp6v
    @user-jo8ix2rp6v Жыл бұрын

    Have you written a code for it just like gpt1?

  • @maciejbalawejder

    @maciejbalawejder

    Жыл бұрын

    The architecture is the same as gpt-1, the only difference is configuration(number of layers, heads, and d_size)

Келесі