What Makes a Good Image Generator AI?

Ғылым және технология

Three paper recommendations this time:
- Inception score - "Improved Techniques for Training GANs" - arxiv.org/abs/1606.03498
- "Progressive Growing of GANs for Improved Quality" - arxiv.org/abs/1710.10196
- Inception score criticism - "A Note on the Inception Score" - arxiv.org/abs/1801.01973
Pick up cool perks on our Patreon page:
› / twominutepapers
We would like to thank our generous Patreon supporters who make Two Minute Papers possible:
313V, Alex Haro, Andrew Melnychuk, Angelos Evripiotis, Anthony Vdovitchenko, Brian Gilman, Christian Ahlin, Christoph Jadanowski, Dennis Abts, Eric Haddad, Eric Martel, Evan Breznyik, Geronimo Moralez, Jason Rollins, Javier Bustamante, John De Witt, Kaiesh Vohra, Kjartan Olason, Lorin Atzberger, Marcin Dukaczewski, Marten Rauschenberg, Maurits van Mastrigt, Michael Albrecht, Michael Jensen, Morten Punnerud Engelstad, Nader Shakerin, Owen Campbell-Moore, Owen Skarpness, Raul Araújo da Silva, Richard Reis, Rob Rowe, Robin Graham, Ryan Monsurate, Shawn Azman, Steef, Steve Messina, Sunil Kim, Thomas Krcmar, Torsten Reil, Zach Boldyga, Zach Doty.
/ twominutepapers
Thumbnail background image credit: pixabay.com/photo-3840163/
Splash screen/thumbnail design: Felícia Fehér - felicia.hu
Károly Zsolnai-Fehér's links:
Facebook: / twominutepapers
Twitter: / karoly_zsolnai
Web: cg.tuwien.ac.at/~zsolnai/

Пікірлер: 63

@FunkyPrince5 жыл бұрын
3:39 - When you'll see it... (Row:2 Column:12)
@TwoMinutePapers
5 жыл бұрын
Whoa.
@sausage4mash
5 жыл бұрын
funkyprince is mad, do not remove your blindfold and look !
@jessehammer123
5 жыл бұрын
Column 2, Row 5 gets you some unholy combination of George Bush and Michael Bloomberg.
@DougSalad
5 жыл бұрын
Row 2, Column 10 is literally BLEEDING FROM THE EYES
@beskamir5977
5 жыл бұрын
Lovely, and I made the mistake of watching this during the night right before I'm about to go to sleep.
@mizuhonova5 жыл бұрын
Is it possible to now use the inception scorer network to tweak the GAN network to maximize its inception score?
@dragonskunkstudio75825 жыл бұрын
This is more enjoyable because you were in the past so thrilled at everything. What a time to be alive and all that.
@martiddy5 жыл бұрын
Your channel is almost at 200k subscribers, keep the good work!
@martiddy
5 жыл бұрын
@@nagualdesign sorry, the autocorrect mess it up!
@martiddy
5 жыл бұрын
@@nagualdesign Actually, I'm not a fan of GamingGirl. But I have a friend who is called like that (is probably because of that).
@TwoMinutePapers
5 жыл бұрын
We are the most surprised by this. Thank you so much for all the love!
@kayrosis55235 жыл бұрын
1: Inception Score as a measurement for the quality of an AI is the perfect name! 2: that sharkdog land squid at 5:20 is horrifying
@tripzero05 жыл бұрын
thanks for describing "overfitting" for us. Helps us noobs out!
@mdp5337
5 жыл бұрын
I wish to recommend (in general, and it contains the notion) 'Algorithms to Live By' from Brian Christian and Tom Griffiths. Not in the specific context of NN based AI.
@wiktormigaszewski86845 жыл бұрын
Inception Score (and its new version FID, Fréchet Inception Distance) measure differences of distribution of scores for 1// generated images, 2// real images-> bigger distance means less "real" generated images. Name "Inception Score" comes from Inception-v6 application (official Tensorflow version)that is used for this distribution comparison
@diamondguy36515 жыл бұрын
Great episode as always!
@TheKaitav5 жыл бұрын
Thanks for giving the hope, that there is a scope of improvement here. :)
@kebakent5 жыл бұрын
Something like this would seem to be well suited for cosmetic surgeries. "Given that the network has been trained on celebrities, what is the closest face resembling my own?" or "How is my face different from that of a celebrity?"
@all_so_frivolous5 жыл бұрын
Well, inception score doesn’t actually penalize overfitting. A network that just memorizes the dataset will get perfect inception score (and frechet distance as well). Afaik this is still an open problem.
@morrowindIsFun5 жыл бұрын
Really enjoyed the video!
@dtracers5 жыл бұрын
at 1:00 a lot of these images look good if you do not actually look but once you do they look really really freaky
@MsJeffreyF5 жыл бұрын
2:55 L1 distance in pixel space? So manhatten distance, absolute value of the difference between the two images (pixel_i - pixel_i). Is that really a good distance formula for images?
@TwoMinutePapers
5 жыл бұрын
That is one way of measuring it. If I remember correctly, they also recorded the images that are similar in terms of neuron activations as well.
@joery8290
5 жыл бұрын
I've read in Goodfellow et al. 2016 That L1 norm measure in image recognition is usually preferred due to being able to resemble image polarity better. It's usually more empirical tho
@hakim46795 жыл бұрын
0:11 - That's definitely Cory in The House
@tommycard4569
5 жыл бұрын
Hakim Mohamoud guess they can generate anime characters too
@AbhishekKumar-mq1tt5 жыл бұрын
Thank u for this awesome video
@SmashedHatProject5 жыл бұрын
Will there be identity generating NNs as well?
@cvspvr5 жыл бұрын
*TROOOLY EXCITING! WHAT A TIME TO BE ALIVE!!!*
@vlogsofanundergrad20345 жыл бұрын
0:15 Steven Wilson!
@jobigoud
5 жыл бұрын
All the images for the first 25 seconds are real people the network was trained on.
@samirm5 жыл бұрын
can you please do a video on google's BERT? thank you
@khongminh51684 жыл бұрын
Incredible
@WildAnimalChannel5 жыл бұрын
I'm not sure about the celebrity generator. I have a feeling it is memorising parts of faces like hair and noses and joining them together like a photo-fit. Instead of creating new hair and new noses if you see what I mean. It seems a bit too detailed to be entirely new. I would like to see it match the closest eyes and closest hair separately.
@hellfiresiayan
5 жыл бұрын
I agree. Many of the "bases" of the photos I can recognize as famous celebrity photos. For example, 3:33 top right corner is Kirsten Stewart, who had a very distinct haircut and skin tone at the time of the photo. The black guy at 3:35 has Dave Chappelle's very distinctive features. At 3:38 the fourth image on the first row uses the same frame as the Kirsten Stewart one. Finally at 3:38 as well, the 5th image in the 2nd column is 100% George Bush - it even has the Air Force One in the background. I suppose using a celeb as the base is expected since it can't be completely original, but some of the celebs have such unique features that it's hard to create new ones for the AI, I think.
@francoisrd5 жыл бұрын
Thanks for pointing out that Inception Score has limitations and is far from a perfect metric. It's important to keep information grounded to reality or we end up with unfounded hype.
@wiktormigaszewski86845 жыл бұрын
I am looking for a movie of "Two Minute Paper" with an application that generates a 3D computer game and plays against this imagining (like as if we tried to solve imagined potential tough life situations) - can anyone help me with the link?... This 3D game looked like the old Doom, with orange brick walls.
@orenong5 жыл бұрын
4:34! 166!!!
@mdoerkse2 жыл бұрын
0:40 Definitely a Hemsworth.
@artman405 жыл бұрын
How easily can deepfakes and generated faces be combined?
@alphacore43325 жыл бұрын
4:55 - it generates cheeseburgers. surrender now you fools.
@jackflynn30975 жыл бұрын
more papers to read
@lukec58385 жыл бұрын
Don't forget mode collapse.
@abcdxx10595 жыл бұрын
Are we sure these images are not copies
@TheEaglesGuru5 жыл бұрын
Am I the only one who is not that impressed by these generated images? I can clearly see which celebrities are blended in many of the cases.
@resignationify2 жыл бұрын
4:45 "truly exciting, what a time to be alive" ...really?
@AllExistence5 жыл бұрын
Can NN make people more attractive? Can such a concept be reached to it?
@blacklabelmansociety5 жыл бұрын
WHAT A TIME TO BE ALIVE
@__-tz6xx5 жыл бұрын
I can see in the far future these will all be the faces of robots. Dun dunn dunnnnn! Spooky.
@sabofx
5 жыл бұрын
Funny how the text "Dun dunn dunnnnn" doesn't contain any melody whilst I'm pretty certain i know exactly what you mean. Makes me wonder who ever came up first with that tune?
@bns15785 жыл бұрын
Interesting what would our scientists work as if AI will ever have an IQ of over 200?
@jacasch31645 жыл бұрын
can you stop panning around in these image walls. makes me dizzy like crazy. thank you
@qolio5 жыл бұрын
1st like!
@WildAnimalChannel5 жыл бұрын
Long story short: can't solve a problem? Use a neural network.
@IgorGabrielan5 жыл бұрын
generator.ai
@davidwilliston12095 жыл бұрын
Your use of English slang is over fitted :D
@mitusmusic5 жыл бұрын
dont watch this shit high.