Data Analysis 5: Data Reduction - Computerphile
Too much data? Dr Mike Pound on how best to reduce your dataset. This is part 5 of the Data Analysis Learning Playlist: • Data Analysis with Dr ...
This Learning Playlist was designed by Dr Mercedes Torres-Torres & Dr Michael Pound of the University of Nottingham Computer Science Department. Find out more about Computer Science at Nottingham here: bit.ly/2IqwtNg
This series was made possible by sponsorship from by Google.
The music dataset can be found here: github.com/mdeff/fma
/ computerphile
/ computer_phile
This video was filmed and edited by Sean Riley.
Computer Science at the University of Nottingham: bit.ly/nottscomputer
Computerphile is a sister project to Brady Haran's Numberphile. More at www.bradyharan.com
Пікірлер: 37
Check out the full Data Analysis Learning Playlist: kzread.info/head/PLzH6n4zXuckpfMu_4Ff8E7Z1behQks5ba
This playlist is a goldmine
Amazing series, great topic, great teacher! I'm actually excited for next semester now where I get to do this stuff in class!
4:08 artist_hotttnesss - LOL
When I see Dr. Mike in thumbnail, the time that it takes to click on a video is greatly reduced.
Really an impressive playlist. Wow dude. A huge public service.
one viable definition of 'big data' is, when it crashes excel... making you wait an unreasonable amount of time, when processed in R, could be another one.
Thanks for the volume consistency
Great series
When the intro sounds like an intro of either a reality TV show or a video game. Great work Computerphile.
great vid!
Best of luck 😎
Where is the link to this metadata of music tracks? It's not in the link in the description..
Angle shot - that's new
i like this dude
KZread: How about watching 6 videos at once????
What's the difference from data cleaning?
@4.0.4
5 жыл бұрын
I think cleaning is about getting rid or filling in data, and reduction is about summarizing, think the map-reduce stuff.
Have you tried fread() ... Slightly faster to read csv
@quangho8120
5 жыл бұрын
Does it, like more than 2 time speeds up the process?
@JOHNSMITH-ve3rq
2 жыл бұрын
Way faster
PCA, p-value
In the previous episodes, R was okay. But for this one, you should've used Splunk.
@supersu6138
3 жыл бұрын
Or spark
This wonky camera angle is very uncomfortable
Can you please share csv and R files? 😇
Nice explanation, would have been even more useful if it would be in python.
Being French, the way Dr Pound pronounces "genre" seems so weird.
@Abby_Liu
4 жыл бұрын
how,, are you supposed to say it
@supersu6138
3 жыл бұрын
Jon ra
2:26 wait a second, does Dr Mike Pound have Heterochromia iridum ???
@alakhdar100
4 жыл бұрын
that's what i expected to happen when people watch a playlist of data analysis.
Not complaining, but the professor is so handsome that I can't really focus... 😂😤🙈🤭
You're completely wrong about how Spotify finds recommendations - they don't analyse the music whatsoever (like Pandora did back in the day) they just go on what other people put in their playlists and find people who are similar to you and pick songs from their playlists that aren't in yours.
@Joel-if2bg
3 жыл бұрын
He's not wrong at all. The Echo Nest is a Spotify subsidiary acquired in 2014 and uses exactly the techniques he outlined in conjunction with what you said (and other algorithms) to recommend music.
Keep your camera straight, please!