What exactly is Apache Spark? | Big Data Tools
Ғылым және технология
What is Apache spark? And how does it fit into Big Data? How is it related to hadoop? We'll look at the architecture of spark, learn some of the key components, see how it related to other big data tools like hadoop.
⏯RELATED VIDEOS⏯
Building a Data Pipeline: • Building a Data Pipeline
------------------------------------------------------------------------------
Data Podcast ►► open.spotify.com/show/4PWmW2g...
Website ►► www.nullqueries.com/
------------------------------------------------------------------------------
🎓Data courses (Not Produced by nullQueries)🎓
Azure Data Engineering: click.linksynergy.com/deeplin...
DE Essentials, hands on: click.linksynergy.com/deeplin...
------------------------------------------------------------------------------
📷VIDEO GEAR📷
Programming Mouse: amzn.to/3zEom7f
Lighting: amzn.to/3o8tXAM
RGB light: amzn.to/3o8AQBS
USB Microphone: amzn.to/3m3hjAt
Mixer: amzn.to/2ZyqMIk
XLR Microphone: amzn.to/3AHPZ0L
💻VIDEO SOFTWARE💻
music/stock: 1.envato.market/rnX70y
------------------------------------------------------------------------------
For business inquiries please contact nullQueries@gmail.com
Some of the links in this description are affiliate links and support the channel. Thanks for the support!
------------------------------------------------------------------------------
00:00 Intro
00:25 History
00:44 Goals
00:58 Architecture
02:22 Libraries
02:57 Platforms
02:57 Comparisons
Пікірлер: 33
What's your opinion of spark? Think it's the future of data? Realtime data: kzread.info/dash/bejne/qXmpx7GDZLK5nMo.html Big Data Pipelines: kzread.info/dash/bejne/mn-qmZKzlte9Xbw.html
@tamirtsogbayar3912
2 жыл бұрын
in real time data streaming, spark is a speed layer ?
@bbishopski
2 жыл бұрын
We couldn't get it to perform. It took days to process a job. After 4 months of trying,vwe rewrote in plain java and it takes about an hour to crunch the data instead of days. Furthermore, instead of 192 CPU with 1.5tb ram, we now use minimal 2gb JVM, saving tens of thousands per month . That's my experience. As with everything else, it depends on the application and the team skills.
Nicely organised and to the point. I wish my whole academics was like this.
Love the content and info delivery! Keep up the great work 👍
Excellent video and information. Thank you for explaining this for a broader audience.
Nice introduction/presentation, concise, to the point. Thanks!
Thank you for this video . Excellent content.
Wow! Very insightful, short, informative video. Exactly how I like it
@nullQueries
2 жыл бұрын
Glad you liked it!
Thank you so much for this video. The explanation was very easy to understand!
Null queries uploads a video. Time to leave everything as is, and hit like before I watch.
That is a book's portion summarized to the point . Many thanks
Excellent vid! I learnt a lot
Great video, thank you!
Excellent content!
Very interesting video! It gave Spark capabilities in a nutshell :) Thanks
@nullQueries
2 жыл бұрын
Glad you liked it!
Great video! thanks
Tremendously amazing💖
thank you sir for making this video
Thank you so much, it really helps me get the idea behind it.
@nullQueries
2 жыл бұрын
Glad it helped!
Good video as usual
@nullQueries
2 жыл бұрын
Thanks!
Which video editing software do you use? The quality is insane!
@nullQueries
2 жыл бұрын
Thanks! I use the adobe products (Illustrator for objects, After Effects for animation, Audition for Audio).
This is the video , i am looking for short but depth knowledge of spark. I was wondering whether should i learn this or not.
OOO Best people
But what is it?!?! Is it a programming language like Python? Is it a type of hardware like a GPU? Is it a database client like MysqlWorkbench? Is it an algorithm like PageRank? Is it a computer program like excel? "Fast general purpose framework for data processing" tells me absolutely nothing. That could be talking about a library like tensorflow, an organisational strategy, a set of design principles, literally anything man.
@nullQueries
Жыл бұрын
It's a Framework (A fast, general purpose one for data). Like .Net and Ruby on rails are Frameworks. Tensorflow is also an AI and ML Framework, written in C++. But most people know it from the python library that allows python to utilize the tensorflow framework
@personalbranddata
Жыл бұрын
It's a Java application written in Scala. The Executors mentioned in the video run a JVM to do the actual work (after it was assigned by the Driver which also runs inside a JVM). So essentially Spark is a distributed Java application. In addition you have interfaces in Python (pyspark library) and R (sparkR and sparklyr libraries). The SQL interface mentioned in the video is actually a higher level abstraction built (again in Scala) on top of the Spark Core. The hierarchy of abstractions in Spark is like this. The first three ones are natively written in Scala and constitute what's called Spark: Spark Core -> Spark SQL -> Spark DataFrame (-> access via e.g. pyspark)
rdd