What exactly is Apache Spark? | Big Data Tools

Ғылым және технология

What is Apache spark? And how does it fit into Big Data? How is it related to hadoop? We'll look at the architecture of spark, learn some of the key components, see how it related to other big data tools like hadoop.
⏯RELATED VIDEOS⏯
Building a Data Pipeline: • Building a Data Pipeline
------------------------------------------------------------------------------
Data Podcast ►► open.spotify.com/show/4PWmW2g...
Website ►► www.nullqueries.com/
------------------------------------------------------------------------------
🎓Data courses (Not Produced by nullQueries)🎓
Azure Data Engineering: click.linksynergy.com/deeplin...
DE Essentials, hands on: click.linksynergy.com/deeplin...
------------------------------------------------------------------------------
📷VIDEO GEAR📷
Programming Mouse: amzn.to/3zEom7f
Lighting: amzn.to/3o8tXAM
RGB light: amzn.to/3o8AQBS
USB Microphone: amzn.to/3m3hjAt
Mixer: amzn.to/2ZyqMIk
XLR Microphone: amzn.to/3AHPZ0L
💻VIDEO SOFTWARE💻
music/stock: 1.envato.market/rnX70y
------------------------------------------------------------------------------
For business inquiries please contact nullQueries@gmail.com
Some of the links in this description are affiliate links and support the channel. Thanks for the support!
------------------------------------------------------------------------------
00:00 Intro
00:25 History
00:44 Goals
00:58 Architecture
02:22 Libraries
02:57 Platforms
02:57 Comparisons

Пікірлер: 33

  • @nullQueries
    @nullQueries2 жыл бұрын

    What's your opinion of spark? Think it's the future of data? Realtime data: kzread.info/dash/bejne/qXmpx7GDZLK5nMo.html Big Data Pipelines: kzread.info/dash/bejne/mn-qmZKzlte9Xbw.html

  • @tamirtsogbayar3912

    @tamirtsogbayar3912

    2 жыл бұрын

    in real time data streaming, spark is a speed layer ?

  • @bbishopski

    @bbishopski

    2 жыл бұрын

    We couldn't get it to perform. It took days to process a job. After 4 months of trying,vwe rewrote in plain java and it takes about an hour to crunch the data instead of days. Furthermore, instead of 192 CPU with 1.5tb ram, we now use minimal 2gb JVM, saving tens of thousands per month . That's my experience. As with everything else, it depends on the application and the team skills.

  • @jamestom3168
    @jamestom31682 жыл бұрын

    Nicely organised and to the point. I wish my whole academics was like this.

  • @MrThuyphuc
    @MrThuyphuc Жыл бұрын

    Love the content and info delivery! Keep up the great work 👍

  • @challengerssb
    @challengerssb2 жыл бұрын

    Excellent video and information. Thank you for explaining this for a broader audience.

  • @paragchaudhari4087
    @paragchaudhari4087 Жыл бұрын

    Nice introduction/presentation, concise, to the point. Thanks!

  • @TR4008
    @TR40082 жыл бұрын

    Thank you for this video . Excellent content.

  • @neb1578
    @neb15782 жыл бұрын

    Wow! Very insightful, short, informative video. Exactly how I like it

  • @nullQueries

    @nullQueries

    2 жыл бұрын

    Glad you liked it!

  • @cg8770
    @cg8770 Жыл бұрын

    Thank you so much for this video. The explanation was very easy to understand!

  • @coding3438
    @coding34382 жыл бұрын

    Null queries uploads a video. Time to leave everything as is, and hit like before I watch.

  • @mohammedal-weez6754
    @mohammedal-weez6754 Жыл бұрын

    That is a book's portion summarized to the point . Many thanks

  • @ADAMSIVES
    @ADAMSIVES Жыл бұрын

    Excellent vid! I learnt a lot

  • @tehb1726
    @tehb17262 жыл бұрын

    Great video, thank you!

  • @arthd5683
    @arthd5683 Жыл бұрын

    Excellent content!

  • @AndiBreshanaj
    @AndiBreshanaj2 жыл бұрын

    Very interesting video! It gave Spark capabilities in a nutshell :) Thanks

  • @nullQueries

    @nullQueries

    2 жыл бұрын

    Glad you liked it!

  • @bryan.canelas
    @bryan.canelas8 ай бұрын

    Great video! thanks

  • @edwardvonrueden4663
    @edwardvonrueden46639 ай бұрын

    Tremendously amazing💖

  • @ranjeetkumar2051
    @ranjeetkumar2051 Жыл бұрын

    thank you sir for making this video

  • @youssefelmrabet7033
    @youssefelmrabet70332 жыл бұрын

    Thank you so much, it really helps me get the idea behind it.

  • @nullQueries

    @nullQueries

    2 жыл бұрын

    Glad it helped!

  • @sued12345
    @sued123452 жыл бұрын

    Good video as usual

  • @nullQueries

    @nullQueries

    2 жыл бұрын

    Thanks!

  • @WhosShamouz
    @WhosShamouz2 жыл бұрын

    Which video editing software do you use? The quality is insane!

  • @nullQueries

    @nullQueries

    2 жыл бұрын

    Thanks! I use the adobe products (Illustrator for objects, After Effects for animation, Audition for Audio).

  • @deathseal9844
    @deathseal98442 жыл бұрын

    This is the video , i am looking for short but depth knowledge of spark. I was wondering whether should i learn this or not.

  • @SamDawson-gv4rm
    @SamDawson-gv4rm11 ай бұрын

    OOO Best people

  • @neildutoit5177
    @neildutoit5177 Жыл бұрын

    But what is it?!?! Is it a programming language like Python? Is it a type of hardware like a GPU? Is it a database client like MysqlWorkbench? Is it an algorithm like PageRank? Is it a computer program like excel? "Fast general purpose framework for data processing" tells me absolutely nothing. That could be talking about a library like tensorflow, an organisational strategy, a set of design principles, literally anything man.

  • @nullQueries

    @nullQueries

    Жыл бұрын

    It's a Framework (A fast, general purpose one for data). Like .Net and Ruby on rails are Frameworks. Tensorflow is also an AI and ML Framework, written in C++. But most people know it from the python library that allows python to utilize the tensorflow framework

  • @personalbranddata

    @personalbranddata

    Жыл бұрын

    It's a Java application written in Scala. The Executors mentioned in the video run a JVM to do the actual work (after it was assigned by the Driver which also runs inside a JVM). So essentially Spark is a distributed Java application. In addition you have interfaces in Python (pyspark library) and R (sparkR and sparklyr libraries). The SQL interface mentioned in the video is actually a higher level abstraction built (again in Scala) on top of the Spark Core. The hierarchy of abstractions in Spark is like this. The first three ones are natively written in Scala and constitute what's called Spark: Spark Core -> Spark SQL -> Spark DataFrame (-> access via e.g. pyspark)

  • @brothermalcolm
    @brothermalcolm Жыл бұрын

    rdd

Келесі