Python Fundamentals For Data Engineering: Create your first ETL Pipeline

Science and Technology

▬▬▬▬▬▬ T I M E S T A M P S ⏰ ▬▬▬▬▬▬
0:00 - Intro
0:25 - Extract Transform Load Example
1:05 - Importing the right packages
1:55 - Extract
2:41 - Transform
4:56 - Load
6:00 - Running first ETL pipeline
8:00 - Outro
Intro
- In data engineering, you frequently need to read and manipulate data.
- Python is the tool of choice for that.
- In this video, we will cover the Python fundamentals you need to become a data engineer.
- If you are a beginner at coding, there is no need to get overwhelmed by complexity. The idea is to start small: the more you use Python in your daily workflow, the better you will get.
- For data engineering, treat Python as a means to an end. You don't need to master it, but you will use it a lot.
In this video, I will cover a sample project to build your first ETL pipeline.
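The three stages named in the timestamps can be sketched as three small functions. This is a minimal illustration using only the standard library, not the code from the video: the sample records, the `institutions` table, and the function names are all hypothetical, and an in-memory SQLite database stands in for whatever target the real project loads into.

```python
import json
import sqlite3

# Hypothetical sample payload standing in for data fetched from an API or file.
RAW_JSON = """[
  {"name": " Alpha University ", "state": "California"},
  {"name": "Beta College", "state": null},
  {"name": "Gamma Institute", "state": "California"}
]"""


def extract(raw):
    """Extract: parse raw JSON text into a list of dict records."""
    return json.loads(raw)


def transform(records, state):
    """Transform: keep records for one state and tidy up the name field."""
    return [
        (record["name"].strip(), record["state"])
        for record in records
        if record.get("state") == state
    ]


def load(rows, conn):
    """Load: write the rows to a table and return the table's row count."""
    conn.execute("CREATE TABLE IF NOT EXISTS institutions (name TEXT, state TEXT)")
    conn.executemany("INSERT INTO institutions VALUES (?, ?)", rows)
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM institutions").fetchone()[0]


def run_pipeline(raw, state, conn):
    """Chain the three stages: extract, then transform, then load."""
    return load(transform(extract(raw), state), conn)
```

Calling `run_pipeline(RAW_JSON, "California", sqlite3.connect(":memory:"))` runs all three stages in order. Keeping each stage as its own function makes it easy to test or replace one stage without touching the others.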
GitHub Link: github.com/syalanuj/youtube/b...
FOLLOW ME ON
MEDIUM: / syal.anuj​
INSTAGRAM: / ​
TWITTER: / anuj_syal
GITHUB: github.com/syalanuj​
WEBSITE: anujsyal.com
#python #dataengineering #bigdata #etl #extract #transform #load #tutorial

Comments: 26

  • @nanayaw7159
    @nanayaw7159 1 year ago

    This is really awesome

  • @snapfade
    @snapfade 11 months ago

    Excellent video, very clear

  • @Lerahul
    @Lerahul 1 year ago

    Great content Anuj!

  • @parkerrex
    @parkerrex 8 months ago

    Super helpful, thanks :-)

  • @PradeepPenn
    @PradeepPenn 1 year ago

    Thank you Anuj!

  • @travelingsandy900
    @travelingsandy900 8 months ago

    That was a very good explanation

  • @_Machiavel_
    @_Machiavel_ 3 months ago

    excellent video

  • @wayne_vt
    @wayne_vt 1 year ago

    Very helpful

  • @calvinallen8425
    @calvinallen8425 9 months ago

    Great video and content! I want to ask how to set up two databases, one the AdventureWorks database and one a staging database, and how to write the Python code for the initial ETL and the incremental ETL in the extraction phase. Thank you.

  • @AnujSyal

    @AnujSyal

    6 months ago

    See which pip packages support these databases. I would start with psycopg2. As a last resort, you can also check whether these databases expose any APIs and connect to those from Python.
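One common way to handle the initial versus incremental extraction asked about above is a stored "watermark": the first run copies everything, and later runs copy only rows changed since the last recorded value. The sketch below is an assumption-heavy illustration, not the author's method. It uses the standard-library `sqlite3` module as a stand-in for both databases, and the `orders`, `stg_orders`, and `etl_watermark` tables and the `updated_at` column are hypothetical. With psycopg2 the same idea applies, but the query placeholder is `%s` rather than `?` and the upsert syntax differs.

```python
import sqlite3


def setup_staging(staging):
    """Create the staging table and the watermark table if they are missing."""
    staging.execute(
        "CREATE TABLE IF NOT EXISTS stg_orders "
        "(id INTEGER PRIMARY KEY, amount REAL, updated_at TEXT)"
    )
    staging.execute(
        "CREATE TABLE IF NOT EXISTS etl_watermark "
        "(table_name TEXT PRIMARY KEY, last_value TEXT)"
    )


def get_watermark(staging, table_name):
    """Return the last extracted updated_at value, or None before the first run."""
    row = staging.execute(
        "SELECT last_value FROM etl_watermark WHERE table_name = ?", (table_name,)
    ).fetchone()
    return row[0] if row else None


def extract_orders(source, staging):
    """Copy new or changed rows from source to staging; return how many were copied."""
    setup_staging(staging)
    watermark = get_watermark(staging, "orders")
    if watermark is None:
        # Initial ETL: no watermark yet, so take every row.
        rows = source.execute(
            "SELECT id, amount, updated_at FROM orders ORDER BY updated_at"
        ).fetchall()
    else:
        # Incremental ETL: only rows modified after the stored watermark.
        rows = source.execute(
            "SELECT id, amount, updated_at FROM orders "
            "WHERE updated_at > ? ORDER BY updated_at",
            (watermark,),
        ).fetchall()
    if rows:
        staging.executemany("INSERT OR REPLACE INTO stg_orders VALUES (?, ?, ?)", rows)
        # Rows are ordered by updated_at, so the last row holds the new watermark.
        staging.execute(
            "INSERT OR REPLACE INTO etl_watermark VALUES ('orders', ?)", (rows[-1][2],)
        )
        staging.commit()
    return len(rows)
```

This relies on the source table having a reliable last-modified column stored in a sortable format such as ISO 8601 text. Without one, a full reload or a change-tracking feature of the database would be needed instead.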

  • @brookster7772
    @brookster7772 5 months ago

    Nice video. What editor plug-in are you using? I love it.

  • @AnujSyal

    @AnujSyal

    5 months ago

    I am using iMovie

  • @brookster7772

    @brookster7772

    5 months ago

    @@AnujSyal thanks man I’m new to python and I was talking about your notebook file in visual studio code. Thanks man.

  • @AnujSyal

    @AnujSyal

    5 months ago

    @@brookster7772 That comes built in with VS Code. Use the following syntax to create a cell: # %%
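As a small illustration of the `# %%` marker mentioned above, a plain `.py` file can be split into notebook-style cells like this. The variable names are made up for the example, and it assumes VS Code has the Python and Jupyter extensions installed so that each cell gets its own run button.

```python
# %%
# First cell: define some data.
numbers = [1, 2, 3]

# %%
# Second cell: use what the first cell defined.
total = sum(numbers)
print(total)  # prints 6
```

The file is still ordinary Python, so it also runs top to bottom with `python file.py`. The markers are only comments to the interpreter.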

  • @LordPyro
    @LordPyro 2 months ago

    Great work! when I try to run import requests, it says No module named requests - what do I need to import to be able to import requests?

  • @savichopra9083

    @savichopra9083

    15 days ago

    pip install requests

  • @jeanfabraruiz7994
    @jeanfabraruiz7994 11 months ago

    This is cool. Only one thing is missing: how to orchestrate this and set up automation.

  • @AnujSyal

    @AnujSyal

    11 months ago

    Thanks for the suggestion. I do have a video on Docker and Airflow; maybe that one can give you some ideas about automation.

  • @Rock6350
    @Rock6350 1 year ago

    Hi Anuj, when I try to code this in Google Colab, I get the error "name 'transform' is not defined".

  • @nakul--

    @nakul--

    1 year ago

    make sure you have imported it correctly at the beginning of your code. import via : from sklearn.preprocessing import transform

  • @AnujSyal

    @AnujSyal

    1 year ago

    Yeah Nakul is right, it looks like an import error
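Another possible cause is worth checking, assuming `transform` is a function you define yourself in the pipeline rather than something imported: in a notebook, calling a function before the cell that defines it has run raises exactly this `NameError`. As far as I know, `sklearn.preprocessing` has no top-level function named `transform`, so the import suggested above may fail rather than fix the problem. The sketch below uses hypothetical names to reproduce the error and then avoid it.

```python
def demo_name_error():
    """Call a name that was never defined and return the error message."""
    try:
        undefined_step([1, 2, 3])  # hypothetical name, intentionally not defined
    except NameError as exc:
        return str(exc)


def transform(records):
    """Toy transform step: drop missing values from a list."""
    return [record for record in records if record is not None]


# Once the definition above has run, the call succeeds.
cleaned = transform([1, None, 3])
```

In Colab, running the cell that contains `def transform(...)` before the cell that calls it (or using Runtime > Run all) is usually enough.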

  • @iksukekw
    @iksukekw 9 months ago

    How did you open the data viewer to see the actual DataFrame?

  • @AnujSyal

    @AnujSyal

    9 months ago

    It comes with the setup for running Jupyter notebooks. You can see Variables as part of the Jupyter environment and open each DataFrame to check the values in memory. It is very similar to Spyder.

  • @nitinmadan4009
    @nitinmadan4009 3 months ago

    SQL on Steroids is pandas, that was a Good one 😅

  • @nakul--
    @nakul-- 1 year ago

    haha pandas is like Excel on steroids! lolllll

  • @manojghuge5916
    @manojghuge5916 6 months ago

    All good except the fake accent
