This is my KZread channel, where I explain various topics in data analytics, data engineering, Azure data engineering, AWS data engineering (AWS Glue, Athena, S3, EMR), cloud platforms, big data, ETL, machine learning, deep learning, DevOps, Linux, and GCP, along with AI, using many real-world problem scenarios. My main aim is to make everyone familiar with these topics by showing practical examples with real case scenarios. Please subscribe and support the channel. As I love new technology, all these videos are free, and I promise to make more interesting content as we go ahead.
Mail - [email protected]
Join me t.me/+Cb98j1_fnZs3OTA1
Comments
are these tables delta tables?
Is there any way to have the csv output file name match the source json file name?
Yes, I will show that.
In the query to delete the duplicates, the query will also return the duplicated record itself, since one copy of each duplicate is ranked 1.
You can select the columns you need.
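The dedup pattern being discussed, rank duplicate rows with ROW_NUMBER() and keep only the row ranked 1, can be sketched like this. This is a minimal illustration using sqlite3 (window functions need SQLite 3.25+); the `emp` table and its columns are hypothetical, not from the video.

```python
import sqlite3

# Hypothetical table with duplicated rows.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE emp (id INTEGER, name TEXT);
    INSERT INTO emp VALUES (1, 'A'), (1, 'A'), (2, 'B'), (2, 'B'), (3, 'C');
""")

# ROW_NUMBER() ranks copies within each duplicate group; rn = 1 is the
# copy we keep, rn > 1 are the extra duplicates to delete.
rows = conn.execute("""
    SELECT id, name FROM (
        SELECT id, name,
               ROW_NUMBER() OVER (PARTITION BY id, name ORDER BY id) AS rn
        FROM emp
    )
    WHERE rn = 1
    ORDER BY id
""").fetchall()
print(rows)  # [(1, 'A'), (2, 'B'), (3, 'C')]
```

To actually delete instead of select, the same ranked subquery is used inside a DELETE, or the rn = 1 rows are written to a clean table.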
For example, frequent and infrequent.
I got 3 errors; 154 commenters didn't get a single error. Wow, great. Please help.
What is the error?
aggregate table creation error: IllegalArgumentException: All week-based patterns are unsupported since Spark 3.0, detected: e, Please use the SQL function EXTRACT instead
Not sure about this error... which part are you stuck on?
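On the error itself: since Spark 3.0, week-based datetime patterns such as 'e' are rejected by date_format, and the message suggests computing the field directly instead (dayofweek(col) or EXTRACT(DOW FROM col) in Spark SQL). As a plain-Python illustration of extracting the same week fields without a format pattern (not Spark code, just the equivalent logic):

```python
from datetime import date

# Instead of formatting with a week-based pattern, compute the ISO
# calendar fields directly, analogous to Spark's EXTRACT/dayofweek.
d = date(2023, 5, 15)  # a Monday
iso_year, iso_week, iso_day = d.isocalendar()
print(iso_week, iso_day)  # 20 1
```

In the aggregate-table query, replacing `date_format(col, 'e')` with `dayofweek(col)` (or the EXTRACT form) is the usual fix.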
Thank you! Excited for the course.
You can also follow the latest playlist.
Covered all the cases, excellent work!!
Good explanation, thanks, but none of this is working in a pipeline. Can you make a new video for all the parameterization videos (19, 20, 21) by creating a new pipeline and triggering it?
Sure, I will do that.
@@learnbydoingit Thanks, eagerly waiting for it.
1- I have created the same dataset as above by going to ADF > Author > Dataset, and it's working fine.
2- When I try to create the same dataset via a new pipeline with a copy activity, with the source (dataset) the same as in the video and the sink as blob,
3- and then try to run the pipeline, I'm getting an error: no value provided for the parameters db name and table name. In case my question is not clear, excuse me!
4- How do I do the same activity using a pipeline and trigger it?
Have you created the parameters? If you click on the blank area of the copy activity, you will see a parameter option there. Have you specified them or not?
@@learnbydoingit Yes, I have created them, but I'm still getting the error.
@@learnbydoingit Can you write down the steps for how to do it in a pipeline and trigger it? I'm stuck on this video and need to move on and complete the other videos. Thanks.
Very informative bro, thank you.
Can we rename a column, drop a column, and rearrange columns with a derived column as we did with select? If yes, then what is the difference between select and derived column?
If you have to derive a new column based on a certain expression, like summing two columns to create a new column, which one would you use? Hope you get the idea.
@@learnbydoingit Yes, got it, thanks. With select we can concat the columns.
I get the message "The file '_SUCCESS' may not render correctly as it contains an unrecognized extension." The trigger was successful, but in the container I received a file named _SUCCESS with 0 KB, and when I open it I get the above message, with no file to preview or edit.
How do I delete a table from SQL via the delete activity?
You can't delete a table through the delete activity, but if you pass a DROP TABLE statement in a SQL query, the table will be dropped.
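A minimal sketch of that workaround: issue the DROP TABLE statement through a script/query step rather than the delete activity. Illustrated here with sqlite3; the table name `staging` is hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE staging (id INTEGER)")

# IF EXISTS makes the drop idempotent, so reruns don't fail.
conn.execute("DROP TABLE IF EXISTS staging")

# Verify the table is gone.
remaining = conn.execute(
    "SELECT name FROM sqlite_master WHERE type='table'"
).fetchall()
print(remaining)  # []
```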
this video is helpful for training people who are learning to transition to cloud computing.keep posting
What does KPI mean?
KPI stands for Key Performance Indicator, which is a quantifiable metric used to track progress towards a specific business objective.
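As a quick worked example of the definition above, here is a hypothetical KPI (the figures and the metric are made up for illustration):

```python
# Hypothetical KPI: on-time delivery rate for a logistics team.
orders_delivered = 480
orders_on_time = 456

# KPI = (on-time deliveries / total deliveries) * 100
on_time_rate = orders_on_time / orders_delivered * 100
print(f"On-time delivery KPI: {on_time_rate:.1f}%")  # 95.0%
```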
Hi, can you explain how to use the aliases feature as well? Thanks for sharing your knowledge!
How many more are left in total for PySpark?
A few more are pending, and then the project.
Thanks, buddy.
Is it similar to SQL window functions?
Yes
I think we can use 'monthdiff' after WHERE in the 1st question, right? Rather than repeating the whole datediff() line.
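Usually not directly: in most SQL engines the WHERE clause is evaluated before the SELECT list, so an alias like `monthdiff` defined in SELECT can't be reused in WHERE. The standard workaround is to compute the expression once in a subquery (or CTE) and filter in the outer query. A sketch with sqlite3; the `orders` table, `months_open` column, and threshold are hypothetical stand-ins for the datediff() expression in the video.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (id INTEGER, months_open INTEGER);
    INSERT INTO orders VALUES (1, 2), (2, 7), (3, 12);
""")

rows = conn.execute("""
    SELECT id, monthdiff FROM (
        SELECT id, months_open AS monthdiff   -- compute/alias once here
        FROM orders
    )
    WHERE monthdiff > 6                       -- alias is a real column now
    ORDER BY id
""").fetchall()
print(rows)  # [(2, 7), (3, 12)]
```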
Sir, kindly add Azure Synapse videos
I'm getting the below error. While debugging, the Get Metadata activity shows no errors and all files are copied to the output folder even before the trigger; then after triggering I get this error message: Failed to run foreachitrate (Pipeline). {"code":"BadRequest","message":"ErrorCode=InvalidTemplate, ErrorMessage=The template validation failed: 'The 'runAfter' property of template action 'ForEach1Scope' is not valid. The status values for action 'Get Metadata1Scope' must be unique. Found duplicate values: 'Succeeded'","target":"pipeline/foreachitrate/runid/c2f978e0-100a-4235-a1dd-5caa8c62a25e","details":null}
Are you getting the error on the ForEach?
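For what it's worth, "Found duplicate values: 'Succeeded'" usually means the same dependency condition was added twice between the two activities (for example, two Succeeded arrows from Get Metadata to ForEach). A sketch of what the dependency section of the pipeline JSON should look like after removing the duplicate; the activity name is taken from the error message, and the surrounding JSON is assumed:

```json
"dependsOn": [
    {
        "activity": "Get Metadata1",
        "dependencyConditions": [ "Succeeded" ]
    }
]
```

Deleting the extra arrow in the designer (or the duplicate entry in the JSON view) and republishing typically clears this.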
How do we know that we have to pass only the name in the wildcard?
There will be different requirements and use cases, and we have to deal with them accordingly.
Hi, what about a single trigger? Can't we provide multiple file paths and table names?
Thank you brooo
This playlist has 58 videos. Is it enough to learn from scratch and get a job as an Azure data engineer with 3 yrs of experience? If not, please let us know what else needs to be done to achieve it.
Yes, and you also need to do SQL. We have another playlist where we cover ADF, PySpark, and SQL in depth; you can follow that too.
So grateful for this content. Thank you!
it is nice how you covered all these complex things in such a small timespan
Can you upload some tutorials for the Amazon Redshift data warehouse? And also a data engineering project which covers all these services, can you do one video on that? It would be very useful.
Sure
Bro, your content is the best so far :) Thank you so much.
Hi sir, is your PySpark playlist enough to completely learn PySpark?
Yes, we are adding more, as well as a project.
error: Dataset is using 'AzureSqlDatabase' linked service with SQLVersion 'Recommended', which is not supported in data flow.
Please watch videos 48-50 for this error.
On the second slide you mentioned "We have to build one pipeline which will transfer data and run daily." What do you mean by run daily?
A daily schedule.
@@learnbydoingit But how will the pipeline trigger daily? Because we haven't set any schedule!
Thanks
This helps convert a df to a Spark table, very useful 🙂
Can we select Central India as the region on a free subscription?
While publishing the trigger I'm getting the below error, please help: The Microsoft.EventGrid resource provider is not registered in subscription 0bc822f0-35e3-4b16-bc69-bb4d69d152d3. Register the provider in the subscription and retry the operation. Activity id: 0cc9f3e7-26da-4503-8e86-9bbca505a7f4. Please reply.
Any update on this? Please reply, thanks.
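The error above means the subscription has never registered Event Grid, which storage event triggers depend on. Assuming you have the Azure CLI installed and sufficient rights on the subscription, the commonly suggested fix is to register the provider and wait for registration to complete before republishing the trigger:

```shell
# Register the Event Grid resource provider on the current subscription.
az provider register --namespace Microsoft.EventGrid

# Check the state until it reports "Registered".
az provider show --namespace Microsoft.EventGrid --query registrationState
```

The same registration can also be done in the portal under Subscription > Resource providers.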
Hello ji, I have a question: can we publish multiple pipelines at once? Please respond on this. There is a scenario where I'm creating the If Condition pipeline, but in the input container I don't have the dept table, so I have created a pipeline for the copy activity but am unable to publish it, since the If Condition is incomplete. Please help me solve this.
If a pipeline is incomplete, then you will get an error.
Bhai, there are so many playlists for Azure Data Factory. Kindly delete the confusing ones and make a single playlist. One playlist has 72 videos and another has 52, so which one should we follow? And for Data Factory and even for projects there are many playlists. It's getting confusing for a beginner which playlist to start from.
Both are the same, bro; in the 72-video one we also have PySpark.
You can follow the 72-video one.
Can you please provide these SQL data sample resources?
When you create Azure SQL, you will get an option to select sample tables; you will get the sample data there.
Great job bro 🎉