Assumptions of Linear Regression | What are the assumptions for a linear regression model

Assumptions of Linear Regression | What are the assumptions for a linear regression model
#AssumptionsOfLinerRegression #UnfoldDataScience
Hello ,
My name is Aman and I am a Data Scientist.
About this video:
In this video, I explain about assumptions of linear regression. I explain about the importance of assumptions of linear regression in this video. I also explain why this topic is so important from interview point of view.
Below topics are explained well in this video:
1. Assumptions of Linear Regression
2. What are the assumptions of linear regression
3. Linear regression assumptions machine learning
4. Basic assumptions of linear regression
5. Simple linear regression assumptions
About Unfold Data science: This channel is to help people understand basics of data science through simple examples in easy way. Anybody without having prior knowledge of computer programming or statistics or machine learning and artificial intelligence can get an understanding of data science at high level through this channel. The videos uploaded will not be very technical in nature and hence it can be easily grasped by viewers from different background as well.
If you need Data Science training from scratch . Please fill this form (Please Note: Training is chargeable)
docs.google.com/forms/d/1Acua...
Book recommendation for Data Science:
Category 1 - Must Read For Every Data Scientist:
The Elements of Statistical Learning by Trevor Hastie - amzn.to/37wMo9H
Python Data Science Handbook - amzn.to/31UCScm
Business Statistics By Ken Black - amzn.to/2LObAA5
Hands-On Machine Learning with Scikit Learn, Keras, and TensorFlow by Aurelien Geron - amzn.to/3gV8sO9
Ctaegory 2 - Overall Data Science:
The Art of Data Science By Roger D. Peng - amzn.to/2KD75aD
Predictive Analytics By By Eric Siegel - amzn.to/3nsQftV
Data Science for Business By Foster Provost - amzn.to/3ajN8QZ
Category 3 - Statistics and Mathematics:
Naked Statistics By Charles Wheelan - amzn.to/3gXLdmp
Practical Statistics for Data Scientist By Peter Bruce - amzn.to/37wL9Y5
Category 4 - Machine Learning:
Introduction to machine learning by Andreas C Muller - amzn.to/3oZ3X7T
The Hundred Page Machine Learning Book by Andriy Burkov - amzn.to/3pdqCxJ
Category 5 - Programming:
The Pragmatic Programmer by David Thomas - amzn.to/2WqWXVj
Clean Code by Robert C. Martin - amzn.to/3oYOdlt
My Studio Setup:
My Camera : amzn.to/3mwXI9I
My Mic : amzn.to/34phfD0
My Tripod : amzn.to/3r4HeJA
My Ring Light : amzn.to/3gZz00F
Join Facebook group :
groups/41022...
Follow on medium : / amanrai77
Follow on quora: www.quora.com/profile/Aman-Ku...
Follow on twitter : @unfoldds
Get connected on LinkedIn : / aman-kumar-b4881440
Follow on Instagram : unfolddatascience
Watch Introduction to Data Science full playlist here : • Data Science In 15 Min...
Watch python for data science playlist here:
• Python Basics For Data...
Watch statistics and mathematics playlist here :
• Measures of Central Te...
Watch End to End Implementation of a simple machine learning model in Python here:
• How Does Machine Learn...
Learn Ensemble Model, Bagging and Boosting here:
• Introduction to Ensemb...
Build Career in Data Science Playlist:
• Channel updates - Unfo...
Artificial Neural Network and Deep Learning Playlist:
• Intuition behind neura...
Natural langugae Processing playlist:
• Natural Language Proce...
Understanding and building recommendation system:
• Recommendation System ...
Access all my codes here:
drive.google.com/drive/folder...
Have a different question for me? Ask me here : docs.google.com/forms/d/1ccgl...
My Music: www.bensound.com/royalty-free...

Пікірлер: 124

  • @UnfoldDataScience
    @UnfoldDataScience2 жыл бұрын

    In this lecture, 3rd point is "NO heteroscedasticity" , please do not confuse

  • @manushrivetal6098

    @manushrivetal6098

    2 жыл бұрын

    3. Homoscedasticity is one of the most critical It states that there should be an equal distribution of errors. It is not Heteroscedasticity

  • @jeremiahayegbusi2269
    @jeremiahayegbusi2269 Жыл бұрын

    best video on assumptions of OLS on KZread

  • @interestingstudies4422
    @interestingstudies44222 жыл бұрын

    Becoming huge fan of this channel..grt explaination

  • @gauravkamble9702
    @gauravkamble97023 жыл бұрын

    You explained it in the easiest manner possible! Thanks for sharing this, Sir 😊

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Welcome Gaurav.

  • @kratiagrawal2742
    @kratiagrawal2742 Жыл бұрын

    Hi, Could you please answer how should we approach this situation in regression problem: The target variable is distributed in a biased manner(50% of the values lie in the range 0-300 and 30% in 300-500 and 10% in remaining 500-1000) , how will you approach such scenario?

  • @sktarikaziz6529
    @sktarikaziz65292 жыл бұрын

    Good explanation, Thanks

  • @prachigupta1963
    @prachigupta19633 жыл бұрын

    Clear explanation in a very less time . Thank you 😊👍

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Thanks Prachi. Keep watching.

  • @nayanparnami8554
    @nayanparnami85542 жыл бұрын

    Very precise and very informative!! thank you sharing for this,Sir...

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks Nayan.

  • @salikmalik7631
    @salikmalik76313 жыл бұрын

    great video. we need videos like that sir, (Assumptions of other algorithms)...

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Thanks Salik. Will try to create.

  • @UnfoldDataScience
    @UnfoldDataScience2 жыл бұрын

    Access Hindi, English courses here- www.unfolddatascience.com/s/store Plz register on the website

  • @ajaybandlamudi2932
    @ajaybandlamudi29322 жыл бұрын

    could you please answer my question What is the similarities and the difference between a generalised linear model(Glm) and gradient boosting machine(Gbm)?

  • @deepakadik1210
    @deepakadik12102 жыл бұрын

    Very Crystal clear explanation !! Hoping for more of such content.

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks Deepak.

  • @RohanDreamerz
    @RohanDreamerz3 жыл бұрын

    Great Explanation! So much clarity on concepts.

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Glad it was helpful Rohan.

  • @anbesivam7686
    @anbesivam76863 жыл бұрын

    I am actively searching job. Sometime I feel like I won't get a job but after watching your videos I feel really learned something and it's give some confident. Thanks for the video. Please keep sharing videos.

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Best of luck buddy. you will do it. mark my words. cheers!

  • @amitbudhiraja7498
    @amitbudhiraja74982 жыл бұрын

    Can u make the live demo video to check all of these assumptions on the data on dataset

  • @rahulramachandran761
    @rahulramachandran7613 жыл бұрын

    You deserve more subscribers aman.. Your teachings are very good👍👍

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Thanks a lot Rahul.

  • @Duke_1978et
    @Duke_1978et2 жыл бұрын

    how i can check assumption in multiple linear Regression with categorical independent variable?

  • @yoyomovieclips8813
    @yoyomovieclips88133 жыл бұрын

    clean,simple and precise explaination

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Thanks Neeraj.

  • @siddhawan5190
    @siddhawan5190 Жыл бұрын

    You explanation is very good and easy to understand. Thanks for this awesome video

  • @UnfoldDataScience

    @UnfoldDataScience

    Жыл бұрын

    Glad it was helpful!

  • @AMVSAGOs
    @AMVSAGOs Жыл бұрын

    Hi Aman, I think we need strong justification for point 3,4,5. Why it should not happen??. I was asked in an interview and I was not able to justify point 3,4,5 . Could you please elaborate little more on these points.

  • @annonymous6555
    @annonymous65552 жыл бұрын

    Please make a video on assumptions of logistic regression

  • @developerboy8341
    @developerboy83413 жыл бұрын

    Very good explanations, keep it up.

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks, will do!

  • @RamanKumar-ss2ro
    @RamanKumar-ss2ro3 жыл бұрын

    Amazing content.

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Thanks a lot :)

  • @peterjohn2899
    @peterjohn2899 Жыл бұрын

    Sir..please explain whether the points 2 and 6 are different or not? why?

  • @nickrogers4408
    @nickrogers44083 жыл бұрын

    So clear and informatic❤️

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Glad you think so Sai.

  • @abhinavkale4632
    @abhinavkale46323 жыл бұрын

    It will be really helpful if you can provide a video lecture in which you put all the assumptions to a test on a Kaggle dataset(any). Cheers.. great work sir..

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Hi Abhinav, All the assumptions in the test means?

  • @abhinavkale4632

    @abhinavkale4632

    3 жыл бұрын

    @@UnfoldDataScience all the assumptions as in multicollinearity, normality of residuals, autocorrelation.. all these assumptions applied on real dataset (basically executing all the assumptions in python)

  • @raofaizanali1548
    @raofaizanali15482 жыл бұрын

    Lovely Brother ..Thanks

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks a lot.

  • @nivednambiar6845
    @nivednambiar68452 жыл бұрын

    good way of explanation

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks Nived

  • @ashulohar8948
    @ashulohar8948 Жыл бұрын

    Please make a vedio on end to end projects of all the algos

  • @sandipansarkar9211
    @sandipansarkar92112 жыл бұрын

    finished watching

  • @nagnathsatav9978
    @nagnathsatav99783 жыл бұрын

    Thank u for this video, clean explanation, waiting for this. I want video on kaggle specially using submission file using smothe technique.

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Thanks Nagnath. Sure :)

  • @manushrivetal6098
    @manushrivetal60982 жыл бұрын

    Yor are always great

  • @sandhya_exploresfoodandlife
    @sandhya_exploresfoodandlife3 жыл бұрын

    Good explanation! thank you!

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Thanks Sandhya.

  • @sadhnarai8757
    @sadhnarai87573 жыл бұрын

    Very much needed .

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Thanks a lot :)

  • @umeshrawat8827
    @umeshrawat88273 ай бұрын

    Why are these called Assumptions? They seem to be the mandatory conditions to ensure better regression model...?

  • @shadiyapp5552
    @shadiyapp5552 Жыл бұрын

    Thank you sir ♥️

  • @UnfoldDataScience

    @UnfoldDataScience

    Жыл бұрын

    Most welcome

  • @deangibson5283
    @deangibson52832 жыл бұрын

    Very easy to understand. Please provide a video about RMSE

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    I have video of R squared, not specific to RMSE, will create one.

  • @divyanshuchaudhari5416
    @divyanshuchaudhari54163 жыл бұрын

    Good Explanation.

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Thanks Divyanshu.

  • @sreevidyahothur2313
    @sreevidyahothur23133 жыл бұрын

    Sir, shall I ask a doubt? Should we consider multicollinearity in multiple linear regression, on a time series financial data?

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Yes Srividya, if you are training a regression model.

  • @ultra_legend23
    @ultra_legend233 жыл бұрын

    should the data have homescedasticity or hetero? Other blogs says it should have homoscedasticity. Please clarify. Thanks

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    No Hetero, that is what I gave example and said it should not happen - may be missed to write "NO" before hetero in 3rd point

  • @anirbansarkar6306
    @anirbansarkar63062 жыл бұрын

    Thank you so much for this wonder content. It was really helpful. In multicolinearity part, I have a small doubt. I understood through your example, it is better to remove one feature out of 2 if they are positively correlated. Does the same applies for negatively correated features too? I mean shall I drop one feature, in case two features are positively correlated?

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks Anirban. Postive/Negative correlation does not matter, we should not keep two variables which are highly correlated.(any direction)

  • @sandipansarkar9211
    @sandipansarkar92113 жыл бұрын

    great explanation

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Thanks Sandipan.

  • @yoyomovieclips8813
    @yoyomovieclips88133 жыл бұрын

    In next video please tell us how to check these assumptions and how to correct the data to follow these assumptions

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Sure Neeraj, Will do.

  • @AshokKumar-rh7ey
    @AshokKumar-rh7ey3 жыл бұрын

    My data has no multicolinerity ( value is 1) and is not normal. Can i run regression ?? Plz answer

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    You can run but model may not be very robust.

  • @ankurdubey960
    @ankurdubey9603 жыл бұрын

    you should explain why in the last three assumptions.

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Sure, thanks for feedback.

  • @_itachi7904
    @_itachi79043 жыл бұрын

    excellent explanation. can you demonstrate a linear regression?

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    kzread.info/dash/bejne/aoR61paDmdGdcpM.html

  • @firassami7399
    @firassami73993 жыл бұрын

    Thanks I just have couple equestion . 1- What is the disadvantages of multicolinearity 2- in several cases, the distibutiin of error vector is not following the normat distribution. How can I deal with that

  • @AshokKumar-rh7ey

    @AshokKumar-rh7ey

    3 жыл бұрын

    The same question i do have..

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    I explained in multicollinearity video. kzread.info/dash/bejne/l5-pppqDhceyZrA.html

  • @rohitbhosale4614
    @rohitbhosale46143 жыл бұрын

    Thanks! Please explain 1st assumption

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Sure.

  • @luistorres7297
    @luistorres72972 жыл бұрын

    Will (or could) heteroskedasticity imply autocorrelation?

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    here heteroskedasticity is in context of error terms. It will not mean autocorrelation.

  • @yashpandey5484
    @yashpandey54843 жыл бұрын

    How to remove multicollinearity from tha data set if the features are highly correlated can we solve the problem without removing any features or any information loss does PCA helfull?

  • @abhinavkale4632

    @abhinavkale4632

    3 жыл бұрын

    I think we can use VIF "Variance inflation factor" and then decide which features should be included in the model. In addition, we also need to check the significance value from the OLS regression model. There is a threshold limit(generally for VIF

  • @arvindadari3390

    @arvindadari3390

    2 жыл бұрын

    PCA will definitely help to tackle the multicollinearity but will loose out Interpretability.

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    To Answer question from Yash, there is very low information loss if you remove a highly correlated variable.

  • @ramub7657
    @ramub76572 жыл бұрын

    Awesome explanation. Thank you Aman. Have one question, isn't assumptions 3 and 4 are same Heteroscedasticity and No auto correlation of errors?

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    No, not same,

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    It's little long explanation, sorry to not able to write, will talk in live or interviews

  • @sowjiadabala

    @sowjiadabala

    2 жыл бұрын

    @@UnfoldDataScience Sir have you explained this in any of your videos ?

  • @dsklife
    @dsklife2 жыл бұрын

    Can you help me with regression models with multi-dimensional data?

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Its there in my channel. Search with channel name and topic

  • @sharanm5718
    @sharanm57182 жыл бұрын

    Similar can we have any assumptions and limitations to logistic regression

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Yes Sharan. That comes also under regression umbrella.

  • @veenap3682
    @veenap36822 жыл бұрын

    sir can u please provide the notes

  • @genai-guru
    @genai-guru2 жыл бұрын

    1. Linear relationship 2. Very low/No multicollinearity (independent variable correlation each other 3. heteroscedasticity 4. No autocorrelation Normally distributed error 5. All the observations are independent of the each other

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks. 3rd point - " No heteroscedasticity"

  • @abhishekanand5898
    @abhishekanand58982 жыл бұрын

    Because of this question, I was rejected in final round interview of ZS-Data Science Associate😂

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Now that you have understood, you will do great 😊

  • @spandanswain2879
    @spandanswain28793 жыл бұрын

    Does it applicable for all regression model or only in linear regression models?

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    All regression based model mostly.

  • @leecreations9133
    @leecreations91335 ай бұрын

    Explanations is ok but Writing on the board is provide notes for the students it's my suggestion

  • @arvindadari3390
    @arvindadari33902 жыл бұрын

    Just a note. The relationship between dependent and independent variables should be linear, linear in terms of coefficients but not in variables. When we are doing polynomial regression, the linearity between variable with target will not hold true. As we have raised power terms.

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks Arvind.

  • @rajeshvenaganti6797
    @rajeshvenaganti67973 жыл бұрын

    what is mean by linear relation? this is the interview question

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Linear relation means y = mx+c kind of relation

  • @pixelff5044
    @pixelff50443 жыл бұрын

    Is this a assumption of OLS

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Yes. OLS is the internal methodology for Linear regression(If we don't use gradient descent).

  • @sanketsanap1076
    @sanketsanap10762 жыл бұрын

    What are assumptions for Classification?

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    if its logistic regression u are asking about it will be mostly same

  • @RajdeepBorgohainRajdeep
    @RajdeepBorgohainRajdeep2 жыл бұрын

    I got this question today and couldn't answer other assumptions! I felt really very bad about myself

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Np. it happens witj all of us. Keep trying.

  • @RajdeepBorgohainRajdeep

    @RajdeepBorgohainRajdeep

    2 жыл бұрын

    @@UnfoldDataScience thanks for motivating Aman :)

  • @shreyashyadav1521
    @shreyashyadav15213 жыл бұрын

    Not able to get a job in field of Data science. Dont know what to do.. Frustated !!

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Hi Shreyash, lease make your resume strong. Learn new stuff, approach people for more opportunities also.

  • @minhazuddin5169
    @minhazuddin51693 жыл бұрын

    You are extremely good in teaching I am looking for. Can I have your email address? I am from Bangladesh, beginner in research (M.Phil). I'm struggling in some topics of data analysis. I would like to contact with you if you approve. Thanks

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    please connect on LinkedIn

  • @kalam_indian
    @kalam_indian2 жыл бұрын

    can you explain by taking a real life example more deeply because whatever you explained are the basic things with no depth explanation, so if possible please explain deeply by taking a good example even if the video becomes longer

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks for the suggestion, Will definitely take in consideration.

  • @TJ-wo1xt
    @TJ-wo1xt2 жыл бұрын

    very generic explanation, u should cover them in a little detail

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks for feedback, will try to cover.

Келесі