Imbalanced Data in Machine Learning | Undersampling | Oversampling | SMOTE

Imbalanced data refers to datasets where the class distribution is heavily skewed, with one class significantly outnumbering the others. Handling the imbalance matters because a model trained on such data tends to be biased toward the majority class and to perform poorly on the minority classes. This video covers four ways to address class imbalance: undersampling (reducing majority-class samples), oversampling (increasing minority-class samples), SMOTE (Synthetic Minority Over-sampling Technique, which generates synthetic minority samples), and ensemble methods (combining multiple models). These techniques help mitigate bias and improve predictive performance on minority classes.
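For readers who want to see the resampling ideas in code, here is a minimal sketch in plain Python (standard library only). It is not the notebook from the video. The function names `random_undersample`, `random_oversample`, and `smote_like`, the parameter `k`, and the 90:10 toy dataset are all assumptions made for illustration. `smote_like` is a simplified interpolation in the spirit of SMOTE, not a reference implementation, and it assumes at least two minority samples.

```python
import random
from collections import Counter


def random_undersample(X, y, majority, minority, rng):
    """Keep a random subset of majority rows equal in size to the minority class."""
    maj = [i for i, label in enumerate(y) if label == majority]
    mino = [i for i, label in enumerate(y) if label == minority]
    keep = rng.sample(maj, len(mino)) + mino
    return [X[i] for i in keep], [y[i] for i in keep]


def random_oversample(X, y, majority, minority, rng):
    """Duplicate random minority rows until both classes have the same count."""
    maj_count = sum(1 for label in y if label == majority)
    mino = [i for i, label in enumerate(y) if label == minority]
    extra = [rng.choice(mino) for _ in range(maj_count - len(mino))]
    return X + [X[i] for i in extra], y + [y[i] for i in extra]


def smote_like(X, y, majority, minority, rng, k=3):
    """Add synthetic minority rows placed between a minority point and a near neighbour."""
    maj_count = sum(1 for label in y if label == majority)
    pts = [X[i] for i, label in enumerate(y) if label == minority]
    synthetic = []
    for _ in range(maj_count - len(pts)):
        base = rng.choice(pts)
        # Rank the other minority points by squared Euclidean distance to base.
        others = [p for p in pts if p is not base]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        neighbour = rng.choice(others[:k])
        # Pick a random position on the segment from base to neighbour.
        gap = rng.random()
        synthetic.append([a + gap * (b - a) for a, b in zip(base, neighbour)])
    return X + synthetic, y + [minority] * len(synthetic)


# Toy 90:10 dataset (invented for this sketch): class 0 near (0, 0), class 1 near (3, 3).
rng = random.Random(0)
X = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(90)]
X += [[rng.gauss(3, 1), rng.gauss(3, 1)] for _ in range(10)]
y = [0] * 90 + [1] * 10

X_u, y_u = random_undersample(X, y, 0, 1, rng)
X_o, y_o = random_oversample(X, y, 0, 1, rng)
X_s, y_s = smote_like(X, y, 0, 1, rng)
print(Counter(y_u), Counter(y_o), Counter(y_s))
```

In practice, resampling like this is usually applied to the training split only, so that the test set keeps the original class distribution.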
Code - colab.research.google.com/dri...
============================
Did you like my teaching style?
Check my affordable mentorship program at : learnwith.campusx.in
DSMP FAQ: docs.google.com/document/d/1O...
============================
⌚Time Stamps⌚
00:00 - Intro
00:54 - What is Imbalanced Data?
04:10 - Problems with Imbalanced Data
08:00 - Imbalanced Data Demo
11:13 - Why is studying imbalanced data important?
16:58 - Undersampling
25:56 - Oversampling
31:06 - SMOTE
42:43 - Ensemble Learning
47:06 - Cost Sensitive Learning
51:30 - Other techniques

Comments: 44

  • @campusx-official
    2 months ago

    I had to reupload this video because I forgot to include the part on ensemble techniques due to an editing error in the previous upload. Check timestamps.

  • @Ashishkumar-id1nn
    2 months ago

    Sir, please make a video on the difference between encoding and embedding

  • @mohitjoshi8984
    2 months ago

    Sir, please make a video on A/B testing.

  • @AMANRAJ-dt8gu
    2 months ago

    I am writing to request your assistance in creating videos that delve into metaheuristic approaches, such as genetic algorithms, ant colony optimization, and others. It has come to my attention that there is a noticeable scarcity of resources covering these topics on platforms like KZread.

  • @sagarbp-2854
    2 months ago

    Sir, make a video about A/B testing.

  • @ansh-t8e
    3 days ago

    Thanks, sir, for this beautiful playlist. I never thought I would be able to understand all the concepts of ML so easily. I am really grateful to you. I just completed this playlist after grinding for around 2 months. I have been going through some problems in my life that I am fighting, but I will come out stronger now that I have completed this. Soon I will start the DL playlist and complete it too. Thank you for everything, sir.

  • @advaitdanade7538
    2 months ago

    Thank you, sir, for the best series on KZread. I just completed it in 2 months by watching 4 hours daily at 1.5x speed.

  • @shripaddeshpande5766
    2 months ago

    Another fantastic video by Nitish! Wonderful!!!

  • @manikarnikatiwari199
    2 months ago

    Thank you so much, Nitish 😊 You are the best in everything. 🎉 Thanks for being my teacher 😊🙏

  • @user-mg5fk7mf5c
    1 month ago

    I understood everything, sir. Thank you so much. You are the best.

  • @divyakarlapudi
    2 months ago

    Thank you so much for this video, very helpful, sir 🤌

  • @vinayakvijay108
    2 months ago

    Awesome Content

  • @mukeshrajpurohit5593
    2 months ago

    Hi Sir, big fan!! I was searching for a class imbalance video and you uploaded it at the right time. I am training an ANN model for customer churn prediction, and my dataset has a 96:4 class imbalance. I have tried upsampling, downsampling, SMOTE, SMOTE-ENN, and class weights, but none of them gave promising results: the model fails to predict the minority class well and the recall is very low. What should be done in such a case? I have also trained an XGBoost classifier, but that model did not perform well either.

  • @wamiqmushtaq2825
    2 months ago

    Sir, please do a session on cross-validation. There's no separate video on cross-validation in the ML playlist.

  • @soumyaranjandas7394
    2 months ago

    Dear Nitish sir, please make a video on how to fine-tune a Llama LLM on our custom data.

  • @ParthivShah
    12 days ago

    Thank You Sir.

  • @uditbhandari5791
    2 months ago

    Sir, when will you start a new batch for DSMP?

  • @souvik5560
    2 months ago

    Nitish: at 7:00, it should be "testing data" for determining the accuracy. Am I correct?

  • @tusharshukla9361
    2 months ago

    Nitish Sir, please update your Machine Learning Roadmap and add links to your new videos (we want more and more of your videos).

  • @balrajprajesh6473
    2 months ago

    Thank you very much sir

  • @nsbipritam9682
    1 month ago

    very helpful video

  • @himanshurathod4086
    2 months ago

    Please continue your LLM transformers series, and also please upload videos on NLP NER and topic modeling.

  • @haroonmalik2195
    2 months ago

    Sir, please also make a video on the multi-label classification problem.

  • @muhammadikram375
    2 months ago

    Sir, please do some work on the MLOps playlist.

  • @bhushansonawane5915
    1 month ago

    Hello sir, how can I connect with you? I need urgent help, please.

  • @Sulehri226
    2 months ago

    Thanks Sir

  • @pujarameet9699
    1 month ago

    Is this series complete, or is anything remaining, sir?

  • @not_amanullah
    2 months ago

    Thanks

  • @chandrimapramanick1111
    2 months ago

    Sir, I truly admire your work and love all of your videos; I learn so much from them. Thank you!!! I have one question: at the end of the video you said that in spam filtering the false positive is the critical error. But if a message is spam and is classified as not spam (a false negative), isn't that the critical case? False negatives are generally considered more dangerous here because they can expose the recipient to potential harm.

  • @RajatTomar-r7i
    27 days ago

    I think the false positive is more critical because it may send an important email to the spam folder, which is more harmful than letting some spam through as legitimate mail.

  • @parth.mandaliya
    2 months ago

    Please make a new video on transformers 🙏

  • @not_amanullah
    2 months ago

    🖤

  • @anandshaw-ie3qk
    2 months ago

    It's better.

  • @user-vj3nx7sh8r
    1 month ago

    By the time we reach the end of the playlist, it feels like you have gone from young to old.

  • @mohitnemade5320
    2 months ago

    Nitesh bhai, your knowledge is perfect, but the videos are so long that we can't watch them in full even when we want to. Please try to make shorter videos 🙏🤝👍

  • @Awm_king-y9i
    2 months ago

    Everyone is talking about LLMs, and you are still stuck on machine learning.

  • @abhinavkale4632
    2 months ago

    Bhai, Nitesh sir is covering LLM videos too. To us, these concepts are still gold and they are used everywhere.

  • @Awm_king-y9i
    2 months ago

    @abhinavkale4632 Bhai, I have all of sir's videos on my laptop, every one of them. So far only the history of LLMs has been covered.

  • @omsaikommawar
    2 months ago

    From an interviewer's perspective, an imbalanced dataset is a common topic in interviews. Focusing on simple topics can increase your chances of success in cracking the interview.

  • @samarmohanty6109
    5 days ago

    Have you finished ML?

  • @Awm_king-y9i
    5 days ago

    @samarmohanty6109 Yes, I have finished generative AI too.

  • @Awm_king-y9i
    2 months ago

    Sir, you are far behind.