Imbalanced Data in Machine Learning | Undersampling | Oversampling | SMOTE
Imbalanced data refers to datasets where the class distribution is heavily skewed, with one class significantly outnumbering the others. Handling this matters because a model trained on such data tends to be biased toward the majority class and to perform poorly on the minority classes. This video covers undersampling (reducing majority-class samples), oversampling (increasing minority-class samples), SMOTE (Synthetic Minority Over-sampling Technique), and ensemble methods (combining multiple models), which help mitigate that bias and improve predictive performance on minority classes.
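As a rough illustration of the three resampling ideas named above, here is a minimal pure-Python sketch. It is not the notebook code from the video: the function names (random_undersample, random_oversample, smote_like), the toy data, and the default k=3 are assumptions made for this example, and the SMOTE part is a simplified version of the interpolation idea rather than a full implementation.

```python
import random
from collections import Counter


def _group_by_class(X, y):
    """Collect feature rows under their class label."""
    groups = {}
    for row, label in zip(X, y):
        groups.setdefault(label, []).append(row)
    return groups


def random_undersample(X, y, seed=0):
    """Randomly drop rows so every class shrinks to the smallest class size."""
    rng = random.Random(seed)
    groups = _group_by_class(X, y)
    target = min(len(rows) for rows in groups.values())
    X_out, y_out = [], []
    for label, rows in groups.items():
        for row in rng.sample(rows, target):
            X_out.append(row)
            y_out.append(label)
    return X_out, y_out


def random_oversample(X, y, seed=0):
    """Randomly duplicate rows so every class grows to the largest class size."""
    rng = random.Random(seed)
    groups = _group_by_class(X, y)
    target = max(len(rows) for rows in groups.values())
    X_out, y_out = [], []
    for label, rows in groups.items():
        extra = [rng.choice(rows) for _ in range(target - len(rows))]
        for row in rows + extra:
            X_out.append(row)
            y_out.append(label)
    return X_out, y_out


def smote_like(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    point and one of its k nearest minority neighbours (needs >= 2 points)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        others = [p for j, p in enumerate(minority) if j != i]
        # Sort the remaining minority points by squared Euclidean distance.
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        neighbour = rng.choice(others[:k])
        lam = rng.random()  # position along the segment base -> neighbour
        synthetic.append([a + lam * (b - a) for a, b in zip(base, neighbour)])
    return synthetic


# Toy data (made up for this sketch): 90 majority rows (class 0), 10 minority rows (class 1).
X = [[float(i), float(i % 7)] for i in range(100)]
y = [0 if i < 90 else 1 for i in range(100)]

X_under, y_under = random_undersample(X, y)
X_over, y_over = random_oversample(X, y)
minority = [row for row, label in zip(X, y) if label == 1]
X_synth = smote_like(minority, n_new=80)

print(Counter(y_under))  # class counts after undersampling
print(Counter(y_over))   # class counts after oversampling
print(len(X_synth))      # number of synthetic minority rows
```

In practice, resampling of this kind is applied to the training split only, so that the test set keeps the original class distribution.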
Code - colab.research.google.com/dri...
⌚Time Stamps⌚
00:00 - Intro
00:54 - What is Imbalanced Data?
04:10 - Problems with Imbalanced Data
08:00 - Imbalanced Data Demo
11:13 - Why is studying imbalanced data important?
16:58 - Undersampling
25:56 - Oversampling
31:06 - SMOTE
42:43 - Ensemble Learning
47:06 - Cost Sensitive Learning
51:30 - Other techniques
Comments: 44
I had to reupload this video because I forgot to include the part on ensemble techniques due to an editing error in the previous upload. Check timestamps.
@Ashishkumar-id1nn
2 months ago
Sir, please make a video on the difference between encoding and embedding
@mohitjoshi8984
2 months ago
Sir, please make a video on A/B testing
@AMANRAJ-dt8gu
2 months ago
I am writing to request your assistance in creating videos that delve into metaheuristic approaches, such as genetic algorithms, ant colony optimization, and others. It has come to my attention that there is a noticeable scarcity of resources covering these topics on platforms like KZread.
@sagarbp-2854
2 months ago
Sir, make a video about A/B testing
Thanks sir for this beautiful playlist. Never have I ever thought I would be able to understand all concepts of ML so easily. I am really grateful to you. I just completed this playlist after grinding for around 2 months. I have been going through some problems in my life that I am fighting, but I will come out stronger now that I have completed this. Soon I will start the DL playlist and complete it too. Thank you for everything sir.
Thank you sir for the best series on KZread, I just completed it in 2 months by watching 4 hr daily at 1.5x speed
Another fantastic video by Nitish! Wonderful!!!
THANK you so much Nitish 😊u are the best in everything.🎉 Thanks for being my teacher 😊🙏
I understood everything, sir. Thank you so much. You are the best.
Thank you so much for this video, very helpful sir 🤌
Awesome Content
Hi Sir, Big Fan!! I was searching for a class imbalance video and you have uploaded it at the right time. I am training an ANN model for customer churn prediction where my dataset has a class imbalance of 96:4. I have used upsampling, downsampling, SMOTE, SMOTE-ENN, and class weights, but none of them gave promising results: they fail to predict well on the minority class, and the recall value is very low. What should be done in such a case, where the model is not predicting well on the minority class? I have also trained an XGBoost classifier, but that model also did not perform well.
Sir, please do a session on cross validation. There's no separate video on cross validation in the ML playlist.
Dear Nitish sir, please make a video on how to fine-tune on our custom data using a Llama LLM.
Thank You Sir.
Sir, when will you start a new batch for DSMP?
Nitish, at 7:00 it should be "testing data" for determining the accuracy. Am I correct?
Nitish Sir, please update your Machine Learning Roadmap and add links to your new videos (we want more and more videos of yours)
Thank you very much sir
very helpful video
Please continue your LLM transformers series, and also please upload NLP NER and topic modeling.
Sir, also make a video on the multi-label classification problem.
Sir, please do some work on the MLOps playlist
Hello sir, how can i connect with you ? Need urgent help please
Thanks Sir
Is this series complete, or is anything remaining, sir?
Thanks
Sir, I truly admire your work and love all of your videos, learning so much from them. Thank you!!! I have one question: at the end of the video you said that in spam filtering the false positive is the critical one, but if a message is spam and is classified as not spam (a false negative), that will be the critical case, won't it? False negatives are generally considered to be more dangerous in this case because they can expose the recipient to potential harm.
@RajatTomar-r7i
27 days ago
I think the false positive is more critical because it may send your important mail to spam, which is more harmful than showing some spam mails as important mail.
Please make a new video on transformers 🙏
🖤
it's better
By the end of the playlist, it feels like you have gone from young to old.
Nitesh bhai, your knowledge is perfect, but the videos are so long that we cannot watch them in full even when we want to. Please try to make shorter videos 🙏🤝👍
Everyone is talking about LLMs, and you are still stuck on machine learning.
@abhinavkale4632
2 months ago
Bhai, Nitesh sir is covering LLM videos too. To us, these concepts are still gold and they are used everywhere.
@Awm_king-y9i
2 months ago
@abhinavkale4632 Bhai, I have all of sir's videos on my laptop. In total, I have only studied the history of LLMs video so far.
@omsaikommawar
2 months ago
From an interviewer's perspective, an imbalanced dataset is a common topic in interviews. Focusing on simple topics can increase your chances of success in cracking the interview.
@samarmohanty6109
5 days ago
Have you finished ML?
@Awm_king-y9i
5 days ago
@samarmohanty6109 Yes, I have finished generative AI too.
Sir, you are far behind.