Silhouette Score for clustering Explained | Silhouette (clustering)- Validating Clustering Models

Silhouette Score for clustering Explained | Silhouette (clustering)- Validating Clustering Models
#SilhouetteScore #UnfoldDataScience
Hello ,
My name is Aman and I am a Data Scientist.
About this video:
In this video, I speak about Silhouette Score and explain step by step how Silhouette Score works for cluster validation. I explain how to validate clusters and how to measure goodness of clusters. I explain the mathematical formula of Silhouette Score and intuition behind it. Below points are discussed in this video:
1. Silhouette Score for clustering
2. Validation on K-means clusters
3. Cluster validation techniques
4. How to measure goodness of clusters
5. Unsupervised machine learning accuracy
About Unfold Data science: This channel is to help people understand basics of data science through simple examples in easy way. Anybody without having prior knowledge of computer programming or statistics or machine learning and artificial intelligence can get an understanding of data science at high level through this channel. The videos uploaded will not be very technical in nature and hence it can be easily grasped by viewers from different background as well.
If you need Data Science training from scratch . Please fill this form (Please Note: Training is chargeable)
docs.google.com/forms/d/1Acua...
Book recommendation for Data Science:
Category 1 - Must Read For Every Data Scientist:
The Elements of Statistical Learning by Trevor Hastie - amzn.to/37wMo9H
Python Data Science Handbook - amzn.to/31UCScm
Business Statistics By Ken Black - amzn.to/2LObAA5
Hands-On Machine Learning with Scikit Learn, Keras, and TensorFlow by Aurelien Geron - amzn.to/3gV8sO9
Ctaegory 2 - Overall Data Science:
The Art of Data Science By Roger D. Peng - amzn.to/2KD75aD
Predictive Analytics By By Eric Siegel - amzn.to/3nsQftV
Data Science for Business By Foster Provost - amzn.to/3ajN8QZ
Category 3 - Statistics and Mathematics:
Naked Statistics By Charles Wheelan - amzn.to/3gXLdmp
Practical Statistics for Data Scientist By Peter Bruce - amzn.to/37wL9Y5
Category 4 - Machine Learning:
Introduction to machine learning by Andreas C Muller - amzn.to/3oZ3X7T
The Hundred Page Machine Learning Book by Andriy Burkov - amzn.to/3pdqCxJ
Category 5 - Programming:
The Pragmatic Programmer by David Thomas - amzn.to/2WqWXVj
Clean Code by Robert C. Martin - amzn.to/3oYOdlt
My Studio Setup:
My Camera : amzn.to/3mwXI9I
My Mic : amzn.to/34phfD0
My Tripod : amzn.to/3r4HeJA
My Ring Light : amzn.to/3gZz00F
Join Facebook group :
groups/41022...
Follow on medium : / amanrai77
Follow on quora: www.quora.com/profile/Aman-Ku...
Follow on twitter : @unfoldds
Get connected on LinkedIn : / aman-kumar-b4881440
Follow on Instagram : unfolddatascience
Watch Introduction to Data Science full playlist here : • Data Science In 15 Min...
Watch python for data science playlist here:
• Python Basics For Data...
Watch statistics and mathematics playlist here :
• Measures of Central Te...
Watch End to End Implementation of a simple machine learning model in Python here:
• How Does Machine Learn...
Learn Ensemble Model, Bagging and Boosting here:
• Introduction to Ensemb...
Build Career in Data Science Playlist:
• Channel updates - Unfo...
Artificial Neural Network and Deep Learning Playlist:
• Intuition behind neura...
Natural langugae Processing playlist:
• Natural Language Proce...
Understanding and building recommendation system:
• Recommendation System ...
Access all my codes here:
drive.google.com/drive/folder...
Have a different question for me? Ask me here : docs.google.com/forms/d/1ccgl...
My Music: www.bensound.com/royalty-free...

Пікірлер: 162

  • @shaneglean217
    @shaneglean2172 жыл бұрын

    The explanation is so concise and simple, even in spite of the fact that english isn't your 1st language. Amazing work my dude.

  • @aaryalad7775
    @aaryalad777518 күн бұрын

    I'm glad i found a teacher like you

  • @chrisrichardson8864
    @chrisrichardson88642 жыл бұрын

    Clear and right to the point! Thank you for your work! Much appreciated!

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks for watching Chris. Stay safe. Tc

  • @nicolaevasiliu9430
    @nicolaevasiliu9430 Жыл бұрын

    Amazing explanations! THANK YOU !!! You are an excellent teacher !!!

  • @Pannafreestyle
    @Pannafreestyle Жыл бұрын

    Sir you are the best teacher for DS ! Keep up the great work

  • @isaacyekoniya8067
    @isaacyekoniya80673 жыл бұрын

    This is simple and well explained, thank you Aman!

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Most welcome!

  • @kushik.naveen
    @kushik.naveenАй бұрын

    This is so useful for understanding & revising. Amazing work! Thankyou!

  • @victorcardenas9621
    @victorcardenas96212 жыл бұрын

    Simple but powerful, thanks and greetings from Colombia!

  • @samiayasmin5782
    @samiayasmin5782 Жыл бұрын

    Thank you so much. Simple and clean explanation.

  • @JuliaCh47
    @JuliaCh472 жыл бұрын

    Thank you so much, best explanation I've seen on the Internet!

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks a lot for watching.

  • @Felipe90820
    @Felipe908203 ай бұрын

    Amazing explanation! Congrats and thank you!

  • @user-hs6gz4yn1u
    @user-hs6gz4yn1uАй бұрын

    Ur Explanation is simple and understandable,TQ sir

  • @the.quantum.paradox
    @the.quantum.paradox2 жыл бұрын

    Very simple and easy to understand!

  • @KHakimin
    @KHakimin3 жыл бұрын

    This is gold! thanks for putting it very simple to understand

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Glad it was helpful Khairul.

  • @dbapple
    @dbapple6 ай бұрын

    thank you, straight to the point!

  • @Krishna-pm8ty
    @Krishna-pm8ty Жыл бұрын

    Excellent Amen. Perfect explanation👏👏👏

  • @sanjithaamarathunga9949
    @sanjithaamarathunga99492 жыл бұрын

    Thank you! Simple and well explained!

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Glad it was helpful!

  • @uchennanwosu5327
    @uchennanwosu53272 жыл бұрын

    Excellent illustration and explanation.

  • @YouTribalChief
    @YouTribalChief Жыл бұрын

    What a gem of an explanation you give bhai! Truly you make things so simple and understandable! You deserve more!

  • @UnfoldDataScience

    @UnfoldDataScience

    Жыл бұрын

    Thanks a lot. Pls share with friends as well. Have a nice weekend ahead.

  • @mdyounusahamed6668
    @mdyounusahamed6668 Жыл бұрын

    very concise and simple explanation.

  • @MyerNore
    @MyerNore7 ай бұрын

    Clearer than my professor, for sure

  • @paulholzapfel1856
    @paulholzapfel1856 Жыл бұрын

    Perfect explanation. Thank you Aman

  • @UnfoldDataScience

    @UnfoldDataScience

    Жыл бұрын

    Welcome Paul. Your feedback is precious to me.

  • @liqs_xd
    @liqs_xd10 ай бұрын

    Great video, thanks for your time!

  • @UnfoldDataScience

    @UnfoldDataScience

    10 ай бұрын

    My pleasure!

  • @omniscienceisdead8837
    @omniscienceisdead88376 ай бұрын

    you are the best bro!

  • @krishnabhadke6161
    @krishnabhadke61612 жыл бұрын

    very well explained sir thank you

  • @RezoanurRahman
    @RezoanurRahman Жыл бұрын

    You explained it so well even a 5th grader would get it. Its so easily explained!.

  • @Suno-99
    @Suno-992 ай бұрын

    beautiful explanation, keep it up

  • @blackyogurt
    @blackyogurtАй бұрын

    Great Explanation . thanks

  • @keinestudiere-chillzone633
    @keinestudiere-chillzone6332 жыл бұрын

    Straight to the point. Thank you

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks a lot for watching

  • @user-dd3mf2sp7s
    @user-dd3mf2sp7s2 жыл бұрын

    Great explanation, Thank you!

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Welcome

  • @sangeethaagoudar2175
    @sangeethaagoudar21752 жыл бұрын

    Thank You so much for this video sir

  • @optimalsolutionprogramming1299
    @optimalsolutionprogramming12992 жыл бұрын

    Thank you for your help. Really great explanation!

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Welcome

  • @Anon11229
    @Anon112292 жыл бұрын

    Too good...Way better than online courses that take thousands and thousands of money keep up the hard work

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks Zaid.

  • @anshuraghav5620
    @anshuraghav5620Ай бұрын

    Great video

  • @mmmommm237
    @mmmommm237 Жыл бұрын

    Great, ty!

  • @shaikhuzma786
    @shaikhuzma7867 ай бұрын

    Tqsm sir so well explained ❤

  • @muhammedthayyib9202
    @muhammedthayyib9202 Жыл бұрын

    Clearly explained!

  • @lhadz7290
    @lhadz72908 ай бұрын

    Thank you so much for Sir, I am so glad I came across your video. I look forward to learn more from you. Thank you again.

  • @UnfoldDataScience

    @UnfoldDataScience

    8 ай бұрын

    So nice of you

  • @ravisavaliya4333
    @ravisavaliya43332 жыл бұрын

    at first it was very difficult for me to understand this, but you explained this in a very easy way....THANKYOU AMAN SIR FOR YOUR EASY AND INTUITIVE EXPLANATION!!!

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    It's my pleasure

  • @chrisx742
    @chrisx7425 ай бұрын

    Thank you.

  • @stevemungai3542
    @stevemungai35422 жыл бұрын

    Crystal clear

  • @akashpb4044
    @akashpb40442 жыл бұрын

    Awesome ...plain and simple 👍🏼

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Glad to hear that Akash 🙂, please share with others as well who could be benefited from such content.

  • @amoghbharadwaj9252
    @amoghbharadwaj92522 жыл бұрын

    beautifully explained!!! Thank u so much

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks Amogh. Happy Holi.

  • @kamal_douma
    @kamal_douma5 ай бұрын

    very very good explnation

  • @vallimuthaiyah5098
    @vallimuthaiyah50983 жыл бұрын

    Thank you sir for very clear explanation with example.. 👍

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Welcome Valli :)

  • @kavyasharma4738
    @kavyasharma47382 жыл бұрын

    how can i thank you brother . you explained this topic so easily.

  • @sajjaddehghani3280
    @sajjaddehghani32802 жыл бұрын

    Thanks for the very simple and easy to understand explanation

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Welcome

  • @parvsharma8767
    @parvsharma87673 жыл бұрын

    Thanks bro for explaining so perfectly

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Welcome Parv.

  • @veyselaytekin8734
    @veyselaytekin8734 Жыл бұрын

    thank you

  • @theone-jz2rf
    @theone-jz2rf Жыл бұрын

    keep up the good work!!!

  • @UnfoldDataScience

    @UnfoldDataScience

    Жыл бұрын

    Thank you.

  • @sourabhpramanik3957
    @sourabhpramanik39572 жыл бұрын

    Its really a simple and good explanation for anybody to understand...thanks for making this simple explanation video.

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Welcome Sourabh

  • @PranavVashisth
    @PranavVashisth2 жыл бұрын

    verry well explained !!!!

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks Pranav.

  • @nandankakadiya1494
    @nandankakadiya14943 жыл бұрын

    Amazing explaination thank you sir

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Welcome Nandan.

  • @legolife3012
    @legolife30123 жыл бұрын

    Thanks a lot for this simple explenation

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    You are welcome

  • @anisadillap.7131
    @anisadillap.71312 жыл бұрын

    oke i understand, thankyou sir ...

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Welcome

  • @diyap7343
    @diyap73432 жыл бұрын

    This is amazing!

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks Diya.

  • @almonddonut1818
    @almonddonut1818 Жыл бұрын

    Thank you!!

  • @UnfoldDataScience

    @UnfoldDataScience

    Жыл бұрын

    Welcome.

  • @venkataprashanth9831
    @venkataprashanth9831 Жыл бұрын

    great and simple explanation, a big clap to ur efforts

  • @UnfoldDataScience

    @UnfoldDataScience

    Жыл бұрын

    Thanks Venkat.

  • @sandipansarkar9211
    @sandipansarkar92112 жыл бұрын

    finished watching

  • @mohe4ever514
    @mohe4ever5142 жыл бұрын

    Very well explained!

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Glad it was helpful!

  • @nandinik7816
    @nandinik78162 жыл бұрын

    Always to the point...and you dont waste any time actually...

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks again.

  • @dr.zliiiiii
    @dr.zliiiiii2 жыл бұрын

    nice video

  • @SACHINKUMAR-px8kq
    @SACHINKUMAR-px8kq2 жыл бұрын

    Thanks you so much sir

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Welcome Sachin.

  • @puodzius
    @puodzius2 жыл бұрын

    Very good explanation, thanks

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Welcome.

  • @dosendaring
    @dosendaring2 жыл бұрын

    This is very good explanation, thank you Sir....

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Welcome Yudy.

  • @shaheerrehman3151
    @shaheerrehman3151 Жыл бұрын

    Thank you. Much love to you from Pakistan

  • @UnfoldDataScience

    @UnfoldDataScience

    Жыл бұрын

    Thanks Shaheer, pls share with friends if you like the content.

  • @MohdRashid-hi7cp
    @MohdRashid-hi7cp10 ай бұрын

    Very nice explanation

  • @UnfoldDataScience

    @UnfoldDataScience

    10 ай бұрын

    Thanks Rashid.pls share with friends also

  • @beprepare8128
    @beprepare81282 жыл бұрын

    Very well explained sir

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks and welcome

  • @bangarrajumuppidu8354
    @bangarrajumuppidu83542 жыл бұрын

    excellent

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks a lot.

  • @megalaramu
    @megalaramu3 жыл бұрын

    Hi Aman, I do have a general question in clustering -I have heard of people do clustering in the dataset initially and then train separate model for each cluster. my question is lets take an example of telecom churn - there will be many services like internet, Phone, Dish etc. is there anyway we can cluster like internet users in one cluster, phone service in one cluster like that.

  • @user-ur2en1zq4f
    @user-ur2en1zq4f Жыл бұрын

    thanks sir

  • @UnfoldDataScience

    @UnfoldDataScience

    Жыл бұрын

    Most welcome

  • @Chaseaction-outdoor-activities
    @Chaseaction-outdoor-activities3 жыл бұрын

    thank you!!!

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Welcome.

  • @Live_life_with_no_regrets
    @Live_life_with_no_regrets4 ай бұрын

    is there any way to run it in SPSS? Is it correct if I use the average of the Euclidean distance of the cluster?

  • @jeanhwang18
    @jeanhwang18 Жыл бұрын

    Thank you so much for your explaination. I have a small question: after I got point i ' s silhouette coefficient, I calculate other two points in cluster A, then I take the average of the three silhouette coefficients, let's say it's NO.1. What we can learn from No.1? If No.1 is close to +1, it's good, it means data points in cluster A are most similar to each other, right?

  • @vishwass5103
    @vishwass51033 жыл бұрын

    Hi Aman thanks for this explanation. Please explain about assumptions of k means clustering

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Assumption at high level is, you data has clusters and centriods.

  • @salilshukla6488
    @salilshukla64882 жыл бұрын

    thanks

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Welcome.

  • @neajinnovations8587
    @neajinnovations85878 ай бұрын

    I got all the concept clear, but please when is it appropriate to use silhouette distance? it when you have mixed data? or mix data is best for elbow method?

  • @bipulnath4602
    @bipulnath46023 жыл бұрын

    Amazing explaination..plz make a video on Dunn index.

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Thanks Bipul.

  • @gnaneshgn8341
    @gnaneshgn83413 жыл бұрын

    Nice video Sir thank you. please make an video of EDA sir ..!

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Thanks Ganesh.

  • @iftikhar58
    @iftikhar582 жыл бұрын

    clear

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thank you

  • @prashanths4455
    @prashanths44552 жыл бұрын

    super bro

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thanks Prashanth

  • @ranajaydas8906
    @ranajaydas89063 жыл бұрын

    Sir please make a video on PCA . It would be very helpful

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Sure Ranajay.

  • @handsomemehdi3445
    @handsomemehdi3445 Жыл бұрын

    Thank u for simpler explanation! A question: In our case, it seems that for both clusters, b should be the same unless we have more than two clusters, (or since those clusters are mutually nearest) doesn't it?

  • @lkjhlkjh1954

    @lkjhlkjh1954

    Жыл бұрын

    I think that’s why we do not calculate silhouettes score for less than 2 cluster

  • @christopherlegarda5164
    @christopherlegarda51643 ай бұрын

    This guy is god!

  • @TheGentlemanGR
    @TheGentlemanGR2 жыл бұрын

    So which score is better? -0.2 or 0.21 ?

  • @himanshumangoli6708
    @himanshumangoli67082 жыл бұрын

    As we have find cohesion and separation for only one point in one cluster A, so we have to find Silhouette score for all points for all points in same cluster so just to ensure that points are correctly clustered ?? Please explain??

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Yes , one point is just for example.

  • @mahipatil369
    @mahipatil36911 ай бұрын

    hello sir.. great video i understand this topic very well,, thank you so much... just want to ask did you make video of implementation of silhoutte score in python?? if yes plz provide link..

  • @UnfoldDataScience

    @UnfoldDataScience

    11 ай бұрын

    Thanks a lot. Its very simple to do in python - see this link - stackoverflow.com/questions/59919627/how-to-calculate-the-silhouette-score-for-each-cluster-separately-in-python

  • @subhasisdutta3711
    @subhasisdutta37113 жыл бұрын

    you have said "Min distance of A1,A2,A3 is the value of b ". Here you are talking about mean value of three or the which one has the minimum value among three , we'll chose it. Please clear it once, if possible

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    HI Subhasis, A1, A2 etc are "mean" Distance from various other clusters ok. Now let's take att these A1 TO AN values and take the minimum from this set. This mimum is b.

  • @subhasisdutta3711

    @subhasisdutta3711

    3 жыл бұрын

    @@UnfoldDataScience thanq for the clarification. Have a blessed life ahead

  • @sampathvinaykumarreddymajj790
    @sampathvinaykumarreddymajj7903 жыл бұрын

    Topic is very well explained sir .. But should we learn DB Index, Dunn Index, Jaccard score ... or only this Silhouette score is enough

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    Thanks Sampath.

  • @gretchenbekker2371
    @gretchenbekker23712 жыл бұрын

    How do I get the amount of data points within the clusters ?

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Levels can be found out after clustering then we can count using pandas.

  • @arisurya9867
    @arisurya98672 жыл бұрын

    Nice video sir. But i have problem when calculate cohesion in cluster just has one data point. How i should set value of cohesion? 0 or 1?

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    Thats not a good clustering process, clusters need to be reformed.

  • @arisurya9867

    @arisurya9867

    2 жыл бұрын

    Thank you sir 🙏

  • @parikshitgurjar5545
    @parikshitgurjar55453 жыл бұрын

    hello sir, Plese make a video on the explanation and implementation of ppf,pdf,cdf in python with simple explanation and understanding, I am very confused in this.

  • @UnfoldDataScience

    @UnfoldDataScience

    3 жыл бұрын

    pdf cdf video is available Parikshit. Please search in my channel.

  • @parikshitgurjar5545

    @parikshitgurjar5545

    3 жыл бұрын

    @@UnfoldDataScience Thankyou sir❤️

  • @pruthvi491
    @pruthvi4912 жыл бұрын

    How to get silhouette score for spectral clustering

  • @UnfoldDataScience

    @UnfoldDataScience

    2 жыл бұрын

    I Will check.

  • @nerdymath6
    @nerdymath6 Жыл бұрын

    is 0.4 is good silhouette score?

  • @UnfoldDataScience

    @UnfoldDataScience

    Жыл бұрын

    Dont think so

  • @dariomelconian9502
    @dariomelconian9502 Жыл бұрын

    Is 0.39 a bad score?

  • @UnfoldDataScience

    @UnfoldDataScience

    Жыл бұрын

    can be better

  • @abeerhamid
    @abeerhamid24 күн бұрын

    How can i read the graph of silhouette score? Most important thing you didn't tell

  • @UnfoldDataScience

    @UnfoldDataScience

    20 күн бұрын

    Positive Scores (close to 1): Samples are well-clustered, and there is good separation between clusters. Scores around 0: Samples are on or near the boundary between clusters, indicating potential overlap. Negative Scores: Samples are likely in the wrong cluster.

  • @sameeransari9129
    @sameeransari9129 Жыл бұрын

    apki english samajhne me time lg jata hai.. ki aap kya bolna chahte ho

  • @sameeransari9129
    @sameeransari9129 Жыл бұрын

    apki english me confuse ho jate hai log ki aap kya bolna chahte ho...isko negitive comment ki tarah mt lena...aap samjhate aacha ho pr agar hindi me samjhaoge toh hm logo ko clear samaj ayega

  • @sameeransari9129
    @sameeransari9129 Жыл бұрын

    Sir aap hindi me samjhaya karo toh zyada aacha samjha payoge...

  • @sandipansarkar9211
    @sandipansarkar92112 жыл бұрын

    finished watching

Келесі