End To End Machine Learning Project With Deployment | Customer Churn Analysis | Churn Prediction

This is an end-to-end machine learning project covering business understanding, data collection, data exploration, model building, and deployment, explained in detail.
Code & Datasets for Practice: github.com/pik1989/MLProject-ChurnPrediction
𝗔𝗴𝗲𝗻𝗱𝗮:
0:00 Class begins!!
0:30 Introduction to the session
2:05 What is churning, How customer retention is important in various industries?
4:45 Background - Internal & External outlook of the problem statement
5:27 Different churn scenarios
7:20 Decision cycle of a subscriber
12:34 Different churn segments
13:27 High level overview of a Data Science led approach to manage churn
15:35 Suggested data to be captured to solve this problem
20:10 Future scope: Maturing the churn model
21:05 Solution Overview
21:30 Data Explanation
22:19 Exploratory Data Analysis Begins
1:42:25 Model Building - Building a predictive model based on the EDA done to identify probable churners
2:20:36 Flask explanation in short
2:22:36 Model Deployment process begins
For a detailed walkthrough of model deployment and Flask, see kzread.info/dash/bejne/f3Wpk8iJoc_Pps4.html; for other deployment code, see the links below.
Our other playlists:
𝗣𝘆𝘁𝗵𝗼𝗻 - 𝗕𝗮𝘀𝗶𝗰𝘀 𝘁𝗼 𝗔𝗱𝘃𝗮𝗻𝗰𝗲𝗱: kzread.info/head/PLymcv5WXEpKh6uK1ak5-ReqTluWdSEK2p
𝗣𝗼𝘄𝗲𝗿 𝗕𝗜: kzread.info/head/PLymcv5WXEpKhw7gb4JYU0JhXYqJmUHYN-
𝗗𝗶𝗺𝗲𝗻𝘀𝗶𝗼𝗻𝗮𝗹𝗶𝘁𝘆 𝗥𝗲𝗱𝘂𝗰𝘁𝗶𝗼𝗻 𝗧𝗲𝗰𝗵𝗻𝗶𝗾𝘂𝗲𝘀: kzread.info/head/PLymcv5WXEpKjnWi7PMp2nul74TVXJ8q3E
𝗦𝗤𝗟 𝗳𝗼𝗿 𝗗𝗮𝘁𝗮 𝗦𝗰𝗶𝗲𝗻𝗰𝗲: kzread.info/head/PLymcv5WXEpKgmgthfC-Q2MmfqNBOFiAMH
𝗠𝗮𝗰𝗵𝗶𝗻𝗲 𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴 𝗗𝗲𝗽𝗹𝗼𝘆𝗺𝗲𝗻𝘁𝘀: kzread.info/head/PLymcv5WXEpKg14VB_uDEYjruW-VTowEjk
Any further questions?
𝗥𝗲𝗮𝗰𝗵 𝗼𝘂𝘁 𝘁𝗼 𝗺𝗲 𝗱𝗶𝗿𝗲𝗰𝘁𝗹𝘆 𝗮𝘁: www.linkedin.com/in/satyajitpattnaik
#datascience #machinelearning #customerchurnanalysis #customerretention #customerchurnanalytics #customeranalysis #telecom #telco #usecase #deeplearning #flask #eda #exploratorydataanalysis #python

Comments: 314

  • @SatyajitPattnaik
    @SatyajitPattnaik 3 years ago

    Code & Dataset for practice: github.com/pik1989/MLProject-ChurnPrediction Also watch, Churn Analysis dashboard creation using Power BI: Part 1: kzread.info/dash/bejne/dqatyM9yhsSrp5c.html Part 2: kzread.info/dash/bejne/n3WExbmYiae7e7Q.html

  • @WOW-vn5jd

    @WOW-vn5jd

    3 years ago

    can i get ppt of this ?? or can you tell how w can make such a wonderfull ppt on telecom customer churn like this ??

  • @VenuGopal-dr8ln

    @VenuGopal-dr8ln

    2 years ago

    Thanks a lot 🎉🎉

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    @@WOW-vn5jd Canva 😁

  • @jahangagan5396

    @jahangagan5396

    2 years ago

    Didn't get the insights that you find from describe() method like 75% clients have 55 months tenure and 25% clients pay some USD

  • @depeshkumarmohanty9522

    @depeshkumarmohanty9522

    2 years ago

    @@jahangagan5396 i thnk 25 is wrongly typed it should be 75
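For anyone with the same question, the percentile rows of describe() can be read as in this minimal sketch. The tenure values are made up for illustration and are not taken from the Telco dataset.

```python
import pandas as pd

# Hypothetical tenure values in months (illustrative only, not the real data)
tenure = pd.Series([1, 5, 9, 20, 29, 40, 55, 60, 72], name="tenure")

summary = tenure.describe()
q75 = summary["75%"]

# The "75%" row is the 75th percentile: roughly three quarters of the
# customers have a tenure at or below this value.
share_at_or_below = (tenure <= q75).mean()

print(summary)
print(f"75th percentile: {q75}, share at or below it: {share_at_or_below:.2f}")
```

Read this way, a 75% value of 55 for tenure means about 75% of customers have stayed 55 months or less, not that 75% of customers have exactly 55 months.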

  • @PriyankaKumari-of7se
    @PriyankaKumari-of7se 2 years ago

    Wow, you really have put in great effort to make such a detailed video. Thanks a ton! Keep sharing your knowledge 🙂

  • @AbhirupBhattacharya
    @AbhirupBhattacharya 1 year ago

    This channel is one of the best I've come across for ML, Data Analysis and Python. The level of content uploaded here is so remarkable. Thank you so much Sir. ❤

  • @ZepAnalytics

    @ZepAnalytics

    1 year ago

    😊

  • @sameerpandey5561
    @sameerpandey5561 3 years ago

    One of the best End to End ML project explanation and implementation I have seen till now.... Thank you @Satyajit for taking out your time and preparing such a wonderful video for us...

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    Thanks, that keeps me motivated.

  • @dilakshaaveesh4360
    @dilakshaaveesh4360 1 year ago

    Really appreciate your effort best project explanation so far for me ! Thank you !

  • @tonynikolaos3527
    @tonynikolaos3527 1 year ago

    I am at half the lecture and can not contain my gratitude: Totally awesome lecture and explanations! Please do other lectures related to ML- you have rare gift to impart understanding. Million Thanks! God Bless!

  • @abhishekpatra5020
    @abhishekpatra5020 1 year ago

    Wonderful explanation. Love how you get into detail wrt business understanding and coding

  • @subramanialatentview1542
    @subramanialatentview1542 1 year ago

    Really an awesome lecture looking forward for more valuable content

  • @anirbansarkar6306
    @anirbansarkar6306 3 years ago

    Thank you so much Satyajit for this wonderful end-to-end full tutorial. That was really helpful.

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    Thanks Anirban

  • @rishabsharma7114
    @rishabsharma7114 2 years ago

    The best end-to-end machine learning project. I have learnt a lot from this video. Thank you, sir.

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    Thanks Rishab

  • @mugen05
    @mugen05 4 months ago

    Very great project and explanation! Thank you! ❤

  • @chr1112
    @chr1112 3 years ago

    thank you for the tutorial you are really amazing and really have a gift to teach. Keep the good work up

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    Thanks for your kind words 👍

  • @sachin-ll1by
    @sachin-ll1by 1 month ago

    it felt so real life...like i am a da in an company and my manager is giving us walkthrough of the project...kudos

  • @surajghogare8931
    @surajghogare8931 2 years ago

    Highest way of teaching.. U covered each and every minute details in just one project.... 🙏thanks

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    Thanks a lot..

  • @AIMLOdeysey
    @AIMLOdeysey 3 years ago

    Thank you Satyajit. The sessions was very informative and helpful!!

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    Thanks Sainath

  • @kirankumar-ir5ir
    @kirankumar-ir5ir 9 months ago

    Big fan for ur efforts and explanation. I feel motivated from u

  • @bilalahmad3730
    @bilalahmad3730 2 years ago

    What a Great project, thanks Sir

  • @raghavverma1094
    @raghavverma1094 2 years ago

    what an amazing video..thanks for it

  • @priyankahello8849
    @priyankahello8849 1 year ago

    Awesome video!

  • @snehalpophale6287
    @snehalpophale6287 1 year ago

    Very very helpful!!

  • @anushkasaraswat1813
    @anushkasaraswat1813 6 months ago

    thank you so much for this video

  • @iamdare
    @iamdare 2 years ago

    Thank you for this, I really learnt a lot

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    Great to hear this 😁

  • @erukullasrikanth15
    @erukullasrikanth15 3 years ago

    Amazing work..clear explanation of EDA and explaining insights by analysing...Keep doing more end to end ML projects for us in different domains.. thank you

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    Thanks Srikanth.

  • @depeshkumarmohanty9522

    @depeshkumarmohanty9522

    2 years ago

    @@SatyajitPattnaik thanks sir and please do one more end to end ml project like this in a different domain.

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    @@depeshkumarmohanty9522 Already done, pls search "end to end satyajit" you will get more projects

  • @muskanpiplani87
    @muskanpiplani87 1 year ago

    Hi, i would want to know the procedure of what tweak in code for next 3 month churn prediction you were talking about

  • @ashwinin633
    @ashwinin633 2 years ago

    Hi actually I am having doubt how to handle the imbalanced data set like having only two col x and y and y is having both Pos and negative values in it how to approach with it I tried transforming x to log x and many more but didn't worked

  • @mohitpandey5190
    @mohitpandey5190 2 years ago

    Hi Satyajit, I believe performing resampling on entire data might be responsible for data leakage since some of the resampled examples will be the part of both training and testing datasets now contributing to a high F1 score.

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    Yes, that's a mistake I did here, you can do resampling on training data instead of the full data, I already acknowledged it in some other comment 👍

  • @mohitpandey5190

    @mohitpandey5190

    2 years ago

    Noted @@SatyajitPattnaik, Make sense. Amazing work on churn modeling 🙌🏼

  • @nitishsantpur2412
    @nitishsantpur2412 2 years ago

    Hi how can we compare two float numbers in python with greater than operation I am getting error saying >= is not a function in numpy?

  • @darshankokal4670
    @darshankokal4670 1 year ago

    Awesome content

  • @zephyrindia5173
    @zephyrindia5173 3 years ago

    Lovely project..

  • @rajeshwarraotangellapalli3791
    @rajeshwarraotangellapalli3791 15 days ago

    Many thanks

  • @a2sirmotivationdoses782
    @a2sirmotivationdoses782 3 years ago

    Thanks for the amazing End To End Explanation...... Sir can you please make a video playlist on Speech To Text Machine learning Model building from Scratch?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    Sure, will do 👍

  • @mrgroom8108
    @mrgroom8108 10 months ago

    Hello sir, Can I know the ML model of which you are using like linear regression, logistic regression, random forest

  • @srijanakhatri7854
    @srijanakhatri7854 1 year ago

    Hello, I wanted to thank you. I got the job as data analyst. In the technical interview, I was asked to do any dummy projects and walk through the process .I used this data and topic of churn analysis and did similar project like yours with the help of your code. I used Power for data visualization and used python for data modeling for the same data and was able to impress the interviewers . I also had the theoretical knowledge on what I was doing but I am not good at technical stuff because all I learnt in the school was to calculate eigen-value, PCA, manually, definition of p-value and regression analysis😂😂. I can understand what the code is doing but if I have to code,I will spent all day on youtube. Its my second day on job but I don't have proper understanding of relational database. They are using Powerbi and SSMS together for the daily reports and all. I need to sustain on the job😂😁 Please advise me with the link of videos if you have made on SQL,powerbi, and python on how they are working together.

  • @SatyajitPattnaik

    @SatyajitPattnaik

    1 year ago

    Check my playlists, there are videos on power bi and sql too (end to end)

  • @anant_gpt

    @anant_gpt

    11 months ago

    @srijanakhatri7854 can you please guide me how to go on as i am someone who is starting from scratch

  • @bikramaditya1855
    @bikramaditya1855 2 years ago

    Asking if u. Have any playlist for the very first things of excel, Powerpoint or ms word. If u have, than pls provide me with the link.

  • @lakshmikanth1988
    @lakshmikanth1988 1 month ago

    Hi how tenure_group hot coded into dummies? and how it generated 0's and 1's
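One way a tenure_group column can be built and then one-hot encoded is sketched below. The 12-month bin edges and the label format are assumptions made for illustration; the idea is that pd.cut assigns each tenure to a bin and pd.get_dummies then creates one 0/1 column per bin.

```python
import pandas as pd

# Hypothetical tenure values in months
df = pd.DataFrame({"tenure": [1, 12, 13, 30, 72]})

# Assumed 12-month bins: "1 - 12", "13 - 24", ..., "61 - 72"
labels = [f"{i} - {i + 11}" for i in range(1, 72, 12)]
bin_edges = list(range(1, 80, 12))  # 1, 13, 25, ..., 73
df["tenure_group"] = pd.cut(df["tenure"], bins=bin_edges, right=False, labels=labels)

# One column per group; each row gets a 1 in its own group and 0 elsewhere
dummies = pd.get_dummies(df["tenure_group"], prefix="tenure_group", dtype=int)

print(df)
print(dummies)
```

Each row has exactly one 1 because a customer's tenure can fall into only one bin.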

  • @user-rp5ce5od4x
    @user-rp5ce5od4x 1 year ago

    I run project and i gave input to the fields,it does not showing any output

  • @iramkhan4014
    @iramkhan4014 2 years ago

    This is a great project and I got to learn a lot from it. I wanted to ask will this be a good project for my final year dissertation?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    Yes surely

  • @akhileshkushwaha6596
    @akhileshkushwaha6596 2 years ago

    smoothen function is showing error

  • @NiteshMaurya-vl1kz
    @NiteshMaurya-vl1kz 2 months ago

    thank you sir

  • @beyou7893
    @beyou7893 2 years ago

    thank you so so much sir............

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    Welcome 🔥🔥

  • @riyatiwari4767
    @riyatiwari4767 2 years ago

    Finding telecom domain projects on youtube are very difficult. Thank you for sharing..Please share more end to end projects focused on telecom domain..

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    Definitely

  • @riyatiwari4767

    @riyatiwari4767

    2 years ago

    Please bring projects which can be showcased on CV. Like..customer segmentation, customer retention, customer call anlysis.It would be very helpful. Thanks for ur work!

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    @@riyatiwari4767 noted, pls wait for 2 weeks, you would see new Telecom projects 🔥🔥

  • @lambertagunbiade2114
    @lambertagunbiade2114 2 years ago

    thank you sir.

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    welcome sir

  • @suyogshah1108
    @suyogshah1108 1 year ago

    which algorithm is used??

  • @abdulhamidpatel230
    @abdulhamidpatel230 4 months ago

    Hey there love you teaching skills... I have a question that the EDA steps used over here can we use it as the base for different DS projects or there is something different

  • @SatyajitPattnaik

    @SatyajitPattnaik

    4 months ago

    Same 😀

  • @varunmalhotra666
    @varunmalhotra666 1 year ago

    I am getting error while implementing SMOTENN....when I am running it is giving me error 'nonetype' object has no attribute 'split'. Kindly help me to solve this error I am stucked

  • @AshishSingh-po5sh
    @AshishSingh-po5sh 27 days ago

    Project explanation is great, just one problem, when u applied smoteenn before splitting train and test data, u r getting 92% accuracy through wrong approach, on correcting this , accuracy comes down below 80%, if u could fix this issue.

  • @saimanohar4830
    @saimanohar4830 3 years ago

    Hi Satyajit. Nice End-to-End project. Could you let us know when we need to do Hyper-parameter tuning, and why is it required.

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    Once you build your prototype model, and want to take a further step in enhancing the model, you might have to do hyperparameters Optimization....

  • @sarfrazjaved330

    @sarfrazjaved330

    3 years ago

    Hyper parameter tuning is required to reduce the variance. With introducing small amount of bias we are able to reduce variance to a large extent.

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    @@sarfrazjaved330 Spot on 👍👍
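The hyperparameter optimization step discussed in this thread could look like the following sketch. It uses scikit-learn's GridSearchCV on synthetic data; the grid values are arbitrary examples, not the settings used in the video.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the prepared churn features
X, y = make_classification(n_samples=600, n_features=10, random_state=0)

# Candidate settings that control how complex the tree may become
param_grid = {"max_depth": [3, 6, 9], "min_samples_leaf": [1, 8, 20]}

search = GridSearchCV(
    DecisionTreeClassifier(random_state=0),
    param_grid,
    cv=5,            # each combination is scored with 5-fold cross validation
    scoring="f1",
)
search.fit(X, y)

print("best parameters:", search.best_params_)
print("best mean F1:", round(search.best_score_, 3))
```

Limiting depth or requiring more samples per leaf trades a little bias for lower variance, which is the effect described in the reply above.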

  • @knightbliss1
    @knightbliss1 3 years ago

    Hi Satyajit. .In addition to predicting whether a customer churned out or not, we can also predict when the customer is going to churn out for e.g. 6 months /9 months etc. How can we accomplish that?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    There's a column called as tenure, so if you are testing the model on customer X and he's in the system for 24 months, just pass tenure as 30 for that customer and u will get the results for that customer if he's going to be churned in next 6 months or not.

  • @230489shraddha
    @230489shraddha 2 years ago

    Thanks a lot for this session Satyajit .... I have a doubt that how data scientists typically finalise a machine learning model for a given problem since there are many machine learning models available?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    The model that predicts better on test data is fine tuned and pushed into production to get tested on live data, further the results are evaluated by sales and various teams, they give feedbacks whether the model is actually doing good or not, and then further remodelling is done, it's a long process

  • @x_x3557
    @x_x3557 2 years ago

    Sir, can you please tell what is the use of concatenating df1 with the input queries ?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    To test the model on a new data, I am just capturing the input values in a new dataframe and then calling model predict on that dataframe

  • @larissaonyeakagbu3631
    @larissaonyeakagbu3631 3 months ago

    Hi! How do I deploy on heroku?

  • @pardhuparvataneni1754
    @pardhuparvataneni1754 3 years ago

    why is cross validation not performed while building the model. Is there any criteria that you have considered to not use CV.

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    A lot of things can be done in this model, this is not the most optimized model, like you said CV can be performed, other feature engineering steps plus hyperparameters Optimization can also be done.
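Cross validation, mentioned above as a possible improvement, can be added with a few lines of scikit-learn. This is a sketch on synthetic data; the tree settings and the F1 scoring choice are assumptions for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.tree import DecisionTreeClassifier

# Synthetic imbalanced data in place of the churn features
X, y = make_classification(
    n_samples=1000, n_features=10, weights=[0.75, 0.25], random_state=0
)

model = DecisionTreeClassifier(max_depth=6, min_samples_leaf=8, random_state=0)

# Stratified folds keep the churn / non-churn ratio similar in every fold
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(model, X, y, cv=cv, scoring="f1")

print("F1 per fold:", scores.round(3))
print("mean F1:", scores.mean().round(3))
```

A spread of scores across folds gives a better sense of how stable the model is than a single train/test split.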

  • @shrinathlanjudkar7490
    @shrinathlanjudkar7490 3 years ago

    How can we apply Artifical Intelligence , eager and lazy learning in it, finding common patterns in it

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    You can read more about eager and lazy learning. The algorithm used in this use case is probably a decision tree, if I recall correctly, and a decision tree is an eager learner. How the learning happens is internal to the algorithm; to know more about it, you have to dig into the model's base code. Here we are just applying the algorithm and making our predictions. For example, if you want to know why decision trees are called eager learners, you will have to check how a decision tree is implemented. A link that can help you: towardsdatascience.com/machine-learning-classifiers-a5cc4e1b0623 If you still have doubts, we can connect over LinkedIn.

  • @ankanaboral3891
    @ankanaboral3891 10 months ago

    Very detailed explanation. Thanks a lot for the efforts in creating this. I have a query regarding selection of training dataset for a subscription based model. If I select historical data with churners in the last 3months, for example, would I need to restrict the subscription start date in the training dataset for creating the base data?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    10 months ago

    You can create tenure as a derived metrics based on expiration date - subscription date

  • @ankanaboral3891

    @ankanaboral3891

    10 months ago

    @@SatyajitPattnaik Thanks for replying, but my question was should I filter out the data by subscription start date too like only taking subscriptions for a particular time period? I am taking churned customers for last 3months.
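The derived tenure metric suggested in the reply above (expiration date minus subscription date) can be computed as in this sketch. The column names and dates are hypothetical, and the conversion uses an average month length of 30.44 days, which is an approximation.

```python
import pandas as pd

# Hypothetical subscription records
subs = pd.DataFrame({
    "subscription_date": pd.to_datetime(["2022-01-15", "2023-03-01"]),
    "expiration_date": pd.to_datetime(["2023-01-15", "2023-09-01"]),
})

# Difference in days, converted to whole months with an average month length
days = (subs["expiration_date"] - subs["subscription_date"]).dt.days
subs["tenure_months"] = (days / 30.44).round().astype(int)

print(subs)
```

This only covers how to derive tenure; whether to also filter on subscription start date depends on the business definition of the training population.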

  • @chamangupta4624
    @chamangupta4624 3 years ago

    Very thank u

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    Welcome Chaman

  • @shashankpandey1966
    @shashankpandey1966 1 year ago

    Could you please explain --telco_data.TotalCharges--...AS we dont know which columns contains the null data , then how should we direct write it ?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    1 year ago

    Can you point the time where you have this doubt

  • @subramanialatentview1542
    @subramanialatentview1542 1 year ago

    Can you pls post the ppt in the github it will be very helpful pls

  • @shrutimadan4451
    @shrutimadan4451 3 years ago

    Hi ,How to create the churn data in retail industry.. ?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    You have to read some papers about it.

  • @debdutesarkar6370
    @debdutesarkar6370 2 years ago

    Sir As i am from Pharmaceutical sales background,is it possible make a video on end to end Business Analytics project on pharmaceutical company used cases like ur very nice Telecom project,it would be very helpful for us.

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    there are some healthcare case studies on my channel

  • @nitishsantpur2412
    @nitishsantpur2412 2 years ago

    Very good explanantion but I have a doubt as in if we choose decision tree in this case is there a way to find out the root node? any code available to do so?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    You can plot a tree, sklearn.tree.plot_tree

  • @nitishsantpur2412

    @nitishsantpur2412

    2 years ago

    @@SatyajitPattnaik Thank you very much but I got it very small nodes I used fig, axes = plt.subplots(nrows = 1,ncols = 1,figsize = (5,5), dpi=300) tree.plot_tree(model_dt_smote,filled=True) and I am getting nodes that are not clearly visible any way to increase size of nodes and make it look clear? That would be great help!!
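For the root-node question in this thread, the fitted tree can also be inspected without a plot. The sketch below uses the iris dataset as a stand-in (an assumption, since the churn model is not reproduced here) and reads the root split directly from the fitted estimator.

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

iris = load_iris()
model = DecisionTreeClassifier(max_depth=3, random_state=0).fit(iris.data, iris.target)

# Node 0 is always the root; tree_.feature holds the column index each node splits on
root_idx = model.tree_.feature[0]
root_name = iris.feature_names[root_idx]
root_threshold = model.tree_.threshold[0]
print(f"root split: {root_name} <= {root_threshold:.2f}")

# Text rendering of the top levels, readable without zooming into a figure
print(export_text(model, feature_names=iris.feature_names, max_depth=2))
```

For the figure itself, plot_tree accepts max_depth and fontsize arguments, and a larger figsize (for example 20 by 10 inches) usually makes the nodes legible.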

  • @alloydasufferer3803
    @alloydasufferer3803 1 year ago

    1:53:00 where is it

  • @digitalnomad2196
    @digitalnomad2196 1 year ago

    anyone get else get a blank page when they press submit their values on the flask part. The results don't print for me

  • @rakshitaagrawal9546

    @rakshitaagrawal9546

    1 year ago

    Yes

  • @kusumsolanki387
    @kusumsolanki387 1 year ago

    can i get the ppt

  • @shrinathlanjudkar7490
    @shrinathlanjudkar7490 3 years ago

    Upsampling + ENN what does ENN stands for ?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    Read this: glemaitre.github.io/imbalanced-learn/generated/imblearn.combine.SMOTEENN.html I will try to make a separate video explaining about SMOTEENN in depth..

  • @doitsecretely7930
    @doitsecretely7930 2 years ago

    ValueError: X has 52 features, but DecisionTreeClassifier is expecting 50 features as input. How to Fix this problem , please?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    The error message is quite clear, you must be passing some value that is not present in original data, hence once you are doing encoding a new feature is getting generated hence you got 2 extra features
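The cause described above (an input value that never appeared in the original data creates an extra dummy column) can be caught before encoding. This is a minimal sketch; the column names, values, and the unseen_values helper are hypothetical.

```python
import pandas as pd

# Hypothetical slice of the training data
train = pd.DataFrame({
    "PaymentMethod": ["Electronic check", "Mailed check"],
    "Partner": ["Yes", "No"],
})

# Categories the model has actually seen, per column
allowed = {col: set(train[col].unique()) for col in train.columns}


def unseen_values(row: dict) -> dict:
    """Return the fields whose value was never seen during training."""
    return {
        col: val
        for col, val in row.items()
        if col in allowed and val not in allowed[col]
    }


# A lowercase typo is a different category and would add a new dummy column
bad = unseen_values({"PaymentMethod": "electronic check", "Partner": "Yes"})
print(bad)
```

Rejecting or correcting such inputs in the form handler keeps the encoded row at the feature count the classifier expects.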

  • @ShubhamAware18
    @ShubhamAware18 5 months ago

    sir can you please share the ppt?

  • @akhileshkushwaha6596
    @akhileshkushwaha6596 2 years ago

    In newer versions of imbalanced-learn, X_resampled, y_resampled = sm.fit_sample(x,y) becomes X_resampled, y_resampled = sm.fit_resample(x,y). Thanks for the video.

  • @pragmatic_p8
    @pragmatic_p8 2 years ago

    Can I know what are pre-requisites of this project?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    Python and basics of ML should be enough

  • @amankumar-vz9ds
    @amankumar-vz9ds 1 year ago

    Sir, Can we get the PPT you have used to explain the model.

  • @SatyajitPattnaik

    @SatyajitPattnaik

    1 year ago

    I don't share ppts for free, but you can create the ppts by yourself by taking ideas from mine.

  • @user-op3jd6rq7e
    @user-op3jd6rq7e 10 months ago

    sir i have a problem in model deployment in spyder. can you help me("UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in position 0: invalid start byte") this is the error anyways thanks for great and amazing video

  • @SatyajitPattnaik

    @SatyajitPattnaik

    10 months ago

    Its an encoding error, spot the error in the exact line, it must be failing while you are reading the file
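One common cause of this exact message (byte 0x80 at position 0) is opening a pickled model file in text mode instead of binary mode. Whether that is what happened here is an assumption, since the failing line is not shown; the file name below is hypothetical.

```python
import os
import pickle
import tempfile

# Write a small pickle to a temporary file (a stand-in for a saved model)
path = os.path.join(tempfile.mkdtemp(), "model.sav")
with open(path, "wb") as f:
    pickle.dump({"max_depth": 6}, f)

# Reading the pickle as UTF-8 text fails on its first byte
try:
    with open(path, "r", encoding="utf-8") as f:
        f.read()
    text_mode_error = None
except UnicodeDecodeError as exc:
    text_mode_error = exc

# Opening in binary mode ("rb") and using pickle.load works
with open(path, "rb") as f:
    loaded = pickle.load(f)

print("text mode error:", text_mode_error)
print("loaded object:", loaded)
```

If the traceback points at a line that reads the model file, switching that open call to "rb" is the first thing to try.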

  • @mutasimahmed8975
    @mutasimahmed8975 2 years ago

    can you give exact pre reqrusite for this projects ? so i learn and directly do this project

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    Python, Stats, Machine Learning is what you need to learn.

  • @harshithayadavittaboyina7390
    @harshithayadavittaboyina7390 2 years ago

    @Satyajit Pattnaik Thanks a lot for helping out and providing the vedio for college project..but I'm getting an error at the last...after submitting the user values in webpage..I'm getting the error as internal server error and the same in Anaconda prompt..I'm getting the Value error : X has 51 features,but RandomForestClassifier is expecting 50 features as input...But I'm not getting the error always but mostly...for 1st input given in CSV file I'm getting the output but for 2nd input I'm getting the output...I followed the same code given in github...could u please help..

  • @harshithayadavittaboyina7390

    @harshithayadavittaboyina7390

    2 years ago

    Sorry,for 2nd input in CSV file I'm getting output,but for 1st input I'm getting Value Error and internal server error

  • @harshilshah9628

    @harshilshah9628

    2 years ago

    @@harshithayadavittaboyina7390 Hey, did you find the solution as i am facing same kind of issue..

  • @shrutimadan4451
    @shrutimadan4451 3 years ago

    After training the model , you have shown the accuracy on test data i.e 93% but how we can get churn/Not churn against that in test dataset, not through API but in the python only

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    Yes, we can get it: run the model on the entire test data and append the prediction score in a new column, for example df["Prediction"] = model.predict_proba(X_test)[:, 1] (the probability of the positive class). Just giving you ideas.
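Storing predictions next to the test rows, as suggested above, can be sketched as follows. Note that predict_proba returns one column per class, so a single column has to be selected before assigning it. The data is synthetic and the column names are made up for illustration.

```python
import pandas as pd
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the encoded churn features
X, y = make_classification(n_samples=500, n_features=8, random_state=0)
X = pd.DataFrame(X, columns=[f"f{i}" for i in range(8)])
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

model = DecisionTreeClassifier(max_depth=6, min_samples_leaf=8, random_state=0)
model.fit(X_train, y_train)

results = X_test.copy()
results["Actual"] = y_test
results["Prediction"] = model.predict(X_test)  # hard label: 0 or 1
# Column 1 of predict_proba is the probability of the positive class
results["ChurnProbability"] = model.predict_proba(X_test)[:, 1]

print(results[["Actual", "Prediction", "ChurnProbability"]].head())
```

No loop is needed; predict and predict_proba score the whole test frame in one call.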

  • @shrutimadan4451

    @shrutimadan4451

    3 years ago

    @@SatyajitPattnaik Thanks for the clarification. I will try that . One more ques . - We have done the correlation part, so where we used that? those variables which were not affecting the target variable can be deleted before model building.. Please let me know if i m thinking in right way ? Or correct me if i m wrong. would be really helpful.

  • @shrutimadan4451

    @shrutimadan4451

    3 years ago

    Thanks for the clarification. I will try that . Two more ques . - 1.We have done the correlation part, so where we used that? those variables which were not affecting the target variable can be deleted before model building.. Please let me know if i m thinking in right way ? Or correct me if i m wrong. would be really helpful. 2.max_depth=6, min_samples_leaf=8 , how we have decided this in this e.g. ?

  • @user-mh2to4bq6y
    @user-mh2to4bq6y 9 months ago

    having problem while running it, could you please help me out. while running app.py error occur in browser.... Internal Server Error The server encountered an internal error and was unable to complete your request. Either the server is overloaded or there is an error in the application.

  • @SatyajitPattnaik

    @SatyajitPattnaik

    9 months ago

    Pls go through comments, many had this issue because of the input that they pass

  • @AIMLOdeysey
    @AIMLOdeysey 3 years ago

    Hi Satyajit, May I ask you to share the link for python code used in today’s session!!

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    I will upload it tomorrow first half for sure, it's a busy day for me today, thanks for understanding!!

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    github.com/pik1989/MLProject-ChurnPrediction

  • @digitalnomad2196
    @digitalnomad2196 1 year ago

    For deployment can you please explain what is in df_1 ? and why you concat with new_df to create df_2. I am working on a similar project with this data set.

  • @digitalnomad2196

    @digitalnomad2196

    1 year ago

    When I check that df_1 'first_telc.csv'. It has 19 columns and 75 rows ? can you please explain where this comes from

  • @digitalnomad2196

    @digitalnomad2196

    1 year ago

    and why you concat this with the data coming from the user ?

  • @national3737
    @national3737 3 years ago

    Sir how we can show the reason that he is churning

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    That's a second problem, churn or not churn is one classification problem, you can also train a multiple class classification by having classes like: not churned, churned due to service, churned due to cost etc, so it's a multi class classification 👍

  • @suyoghole3501
    @suyoghole3501 2 years ago

    hey man create an recommendations engine to the people not get churn recommended something so they got customers back N already have they get more interested

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    Sure, will build one

  • @avinashnair5064
    @avinashnair5064 3 years ago

    Hello sir can you tell me the source from where did you get the Dataset from?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    It's a public dataset easily available on Kaggle and various other websites.

  • @avinashnair5064

    @avinashnair5064

    3 years ago

    @@SatyajitPattnaik thankyou sir

  • @onestopzz6446
    @onestopzz6446 1 year ago

    Is it mandatory to be good enough to write code on our own, If we are looking for data engineer jobs? .........from a final year student

  • @SatyajitPattnaik

    @SatyajitPattnaik

    1 year ago

    Not mandatory

  • @iramkhan4014
    @iramkhan4014 2 years ago

    Sir, If we use KNN, Random Forest, SVM, Decision Trees, Linear regression. Then which one would be the most accurate model?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    Its very difficult to say, we should always try all the models and compare them

  • @iramkhan4014

    @iramkhan4014

    2 years ago

    @@SatyajitPattnaik Okay sir. Thank you so much for the quick reply!

  • @dhanushn1787
    @dhanushn1787 2 years ago

    I am getting Internal server error When we run the project

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    You need to see the error and fix it, its a very basic project and lot of error handling is required

  • @codesoupp
    @codesoupp 2 years ago

    Hi sir..can i use this in data analyst role resume?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    1 year ago

    Can, as Predictive Analytics is a part of Analytics :)

  • @iqballatifable
    @iqballatifable 3 years ago

    thank you for the tutorial, i have a question, when im running all your code, ValueError: X has 64 features, but DecisionTreeClassifier is expecting 45 features as input. can you explain it? thank you

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    Shouldn't be, can u check again, many people are using this, And it works fine. If u can't figure out, upload your code on GitHub, share me the link and tell me which line is it failing..

  • @iqballatifable

    @iqballatifable

    3 years ago

    Ok I will check it again, I hope running well like others, thank you very much

  • @iqballatifable

    @iqballatifable

    3 years ago

    @@SatyajitPattnaik Your code is awsome, it's really work finally, thank you very much

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    @@iqballatifable thanks Iqbal, do share my channel with your friends & colleagues 🔥🔥

  • @ashwathvinod516

    @ashwathvinod516

    3 years ago

    @@iqballatifable I too got same error how did you resolve it

  • @naziaakhtar8575
    @naziaakhtar8575 2 years ago

    how do we incorporate the 6 month or 3 month prediction into this?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    Your time related parameters will play a vital role here, let say, you want to test your existing customers, and latest tenure period of that customer is 27 months, if u want to test whether he stays after 6 months or not, u need to pass the tenure as 33 for that customer with other details..

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    there are other columns that would change as well, like total charges, age etc etc..
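The idea described in this thread (score the customer as they would look some months from now) can be sketched as a small helper. The project_customer function and the flat-charges assumption are illustrative only and are not taken from the video's code; the column names follow the Telco dataset convention.

```python
import pandas as pd


def project_customer(row: pd.Series, months_ahead: int) -> pd.Series:
    """Return a copy of the customer row as it would look months_ahead months later."""
    projected = row.copy()
    projected["tenure"] = row["tenure"] + months_ahead
    # Assumption: the monthly charge stays flat over the projection window
    projected["TotalCharges"] = (
        row["TotalCharges"] + row["MonthlyCharges"] * months_ahead
    )
    return projected


# Hypothetical customer who has been with the company for 27 months
customer = pd.Series({"tenure": 27, "MonthlyCharges": 70.0, "TotalCharges": 1890.0})
future = project_customer(customer, months_ahead=6)

print(future)
```

The projected row would then go through the same encoding as any other input before being passed to the model. This is a what-if input, not a true time-to-churn model.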

  • @KumarHemjeet
    @KumarHemjeet 3 years ago

    You have label encoded the data before splitting the data into test and train that is why you are getting such high precision and recall. There is data leakage here.

  • @hssp1534

    @hssp1534

    2 years ago

    But isn't that normal to do

  • @DilipKumar-ww5si
    @DilipKumar-ww5si 3 years ago

    Hi Satyajit, Thank you for your detailed steps to implement End-to-End ML with deployment. I have a few questions, can you please clarify? Please explain the usage of the following lines:
    df_1=pd.read_csv("first_telc.csv") # how did you get this file "first_telc.csv"?
    df_2 = pd.concat([df_1, new_df], ignore_index = True)
    single = model.predict(new_df__dummies.tail(1))
    As per the last line, you are considering only the input which we enter from html, so the df_1, df_2 steps may not be required. If I just process the input from html using pd.get_dummies and then do model.predict, this will be enough for prediction. Am I correct? Anyhow, thank you once again for your contribution.

  • @DilipKumar-ww5si

    @DilipKumar-ww5si

    3 years ago

    Hi Satyajit, I found the reason for using 'first_telc.csv', which answers the clarifications I raised above. You have to take some sample data and append your input to it, because only then are all combinations of categorical values covered, so that pd.get_dummies produces the same number of columns as the analysis file. You load that CSV file to match the 51 columns. However, to predict, you use only the last row (tail), which is the one entered by the user in the UI. Thanks a lot.
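An alternative to concatenating sample rows is to save the list of training columns and reindex the encoded input against it. The sketch below uses made-up data and column names; it is not the project's `app.py`, only an illustration of the column-alignment idea described in this comment.

```python
import pandas as pd

# Hypothetical training frame; in the project this would be the data used to fit the model.
train = pd.DataFrame({
    "Contract": ["Month-to-month", "One year", "Two year"],
    "PaymentMethod": ["Electronic check", "Mailed check", "Credit card"],
    "MonthlyCharges": [70.0, 55.5, 99.9],
})
train_dummies = pd.get_dummies(train)
training_columns = list(train_dummies.columns)  # store this list next to the model

# A single row as it might arrive from the HTML form.
new_row = pd.DataFrame({
    "Contract": ["One year"],
    "PaymentMethod": ["Mailed check"],
    "MonthlyCharges": [60.0],
})

# Encoding one row alone yields only the categories present in that row,
# so align it to the training layout and fill the absent dummies with 0.
new_dummies = pd.get_dummies(new_row).reindex(columns=training_columns, fill_value=0)
print(new_dummies.shape)
```

With this approach the single row always has the same columns, in the same order, as the data the model was fitted on, regardless of which categories the user picked.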

  • @thingsicando9662

    @thingsicando9662

    2 years ago

    Okay a

  • @bilalahmad3730

    @bilalahmad3730

    2 years ago

    @@DilipKumar-ww5si Very helpful, thanks

  • @anepomnjashiy
    @anepomnjashiy 2 years ago

    First, thank you very much for this video! Second )))) - can you provide subtitles for it?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    Well, it's in English already, let me know if you're facing any difficulties

  • @anepomnjashiy

    @anepomnjashiy

    2 years ago

    Yes, I'm experiencing some difficulties understanding the pronunciation. It would be very kind of you to enable subtitles for this video. Thank you again for this great content!

  • @datascienceworld
    @datascienceworld 1 year ago

    Can any of you please tell me what is the difference between monthly charges and total charges? Thanks in advance.

  • @SatyajitPattnaik

    @SatyajitPattnaik

    1 year ago

    In the telecom churn dataset, "Monthly Charges" refers to the total amount charged to the customer for their telecom services each month, while "Total Charges" refers to the total amount charged to the customer over the entire duration of their service with the telecom company.

  • @datascienceworld

    @datascienceworld

    1 year ago

    @@SatyajitPattnaik Thanks a lot.

  • @ManigandlaSanjay-uo1dk
    @ManigandlaSanjay-uo1dk 8 months ago

    I get "'SMOTEENN' object has no attribute 'fit_sample'" and "name 'X_resampled' is not defined". Can you explain the error and give the correct answer, sir, please?

  • @ManigandlaSanjay-uo1dk

    @ManigandlaSanjay-uo1dk

    8 months ago

    Sir please reply

  • @SatyajitPattnaik

    @SatyajitPattnaik

    8 months ago

    It's changed to "fit_resample"; please use ChatGPT for faster resolution :)

  • @jaideepsingh8109
    @jaideepsingh8109 2 years ago

    Thank you brother, You saved me from failing my assignment. You are really a Savior sent by god.

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    Thanks Jaideep ☺️😍

  • @sasumsudha
    @sasumsudha 3 years ago

    Dear sir, what is the first_telc.csv file used in app.py? Please explain.

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    It's not used anywhere, pls ignore.

  • @genuinelag9936

    @genuinelag9936

    5 months ago

    Yes, I also have the same doubt.

  • @VenuGopal-dr8ln
    @VenuGopal-dr8ln 3 years ago

    Hi, every industry predicts whether a person will churn or not. How about making this project more realistic with prescriptive analytics: converting likely churners into non-churners, and, if a person has already churned, offering them a good premium to win them back? It would be helpful to the entire community. Please make a video on this.

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    Good suggestion, but the part you explained is really difficult to implement, as it needs cost analysis and ROI calculation, and it needs sales team involvement too, which is really difficult to show.

  • @VenuGopal-dr8ln

    @VenuGopal-dr8ln

    2 years ago

    @@SatyajitPattnaik Thanks for your reply. Please can you make a video on reinforcement learning with banking or insurance data? It will be helpful for aspiring data scientists. Hope to see that video soon 🎉

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    @@VenuGopal-dr8ln Reinforcement learning is a huge topic, let me know a specific topic that you want, and I can make a video on it.

  • @VenuGopal-dr8ln

    @VenuGopal-dr8ln

    2 years ago

    @@SatyajitPattnaik I have an idea for customer retention in the financial industry, but I don't know how to implement reinforcement learning, and it requires huge data. My idea is to implement prescriptive analytics in churn prediction. Any ordinary ensemble algorithm can predict churn and non-churn customers. Then, taking the churn customers' data, we apply prescriptive analytics: before a customer churns, we recommend some premiums to them, using LSTM models, reinforcement learning, and time series. LSTM is needed because we need to focus on the customer's past behaviour with the company; time series is needed because it captures the customer's journey and how loyal they are to the company; and reinforcement learning is needed because we need to train the model continuously so that it learns by itself for future data.

  • @gauravmore8578
    @gauravmore8578 2 years ago

    Hey, I'm getting this error. What am I doing wrong? ValueError: X has 65 features, but RandomForestClassifier is expecting 50 features as input.

  • @ittaboyinaharshithayadav7759

    @ittaboyinaharshithayadav7759

    2 years ago

    Yeah, me too. Could you please help? On the webpage I get "Internal Server Error", and in the Anaconda prompt I get a ValueError: X has 51 features, but RandomForestClassifier is expecting 50 features as input.

  • @ittaboyinaharshithayadav7759

    @ittaboyinaharshithayadav7759

    2 years ago

    I don't get the error every time, but about 3 times in 5. For the 2nd input I get the output, but for the 1st I get the error.

  • @gauravmore8578

    @gauravmore8578

    2 years ago

    @@ittaboyinaharshithayadav7759 Did you manage to get output?

  • @harshilshah9628

    @harshilshah9628

    2 years ago

    @@gauravmore8578 Were you able to solve it?

  • @ittaboyinaharshithayadav7759

    @ittaboyinaharshithayadav7759

    2 years ago

    @@gauravmore8578 No, I'm not getting the output every time. In most cases I get ValueError: X has 51 features, but RandomForestClassifier is expecting 50 features as input.
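One way to track down a feature-count mismatch like this is to compare the columns of the frame being predicted on against the columns the model was trained on. The sketch below is hypothetical: the helper `diagnose_columns`, the column names, and the stray `Unnamed: 0` column (which pandas creates when a CSV was saved with its index) are illustrative guesses at a possible cause, not a confirmed diagnosis of this project.

```python
import pandas as pd


def diagnose_columns(expected, frame):
    """Return (missing, extra) column names relative to the expected training layout."""
    actual = list(frame.columns)
    missing = [c for c in expected if c not in actual]
    extra = [c for c in actual if c not in expected]
    return missing, extra


# Hypothetical list of columns the model was fitted on.
expected = ["tenure", "Contract_One year", "Contract_Two year"]

# Hypothetical frame built at prediction time, with one stray and one absent column.
frame = pd.DataFrame({"Unnamed: 0": [0], "tenure": [12], "Contract_One year": [1]})

missing, extra = diagnose_columns(expected, frame)
print("missing:", missing)
print("extra:  ", extra)

# Once the difference is understood, drop extras and add absent dummies as 0.
aligned = frame.reindex(columns=expected, fill_value=0)
```

If the model was fitted on a DataFrame with scikit-learn 1.0 or later, the expected list can be read from the fitted estimator's `feature_names_in_` attribute instead of being maintained by hand.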

  • @abhishek7718
    @abhishek7718 3 years ago

    Sir, there are many object columns. Why are we not converting them to numeric?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    We should convert them

  • @abhishek7718

    @abhishek7718

    3 years ago

    @@SatyajitPattnaik When will we convert them?

  • @akshayrajput2043
    @akshayrajput2043 11 months ago

    Hello sir, will SAP FICO + PCAD (Certified Associate in Data Analytics with Python) be beneficial?

  • @risasalimp587

    @risasalimp587

    11 months ago

    Please help. Did you work out what the findings and results are?

  • @akshayrajput2043

    @akshayrajput2043

    11 months ago

    @@risasalimp587 yes it will

  • @srujankumar1838
    @srujankumar1838 3 years ago

    Hi Satyajit, thank you for this well-done project. I have a data science interview this week; can I present this end-to-end project to the interviewer?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    3 years ago

    Of course you can.

  • @susantakumarsahoo1982

    @susantakumarsahoo1982

    2 years ago

    @@SatyajitPattnaik Are you from Odisha?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    @@susantakumarsahoo1982 yes

  • @chaitu037

    @chaitu037

    2 years ago

    I am planning the same too. Thank you for the in-depth explanation.

  • @user-bj7mk1oz5l
    @user-bj7mk1oz5l 11 months ago

    How do I get the customer churn dataset?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    11 months ago

    Check the video description.

  • @madhavisantpur5967
    @madhavisantpur5967 2 years ago

    Hi, I am using a highly imbalanced dataset: 99% no and 1% yes (I am interested in yes). I am getting very low precision, recall, and F1 score. I tried printing the thresholds, but they are very low. What can I do to increase recall and F1 score?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    Did you apply upsampling or SMOTE techniques?

  • @madhavisantpur5967

    @madhavisantpur5967

    2 years ago

    @@SatyajitPattnaik I am using AutoML, and yes, I tried class weighting, but the scores don't improve much. I am getting good scores when the threshold is 0.03, but that is not desirable, right?

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    @@madhavisantpur5967 Need to check the code to comment further.
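On the threshold question in this thread: when only about 1% of rows are positive, predicted probabilities tend to be small, so a threshold well below 0.5 is not necessarily wrong. A common approach is to pick the threshold on a validation set by the metric you care about. The sketch below does this in plain Python on made-up scores; the helper names and the candidate grid are illustrative assumptions.

```python
def precision_recall_f1(y_true, scores, threshold):
    """Compute precision, recall and F1 for the positive class at a given threshold."""
    preds = [1 if s >= threshold else 0 for s in scores]
    tp = sum(1 for t, p in zip(y_true, preds) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, preds) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, preds) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def best_threshold(y_true, scores, candidates):
    """Return the candidate threshold with the highest F1."""
    return max(candidates, key=lambda t: precision_recall_f1(y_true, scores, t)[2])


# Made-up validation labels and predicted probabilities for the positive class.
y_true = [0, 0, 0, 0, 0, 0, 0, 0, 1, 1]
scores = [0.01, 0.02, 0.02, 0.03, 0.04, 0.05, 0.06, 0.30, 0.08, 0.40]
candidates = [0.05, 0.07, 0.1, 0.35, 0.5]

best = best_threshold(y_true, scores, candidates)
print(best, precision_recall_f1(y_true, scores, best))
```

The threshold should be chosen on held-out validation data and then checked once on a separate test set, otherwise the reported scores will be optimistic.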

  • @anketsonawane6651
    @anketsonawane6651 2 years ago

    Someone please help me. I am confused about the difference between customer segmentation and customer churn.

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    Customer segmentation is an unsupervised technique in which you create customer clusters based on their behaviour. When you have a variable that tells you who is churning and who is not, you can build a classification model to predict whether a new customer will churn.

  • @anketsonawane6651

    @anketsonawane6651

    2 years ago

    @@SatyajitPattnaik thanks :)

  • @SatyajitPattnaik

    @SatyajitPattnaik

    2 years ago

    @@anketsonawane6651 I can make a video on customer segmentation if you promise me 100 subs 😂