Beginner Kaggle Data Science Project Walk-Through (Titanic)

In this video I walk through an entire Kaggle data science project. I use the Titanic Kaggle competition to show you how I start thinking about the problem. I also show you the systematic approach that I use to explore the data, build the models, and submit the solution.
Kaggle notebook: www.kaggle.com/kenjee/titanic...
My Kaggle Profile: www.kaggle.com/kenjee
Feel free to follow along with the code! You don't need to understand everything that is going on under the hood of the algorithms. For a beginner, learning to implement them should be enough.
This video covers
- Project Planning
- Data exploration
- Data Visualization (light)
- Replacing null values
- Feature engineering
- Data Cleaning
- Model Production
- Model Tuning
- Kaggle model submission
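
The steps listed above can be sketched end to end in a few lines. This is only a minimal illustration, not the notebook's code: the data is a synthetic stand-in for the Titanic training file, and the engineered columns (`family_size`, `log_fare`) and the logistic regression baseline are assumptions chosen for the example.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Synthetic stand-in for the Titanic training data (NOT the real file).
rng = np.random.default_rng(0)
n = 200
df = pd.DataFrame({
    "Age": rng.normal(30, 12, n).clip(1, 80),
    "Fare": rng.gamma(2.0, 15.0, n),
    "SibSp": rng.integers(0, 4, n),
    "Parch": rng.integers(0, 3, n),
    "Sex": rng.choice(["male", "female"], n),
})
# Made-up target: mostly driven by Sex, with 20% of labels flipped.
df["Survived"] = ((df["Sex"] == "female") ^ (rng.random(n) < 0.2)).astype(int)
df.loc[rng.choice(n, 30, replace=False), "Age"] = np.nan  # inject missing ages

# Replacing null values: impute Age with the median.
df["Age"] = df["Age"].fillna(df["Age"].median())

# Feature engineering (hypothetical features for illustration).
df["family_size"] = df["SibSp"] + df["Parch"] + 1
df["log_fare"] = np.log1p(df["Fare"])

# Data cleaning: one-hot encode the categorical column.
X = pd.get_dummies(
    df[["Age", "log_fare", "family_size", "Sex"]], drop_first=True, dtype=float
)
y = df["Survived"]

# Model production: a 5-fold cross-validated baseline.
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=5)
print(scores.mean())
```

Tuning and submission would follow the same pattern: search over hyperparameters with cross-validation, then predict on the unlabeled test file and write the predictions to a CSV.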

Comments: 439

  • @KenJee_ds
    3 years ago

    Thanks for watching! Feel free to upvote the kaggle notebook if you found it helpful! Kaggle notebook: www.kaggle.com/kenjee/titanic-project-example My Kaggle Profile: www.kaggle.com/kenjee Try watching my kaggle project from scratch series next! kzread.info/dash/bejne/pGF4tJuBcsTPoLg.html&ab_channel=KenJee

  • @olabisioremade4784

    3 years ago

    Hi Ken, please, how long does the whole 365datascience data science course take?

  • @vedantbhardwaj4058
    3 years ago

    I gotta be honest here, started learning Data Science on my own but every now and then I become lazy AF and I just stop for a period of 2-3 weeks. It's difficult to be consistently committed to the program and learning. Although I hope I slowly complete the training.

  • @KenJee_ds

    3 years ago

    It happens to me as well! I use youtube to have people hold me accountable to continued learning. Maybe find a friend or someone to keep you on top of your learning journey!

  • @omotayoonike9825

    1 year ago

    Please don't bring this bad energy here. Everybody who is a data scientist feels the same; even I don't want to do it sometimes, and God knows it's difficult. But if you stick around the barber shop long enough, you will get your hair cut. You can become somebody through data science or by other means, but all of it is difficult.

  • @davidologunoba4703

    1 year ago

    Same sort of situation with me. But you know what, let's keep moving, we can do it!

  • @struanclark5971
    3 years ago

    Your content is always top class, Ken! As a beginner in this field, you’ve taught me so much through your videos. Please keep them coming.

  • @KenJee_ds

    3 years ago

    Thanks for the kind words Struan! I will definitely keep them coming!

  • @larryhatcher8927
    1 month ago

    I took several days and went through this. It was a good starting point. You simply can't learn it all in a few days. As Ken said, this is to be used as a framework. Learning the various models and how to evaluate them is extremely important.

  • @paigec5017
    3 years ago

    This video came at such good timing! I just taught myself python and started the titanic project today but was feeling so unsure about everything! Thank you for your videos!!

  • @KenJee_ds

    3 years ago

    Great stuff! Thank you for watching them!

  • @emmanuelagyemang3738

    1 year ago

    How did you teach yourself how to code?

  • @sarthaksharma070
    3 years ago

    Great video dude, exactly what I was looking for. It's really great to see creators actually listening to the audience and working on it. Keep it up, pal.

  • @KenJee_ds

    3 years ago

    Glad it was what you were looking for!! Thanks for watching!

  • @hornfan722
    3 years ago

    Thanks Ken. I've never used Kaggle or even done any data science projects. The detailed analysis (including the nuances, MOST IMPORTANTLY) is really making this digestible, not to mention applicable.

  • @KenJee_ds

    3 years ago

    Will do my best to include even more nuances going forward!

  • @adamploof3528
    3 years ago

    Looking forward to more videos like this. It's incredibly helpful to get an experienced viewpoint on how to think about and dissect these sorts of problems.

  • @KenJee_ds

    3 years ago

    Glad you found it helpful! Thanks for watching Adam!

  • @mehsaniphysicsmathmatics2147
    2 years ago

    Thank you Ken, just now I finished one course that focused on Titanic survival, your attitude makes more sense for me.

  • @KenJee_ds

    2 years ago

    Awesome!! Thank you for following along!

  • @sandrafield9813
    3 years ago

    Thank you so much for your videos, I watch them all the time. I'm in a master's DS program, and I feel like I'm actually on the Titanic right now, going down down down. Here you are handing me a raft, a dinghy, and also giving me a map to a huge lush close-by island where there's an escape airport.

  • @KenJee_ds

    3 years ago

    Thanks for watching them Sandra!! I also love the analogy haha. Hopefully, one day I will provide you with a more stable yacht so you can enjoy the data science journey in style!

  • @nikhilatluri1569
    3 years ago

    Thanks, Ken Jee, for spending your time during this lockdown educating youngsters like us.

  • @KenJee_ds

    3 years ago

    Glad I could help!

  • @alexmyers3716
    1 year ago

    I'm here because of GPT4. Before GPT4 was released, I had a decent basic understanding of data science applications, but did not have the time to learn all of the Python syntax. Now, with GPT4, all I have to do is understand how to explain what I want to do, and GPT4 takes care of all the coding. It wouldn't be hard to create this entire notebook in 2-3 hours. Wild times we live in!

  • @MaiNguyen-nl3pp
    3 years ago

    You have saved us hours of self-exploration! Thank you, Ken :D Hope you can make more videos like this!

  • @KenJee_ds

    3 years ago

    You should still definitely self explore as well! Thank you for watching, more to come!

  • @solaawodiya7360
    3 years ago

    Hi Ken, thanks for the help on learning about data science. I struggle a lot using Kaggle to learn python. The user experience for me is quite intimidating compared to other platforms I used as there are times even when I know the question, I get lost on how to answer and follow the steps.

  • @moghegaurav
    3 years ago

    Love your videos, Ken. They are no-nonsense and stick to just DS. Your content is well made up and your voice is clear. Thanks for sharing your knowledge. I am sure with such quality content you will soon hit 100k subscribers and more.

  • @KenJee_ds

    3 years ago

    Thanks for the kind words and for watching my videos!

  • @aimenbaig6201
    3 years ago

    You are my absolute guide to data science. THANK YOU KEN

  • @KenJee_ds

    3 years ago

    Thanks for the kind words Aimen!! Glad the videos have been helpful!

  • @lucrieffel5018
    3 years ago

    This video was extremely helpful, I have been searching the internet for a video that would walk me through this exact project! Your videos are the best, keep up the good work!

  • @KenJee_ds

    3 years ago

    Excellent! Glad it was what you were looking for!

  • @chinmaygondhalekar2591
    3 years ago

    Just the notification I was waiting for thanks man 👍

  • @KenJee_ds

    3 years ago

    I hope you enjoy!

  • @justinhuang8034
    3 years ago

    Love your content man! Keep it up 100k subs is just around the corner!

  • @KenJee_ds

    3 years ago

    Thanks Justin! Glad to hear the content has been useful to you!

  • @ahmedhassan9379
    2 years ago

    Thanks so much, I feel happy that I could understand 90% of the content. Months ago I didn't know a thing!

  • @KenJee_ds

    1 year ago

    Amazing!!

  • @JBB685
    3 years ago

    Would you consider doing one for the linear regression example you suggested in your 3 beginners’ projects? It’s the Ames housing prices project.

  • @ashikka5902
    3 years ago

    Thank you Ken! Doing this first thing in the morning tomorrow!

  • @KenJee_ds

    3 years ago

    I hope it helps!!

  • @wasimraja2980

    3 years ago

    Done?

  • @josefftan1203
    3 years ago

    Aw, kaggle series here we goooo ♥️

  • @KenJee_ds

    3 years ago

    Enjoy!

  • @fahadreda3060
    3 years ago

    Thanks Ken, I was waiting for this video , Good Luck

  • @KenJee_ds

    3 years ago

    I hope you enjoy it Fahad!

  • @anurekha137
    3 years ago

    I am glad that I came across your channel. I always wanted to try the Titanic dataset on Kaggle but didn't. Now I'm gonna try it. Thanks.

  • @KenJee_ds

    3 years ago

    That is one of my favorite things to hear! It makes me really happy that my video helped you get started!

  • @mustafamegahed7873
    3 years ago

    Great job! Thank you so much! Sadly, I have some work at college and couldn't finish the video but I will definitely come back to it hopefully next week.

  • @KenJee_ds

    3 years ago

    No problem! It is there for you to learn at your own pace!

  • @henriquebonacelli2981
    3 years ago

    Man, great video! I'm starting out in data science and this hands-on project explanation was super helpful!

  • @KenJee_ds

    3 years ago

    Glad to hear it was helpful! Thank you for watching!

  • @s8x.
    1 month ago

    thanks for this video. Just started this problem and realized I have no idea what I'm doing

  • @arick2050
    3 years ago

    Super informative, thanks Ken!

  • @KenJee_ds

    3 years ago

    Thanks for watching Aric!!

  • @dakadoodle6295
    3 years ago

    Literally was looking at this today

  • @KenJee_ds

    3 years ago

    Awesome!

  • @alyona1311
    1 year ago

    I learned so much from your video, thank you!

  • @KenJee_ds

    1 year ago

    Amazing! Thank you for watching!

  • @abdelrahmanashraf7636
    2 years ago

    Thanks a lot for this video. I had been learning a lot of things and didn't know how to tie them all together. This video did that. Thanks a lot, Ken Jee :)

  • @KenJee_ds

    2 years ago

    Thanks for checking it out!

  • @jfr543
    3 years ago

    This video is gold!

  • @KenJee_ds

    3 years ago

    Thanks for the kind words! I'm glad you found it helpful!

  • @mohithedaoo6968
    3 years ago

    This was much needed... Thank you very much!!

  • @KenJee_ds

    3 years ago

    Happy I could help! Thank you for watching!

  • @arthurmlcc
    3 years ago

    Keep up the great work you've been doing on this channel, Ken. It's really helping us beginners.

  • @KenJee_ds

    3 years ago

    I absolutely will! Thanks for watching!

  • @communicationvast9949
    1 year ago

    Fantastic video, my friend. I started this project in RStudio, ran into some walls, and got extremely frustrated. Listening to your process is extremely helpful. Thanks for the upload.

  • @KenJee_ds

    1 year ago

    Thanks for watching!! Really glad to hear it was helpful

  • @DatascienceConcepts
    3 years ago

    Nice insights Ken Jee. In fact I remember working with this dataset in my early days of ML :)

  • @KenJee_ds

    3 years ago

    Awesome! I definitely think this dataset is a great starting point. It was even helpful for me to go back and review some of the basics!

  • @Om-id1qr
    1 year ago

    I'd like to say that I discovered a gem of a channel today.

  • @KenJee_ds

    1 year ago

    Makes me really happy to hear!

  • @jonasschroder7244
    3 years ago

    Great! Very inspiring and helpful!

  • @KenJee_ds

    3 years ago

    Thanks for watching Jonas!

  • @albertosei3558
    9 months ago

    I will try this very soon. Bookmarking this

  • @KenJee_ds

    9 months ago

    💪

  • @augustthenerd4213
    1 year ago

    Thanks for the video! I have some DS experience but it was very helpful to see how an expert would approach a Kaggle problem.

  • @KenJee_ds

    1 year ago

    Thanks for watching!

  • @DataProfessor
    3 years ago

    Ken, Great video and great initiative! Sounds like fun, I also haven't done a Kaggle submission yet, will follow your path and do one soon.

  • @KenJee_ds

    3 years ago

    Let's definitely partner on one!

  • @salikmalik7631

    3 years ago

    @KenJee_ds Yes, it'll be great to watch.

  • @DataProfessor

    3 years ago

    @KenJee_ds Yes, let's definitely do that 😃

  • @sauravsahay8803
    1 year ago

    I keep getting tired and demotivated and I keep picking myself up to learn this :(

  • @kefahelhelou9418
    1 year ago

    Thanks for the great efforts

  • @KenJee_ds

    1 year ago

    Thanks for watching!

  • @fablab21
    3 years ago

    Since you made a confession at the beginning, Imma hit you with one myself: I've been trying to study DS consistently for a year and a half and bruh... I find it incredibly frustrating. I do not feel particularly smart enough to do projects on my own, but I really like your content, so I will stick around. 😬

  • @KenJee_ds

    3 years ago

    Confessions are important! I am confident you can do it. I think you would actually be quite surprised at the progress you've made. I've come a long way myself, and even now I still have impostor syndrome or feel like I don't know as much as I should. I would watch my "the data scientist's secret video", I think it may give you a little boost! kzread.info/dash/bejne/int2y8mjhtyYddI.html

  • @ramonsantiago4573

    3 years ago

    IMO it's unlikely that you're not smart enough to learn this stuff; it's probably the way you go about learning it. You need to spend a lot of time on the basics and have a really good understanding of Python. It's hard... I personally kept trying to jump ahead and go through concepts as fast as possible, but it didn't really work. However, now that I've been studying at a slower pace, everything is starting to make sense, and I managed to complete a few ML projects completely by myself. A really good slow-paced course that teaches the majority of the basics was "Python for Data Science and Machine Learning Bootcamp" by Jose Portilla. Good luck!

  • @moajjem04
    3 years ago

    This video is a great help!

  • @KenJee_ds

    3 years ago

    Glad to hear! Thank you for watching!

  • @imakonkonvicted
    3 years ago

    Thanks! I will try to do this alongside your video! :D

  • @KenJee_ds

    3 years ago

    Awesome! Would love to hear how it goes!

  • @hendrywijaya1017
    2 years ago

    Ken, I think the histogram and boxplot step in the project plan should come after the missing-data step. So here's the plan order from the top:
    - understand the type of data
    - value counts
    - missing data
    - histogram and boxplot
    Then continue with the steps you laid out, from correlation analysis and exploring interesting facts through to scaling.

  • @omjeeshukla5758
    3 years ago

    I don't understand who these people are who dislike the video. If you can't support him, at least stop disliking. Someone is putting in effort to make knowledge and the learning process easy; what is the problem for you dislikers?

  • @KenJee_ds

    3 years ago

    Thanks for looking out for me omjee! All work has its detractors though. I am always looking to improve, so constructive feedback is welcomed!

  • @zahinnazhan7200
    3 years ago

    This is a great walkthrough for a beginner like me. Thanks, Ken Jee.

  • @KenJee_ds

    3 years ago

    Glad it was helpful Zahin!

  • @kbillotta
    3 years ago

    Thanks Ken... I just got my physics degree and I want to become a data scientist. Your videos are helping a lot! Thanks

  • @KenJee_ds

    3 years ago

    That's what I like to hear! Thanks for watching!

  • @surajkumarmaurya8088
    3 years ago

    Thanks a lot, Sir, this helped me a lot to clear my doubts.

  • @KenJee_ds

    3 years ago

    Great!! Thanks for watching!

  • @sadiakamal6866
    3 years ago

    Great job. Please do these sorts of videos more often!

  • @KenJee_ds

    3 years ago

    Thank you for watching! Will definitely be trying to make more of these!

  • @denizbalkaya8356
    3 years ago

    Hi Ken....Deniz is speaking from Turkey! Your videos are helping me a lot! You force me to keep up :)

  • @KenJee_ds

    3 years ago

    Glad to hear they are helping! Thank you for watching!

  • @dxzgamingtricks5938
    1 year ago

    you are a genius!!!!

  • @KenJee_ds

    1 year ago

    Thanks for watching!

  • @amrelshabasy1183
    1 year ago

    Thanks, Ken, for this great video. Can you please explain how you measured that the XGBoost model is overfitting?
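
One common way to check for overfitting, which may or may not be what the video does, is to compare accuracy on the training data with cross-validated accuracy: a large gap suggests the model has memorized the training set. The sketch below is an assumption-laden illustration, using scikit-learn's `GradientBoostingClassifier` as a stand-in for XGBoost and synthetic data instead of the Titanic features.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import cross_val_score

# Synthetic, noisy data (flip_y adds label noise that cannot be learned).
X, y = make_classification(
    n_samples=300, n_features=10, n_informative=4, flip_y=0.2, random_state=0
)

# Stand-in for XGBoost: a fairly flexible boosted-tree model.
model = GradientBoostingClassifier(n_estimators=100, max_depth=4, random_state=0)

# Accuracy on held-out folds.
cv_acc = cross_val_score(model, X, y, cv=5).mean()

# Accuracy on the same data the model was fit on.
train_acc = model.fit(X, y).score(X, y)

gap = train_acc - cv_acc
print(f"train={train_acc:.3f}  cv={cv_acc:.3f}  gap={gap:.3f}")
```

On Kaggle, a similar signal is a cross-validation score that is clearly higher than the public leaderboard score for the same model.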

  • @manasagrawal8365
    2 years ago

    thanks Ken this was really helpful

  • @KenJee_ds

    2 years ago

    Thanks for watching!

  • @AdityaKumar-cj2ms
    3 years ago

    It was a very insightful explanation of this project, really liked it. And, at cell [5] if you execute training.describe(include = "all"), it will also give you the values which appear the most for every categorical variable. Which I think can be really helpful.

  • @KenJee_ds

    3 years ago

    I actually didn't know that! Thank you for sharing!
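
The `describe(include="all")` tip in this thread can be tried on a small frame. The rows below are made up for illustration; only the name `training` comes from the comment.

```python
import pandas as pd

# Tiny made-up sample with one numeric and two categorical columns.
training = pd.DataFrame({
    "Age": [22.0, 38.0, 26.0, 35.0],
    "Sex": ["male", "female", "male", "male"],
    "Embarked": ["S", "C", "S", "S"],
})

# include="all" adds unique/top/freq rows for the non-numeric columns.
summary = training.describe(include="all")

# 'top' is the most frequent value, 'freq' is how often it appears.
print(summary.loc[["top", "freq"], ["Sex", "Embarked"]])
```

Numeric-only statistics such as `mean` show up as `NaN` for the categorical columns, and `top`/`freq` show up as `NaN` for the numeric ones.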

  • @bencantc2548
    3 years ago

    Amazing video! I hope you do a similar video on regression and clustering problems in the future!

  • @KenJee_ds

    3 years ago

    Thanks for watching! I plan to do a regression problem next!

  • @dhristovaddx
    3 years ago

    Thank you for the great video! It's very helpful! ^_^

  • @KenJee_ds

    3 years ago

    Thanks for watching! Glad it was helpful!

  • @mimikoko4299
    3 years ago

    You have the best data science channel, I love you

  • @KenJee_ds

    3 years ago

    Thank you Mimi!

  • @Mario-ox5dm
    3 years ago

    I sense a rising Kaggle Grandmaster in the future!

  • @KenJee_ds

    3 years ago

    Haha I don't know about that! Long road ahead

  • @2ash94
    8 months ago

    Wow, this is a gold mine! Can't believe you went through all that work! Looking through all this, it seems like becoming a great data scientist is not just about skill. It is about intelligence and your ability to understand and see things that aren't clear to the normal human being. I have a fairly normal IQ and I am currently wondering if I should continue building my skills in order to become a data scientist.

  • @KenJee_ds

    8 months ago

    I don't think you have to have a high IQ. You can learn to ask the right questions and create frameworks for yourself. I could not have done the analysis in the same way when I started. I am certain you can learn to approach the problem in the same way I did!

  • @samuelwondim5906
    3 years ago

    This is just great

  • @KenJee_ds

    3 years ago

    Thanks for watching Samuel!

  • @hugochung9909
    3 years ago

    I've been following your videos for a while now and making my way through all the microcourses on Kaggle. This is the exact video I was looking for to begin the next stage of learning by diving into some data science projects. Top content and keep up the great work, Ken!

  • @KenJee_ds

    3 years ago

    Thanks for the kind words! This is exactly what I like to hear haha. Glad you found it helpful!

  • @MarsLanding91
    3 years ago

    Thanks Ken!

  • @KenJee_ds

    3 years ago

    Thanks for watching!

  • @hehermosilla13
    3 months ago

    very good!

  • @anoopashware9539
    2 years ago

    Thank you, sir, for making this video. I can't put into words how much information is in it. It is really helpful for me in becoming a good data scientist. Thank you so much.

  • @KenJee_ds

    2 years ago

    Really glad to hear this video helped!

  • @nailujretuas2093
    2 years ago

    very helpful, thank you. comment for the algorithm.

  • @KenJee_ds

    2 years ago

    Thank you!

  • @muhammadtalmeez3276
    3 years ago

    thanks for this video

  • @KenJee_ds

    3 years ago

    Thanks for watching!

  • @RichardOnData
    3 years ago

    Loving this video and the thumbnail dude!

  • @KenJee_ds

    3 years ago

    Thanks for noticing the thumbnail, Richard! Would love to collab at some point if you're interested!

  • @RichardOnData

    3 years ago

    @KenJee_ds Absolutely! My email is richardondata@gmail.com - I have a number of items on my backlog of videos that I'd love to cover in the future, as I'm sure you do too, and some of them I think would make total sense! I'll drop you an email in a day or two myself.

  • @tomasagustin2243
    3 years ago

    Amazing! I learn a lot from your videos. Thanks for sharing your knowledge, hug from Argentina!

  • @KenJee_ds

    3 years ago

    Thank you for watching! I would love to visit Argentina some day!

  • @tomasagustin2243

    3 years ago

    Hope you come! There are a lot of beautiful people here and a lot of parties hahahaha

  • @AIPlayerrrr
    3 years ago

    I’d be super interested in seeing you competing in a real Kaggle Competition.

  • @KenJee_ds

    3 years ago

    I will likely be trying one in a few months! Stay tuned!

  • @AIPlayerrrr

    3 years ago

    Ken Jee great! I am excited

  • @Gamma3

    3 years ago

    Me too! Great channel

  • @MrBlack-cv8qn
    2 years ago

    Huge thanks from beginner DS switching from mechanical engineering!

  • @KenJee_ds

    2 years ago

    Thanks for watching! Glad to hear it was helpful!

  • @lemonandliam
    3 years ago

    Thanks Ken. I loved the video! Do you have any videos that deal in more detail with the correlation heat map output in line 9? I would love to know more about what I can learn from this and when to use it.

  • @KenJee_ds

    3 years ago

    I would look into my data science fundamentals playlist, I think I have a video where I go more in depth there!
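
For readers with the same question about the correlation heat map: the plot is a colored view of a pairwise correlation matrix of the numeric columns. A minimal sketch with made-up values follows; the column names are assumed to match the Titanic numeric features, and the frame name `df_num` is hypothetical.

```python
import pandas as pd

# Made-up numeric sample; values are illustrative only.
df_num = pd.DataFrame({
    "Age": [22, 38, 26, 35, 54, 2],
    "SibSp": [1, 1, 0, 1, 0, 3],
    "Parch": [0, 0, 0, 0, 0, 1],
    "Fare": [7.25, 71.28, 7.93, 53.10, 51.86, 21.08],
})

# Pearson correlation between every pair of columns, in [-1, 1].
corr = df_num.corr()
print(corr.round(2))
```

A plotting library can then color this matrix (for example `seaborn.heatmap(corr)`). Values near 1 or -1 flag pairs of features that move together, which matters for models such as linear or logistic regression where strongly correlated inputs can make coefficients unstable.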

  • @alexanderlindsey4066
    3 years ago

    Hi Ken, great video. Thank you! Please consider making a similar video with panel data!

  • @KenJee_ds

    3 years ago

    Thanks for watching Alexander! Can you expand on what you mean by panel data further?

  • @alexanderlindsey4066

    3 years ago

    @KenJee_ds Time series! Something like the M5 Forecasting challenge on Kaggle, or predicting house prices, predicting blood sugar metabolism (see www.diabits.com), or other ideas.

  • @EngrDS
    2 years ago

    Hello, beginner data scientist. Great video! I'm trying this out and my pivot table outputs don't have scrollbars. What could be the reason? I would want to view all the columns via the pivot table. I'm using Kaggle. Thanks.
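
A possible workaround for the question above, offered as a guess since it depends on how the notebook front end renders tables: pandas truncates wide frames by default, and the `display.max_columns` option controls that. The small frame below is made up for illustration.

```python
import pandas as pd

# Show every column instead of truncating wide tables with "...".
pd.set_option("display.max_columns", None)

# Made-up sample rows, loosely shaped like the Titanic training data.
training = pd.DataFrame({
    "Survived": [0, 1, 1, 0, 1, 0],
    "Pclass": [3, 1, 3, 2, 1, 3],
    "Ticket": ["a", "b", "c", "d", "e", "f"],
})

# Count passengers per (Survived, Pclass) cell.
pivot = pd.pivot_table(
    training, index="Survived", columns="Pclass", values="Ticket", aggfunc="count"
)
print(pivot)
```

If the table is still cut off, printing `pivot.T` (rows and columns swapped) is another way to see every column.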

  • @alexfilo7929
    1 year ago

    Very helpful

  • @KenJee_ds

    1 year ago

    Really glad to hear! Thanks for watching!

  • @abdallahsiyabi4784
    2 years ago

    Hi Ken, is there any reason for calling the fit() method twice on rf? I mean, best_estimator is used already. That is not the case with xgb.
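
Some background for this question: with scikit-learn's `GridSearchCV`, the default `refit=True` retrains the best parameter combination on the full training data, so `best_estimator_` is already fitted and can predict without another `fit()` call. The sketch below demonstrates that on synthetic data; the variable names and the small grid are assumptions, not the notebook's code.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV
from sklearn.utils.validation import check_is_fitted

# Synthetic stand-in for the prepared training features.
X_train, y_train = make_classification(n_samples=200, n_features=8, random_state=0)

param_grid = {"n_estimators": [20, 50], "max_depth": [3, None]}
clf_rf = GridSearchCV(RandomForestClassifier(random_state=1), param_grid, cv=3)
clf_rf.fit(X_train, y_train)  # searches the grid, then refits the best model

best_rf = clf_rf.best_estimator_
check_is_fitted(best_rf)      # raises NotFittedError if the model were unfitted

# No second fit() is needed before predicting.
preds = best_rf.predict(X_train)
print(clf_rf.best_params_)
```

Calling `fit()` again on `best_estimator_` with the same data is harmless but redundant under these defaults.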

  • @shakilarosli8757
    2 years ago

    Hi Ken Jee, thanks a lot for the video! Can you please share more on the feature engineering for Cabin? What do the numbers 0-4 indicate? I don't get that part. Thank you!!
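
A likely reading of the 0-4 values, stated here as an assumption about the notebook rather than a confirmed detail: they count how many cabins are listed for a passenger, with 0 meaning the Cabin field is missing. The sketch below shows that kind of `apply`/`lambda` feature on made-up rows; the column names `cabin_multiple` and `cabin_adv` are used for illustration.

```python
import numpy as np
import pandas as pd

# Made-up Cabin values: missing, one cabin, three cabins, four cabins.
training = pd.DataFrame({"Cabin": [np.nan, "C85", "C23 C25 C27", "B57 B59 B63 B66"]})

# Number of cabins listed per passenger (0 when Cabin is missing).
training["cabin_multiple"] = training["Cabin"].apply(
    lambda x: 0 if pd.isna(x) else len(x.split(" "))
)

# First letter of the Cabin string; missing values become "n" (from "nan").
training["cabin_adv"] = training["Cabin"].apply(lambda x: str(x)[0])

print(training)
```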

  • @elsins9790
    3 years ago

    Thank you for this video and your explanation of how you approach data science problems. I was just able to reach the baseline from the Titanic tutorial with my own approach using XGB and GridSearchCV. Did you try a stacked denoising autoencoder in your projects, and how did it work out? It is kind of like an automatic unsupervised learning approach that can be fed into a neural network. Your channel is golden! Keep it up and stay healthy!

  • @KenJee_ds

    3 years ago

    Thanks for watching! For this one, I didn't stack any models. That would be a good way to go a step past what I did though!

  • @user-ni1md2sk2r
    3 years ago

    I think it would be great if you could show how you would present this project in a markdown file in order to add it to your github. Thanks for the great work!!!

  • @KenJee_ds

    3 years ago

    I will work on it!

  • @risperbevalyn9670
    3 years ago

    Thanks a lot ken jee

  • @KenJee_ds

    3 years ago

    Glad I could help!

  • @KhaliDALKhafaji
    1 year ago

    Hi, thanks for this explanation. I have a question: is this dataset suitable for semantic annotation and event extraction?

  • @AlexKite68
    2 years ago

    Thank you for this great video! I've already subscribed to your channel and am digging to find lots of DS insights :) But please improve the audio quality in future videos: background noises are really frustrating, and the background music seems to be a little bit loud. But again, you're making a great resource that is very useful for data science beginners like me!

  • @KenJee_ds

    2 years ago

    Thanks for watching! I have adjusted the music in the newer videos

  • @gupnir
    3 years ago

    Hi Ken, your videos are really helpful for beginners like me. Can you do a similar walk-through video for House Prices problem as well.... thanks in advance.

  • @KenJee_ds

    3 years ago

    I plan to! Thanks for watching Nirmit!

  • @daedalusdreamjournal5925
    3 years ago

    Hello there :) I haven't watched the full video yet, but there's a reason for this and is linked to a suggestion I'd like to propose to you for similar videos in the future: Despite being very VERY green in this, I decided to have a first go at this all by myself ... and boy was and is it still frustrating :P The reason behind this was that I wanted to try a first attempt without a guiding hand. Once I finished my first model, I quickly realized that there were tons of ways where I blundered like a total noob ... which is actually totally fine :) And despite the frustration of the experience, it felt like I gathered valuable experience from this. And it is only now that I am starting to watch this video .. but only bit by bit, as I want to try to do as much by myself as possible (mistake be damned since they are being done at home where it won't hurt anyone and where I can learn safely from the experience). SO my suggestion is this: Could it be possible for future similar videos to have it in several parts? Or, at the very least, to timestamp the different section of your handling of a particular problem? I feel like it could be very valuable, especially for very recent newcomers like me. Anyways, thanks a ton for your videos, very much appreciated ! (especially some of the code where you use apply and lambda functions to handle data transformations, this is definitely something that will be useful for me in the near and long future! :) Signed: A total newbie at this.

  • @KenJee_ds

    3 years ago

    This is a great idea! I think I will try the time stamp portion for the next one. I would also recommend my project from scratch series: kzread.info/head/PL2zq7klxX5ASFejJj80ob9ZAnBHdz5O1t . I broke this one into each phase of the data science lifecycle. I think your approach is really great though! I highly recommend that for other people going through this.

  • @TV-in3tt
    3 years ago

    Ken Jee👍👍👍

  • @KenJee_ds

    3 years ago

    😆

  • @kartikeyanamdev4471
    3 years ago

    First of all Thanks Ken for this, and secondly it's a request if you can make a dedicated video on how data analysis can work in cricket, I know you may not be knowing about the game but I really want to implement some data analysis into the game of cricket, so just need your help and it will do great if you make a video on the same. Have a good day mate.

  • @KenJee_ds

    3 years ago

    I will try to bring someone in who is familiar with the game!

  • @d.p.1980
    2 months ago

    Thanks for your great tutorial on YouTube! I have a question regarding this analysis. At 24:40 you start talking about cross validation. I'm not sure I clearly understand your code here. You're running cross validation on the X_train/y_train data set? Is this the correct approach? Or should we do this on the whole data set X/y?
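
Some context that may help with this question: Kaggle's Titanic test file has no `Survived` labels, so the labeled training set is the only data that can be scored, and cross-validation splits it into folds so that every row is held out exactly once. The sketch below illustrates that property with scikit-learn's `KFold` on a tiny placeholder array; the sizes and names are arbitrary.

```python
import numpy as np
from sklearn.model_selection import KFold

n_train = 10
X_train = np.zeros((n_train, 1))       # placeholder features, values unused
held_out = np.zeros(n_train, dtype=int)

kf = KFold(n_splits=5, shuffle=True, random_state=0)
for fit_idx, val_idx in kf.split(X_train):
    # A model would be fit on fit_idx rows and scored on val_idx rows.
    held_out[val_idx] += 1

print(held_out)  # how many times each row served as validation data
```

Because each fold's score comes from rows the model did not see during that fit, the averaged score estimates how the model might do on the unlabeled test file.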

  • @prabirbiswas440
    3 years ago

    Wow, what an in-depth analysis. You really put a lot of effort into this. This is my first try on Kaggle too. After spending this much time, I wonder how much time it will take for even tougher data. I also checked the House Rent Competition. It has 81 features. How can we do such a detailed analysis on all the features? I'm not sure how real-world ML problems are solved where there might be 100 or even more features. I am really excited to know more :)

  • @KenJee_ds

    @KenJee_ds

    3 years ago

    Thanks for watching! I will be doing the housing dataset next, so stay tuned!!

  • @ImportData1
    @ImportData1 3 years ago

    Learned something new - VotingClassifier!

  • @KenJee_ds

    @KenJee_ds

    3 years ago

    Awesome! Yeah, it is super useful and easy to use! Next time I will probably experiment more with some pipelines to clean up the feature engineering a bit!

  • @ImportData1

    @ImportData1

    3 years ago

    @@KenJee_ds I find the feature engineering/selection process the toughest. Sometimes you think you engineered the features well enough, but the model accuracy doesn't necessarily reflect that. Would love to see how you experiment with pipelines!

  • @KenJee_ds

    @KenJee_ds

    3 years ago

    @@ImportData1 Yep! This is definitely the case where I could have done more!

  • @saurabhjoshi4887
    @saurabhjoshi4887 3 years ago

    Hi Ken, great video. I just completed your 7-part data science project from scratch series. I am a beginner in data science and your videos helped me a lot. Thanks 😊

  • @KenJee_ds

    @KenJee_ds

    3 years ago

    Thanks for working through the whole project series! I hope that this video helps you as well!

  • @cshivani
    @cshivani 3 years ago

    Thanks!

  • @KenJee_ds

    @KenJee_ds

    3 years ago

    I hope you enjoyed it!

  • @DarkPrince1996
    @DarkPrince1996 3 years ago

    You did a great job explaining your approach to the task at hand and walking us through the process, so I'd like to know: what would be the next steps for someone who wants to use this competition to learn data science? I don't have a detailed understanding of all the algorithms you used here. Would it be best to pick the one that produced the best score and learn how to tune that particular model's parameters to get a better score, or would it be better to transfer your process to another beginner competition to build a better understanding of the data science process as a whole?

  • @KenJee_ds

    @KenJee_ds

    3 years ago

    I think this is great for learning how to tune the algorithms and seeing what results you get with different ones. It is also a good one for practicing feature engineering, like I did with some of the seats, etc. I think transferring things to another competition would be a good idea!

  • @DarkPrince1996

    @DarkPrince1996

    3 years ago

    @@KenJee_ds appreciate your advice and I will definitely do that.

  • @tobakudan
    @tobakudan 3 years ago

    Awesome tutorial. I have a question. Why do you log normalize SibSp and Fare in addition to using StandardScaler? What does the log normalization accomplish that the StandardScaler doesn't?

  • @KenJee_ds

    @KenJee_ds

    3 years ago

    Thanks for watching! So StandardScaler only shifts and rescales each feature to zero mean and unit variance; it doesn't change the shape of the distribution. When we use log norm we reduce the skew of the data, which can sometimes help with the model. I hope this helps!
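    The difference described in this reply can be checked with a few lines of plain Python. This is only a sketch: the `fares` values are made up (not taken from the Titanic data), and `standardize` and `skewness` are hypothetical helpers that mimic the idea behind StandardScaler rather than calling scikit-learn.

```python
import math
import statistics


def standardize(values):
    """Rescale to zero mean and unit variance (the idea behind StandardScaler)."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    return [(v - mean) / std for v in values]


def skewness(values):
    """Population skewness: the average cubed z-score."""
    return statistics.fmean([z ** 3 for z in standardize(values)])


# Made-up, right-skewed "fare-like" numbers (an assumption for illustration).
fares = [7.25, 7.9, 8.05, 10.5, 13.0, 26.0, 31.0, 71.3, 151.6, 512.3]

scaled = standardize(fares)
logged = [math.log(f + 1) for f in fares]  # log(x + 1) also tolerates a fare of 0

print("skew, raw:         ", round(skewness(fares), 3))
print("skew, standardized:", round(skewness(scaled), 3))  # same as raw
print("skew, log(x + 1):  ", round(skewness(logged), 3))  # smaller
```

    Standardizing leaves the skew untouched because it is a linear transformation, while the log compresses the long right tail. That is why the two steps can be combined rather than treated as alternatives.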

  • @kushagrayadav.fitness
    @kushagrayadav.fitness 3 years ago

    Thank You Ken for providing this video...your new subscriber from India...🧡✌

  • @KenJee_ds

    @KenJee_ds

    3 years ago

    Awesome! Thank you for subscribing! I hope my other videos are helpful as well!

  • @nikhilatluri1569

    @nikhilatluri1569

    3 years ago

    @@KenJee_ds Yes, for sure. I've watched almost all your videos and got a lot of information for building my career.

  • @kushagrayadav.fitness

    @kushagrayadav.fitness

    3 years ago

    @@KenJee_ds Just finished your Data Science Beginners playlist...🙂✌... After this I'm going to start my first project for beginners.... Thank you so much, Ken. Earlier I was going down the wrong path. I will be your fan...🧡🧡🧡 I'd like to get in touch with you, please, sir...

  • @dunghuy6389
    @dunghuy6389 3 years ago

    Hello, thanks for the video. I heard you mention deep learning for this dataset (which includes categorical and non-categorical features). It is a typical kind of data that we usually see. Could you please make a video where you build a deep learning model?

  • @KenJee_ds

    @KenJee_ds

    3 years ago

    I will use deep learning in an upcoming video!

  • @fadinayfeh4490
    @fadinayfeh4490 3 years ago

    Amazing video. I have a question: don't we need to detect outliers in the data for the models? Or is it not a necessary step in classification?

  • @KenJee_ds

    @KenJee_ds

    3 years ago

    It can be useful here. I can't remember off the top of my head if I did it haha. It really depends on the models you use as well. Some are not as sensitive to outliers.

  • @DhrECraig
    @DhrECraig 3 years ago

    Hey Ken Jee, thank you for the video, it's helping me a lot. :) I wanted to ask though, how did you know that you overfitted (the spoiler alert) with XGBoost?

  • @KenJee_ds

    @KenJee_ds

    3 years ago

    If your model scores well on the training data but starts producing poor results on your test or validation set, it is likely a sign of overfitting!
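    The check described in this reply comes down to comparing the score on the data the model was fit on with the score on held-out data. The toy sketch below is not the XGBoost setup from the video. It uses made-up one-dimensional points, and `one_nn` (a model that memorizes the training rows) and `threshold` (a simple rule) are hypothetical stand-ins for an overfit model and a simpler one.

```python
# Made-up toy data: (feature, label). The labels at x=3 and x=7 are noise.
train = [(1, 0), (2, 0), (3, 1), (4, 0), (6, 1), (7, 0), (8, 1), (9, 1)]
val = [(1.5, 0), (2.6, 0), (3.4, 0), (6.6, 1), (7.4, 1), (8.5, 1)]


def one_nn(x):
    """Memorizing model: copy the label of the closest training point."""
    _, nearest_label = min(train, key=lambda row: abs(row[0] - x))
    return nearest_label


def threshold(x):
    """Simple model: predict 1 when the feature is above 5."""
    return int(x > 5)


def accuracy(predict, rows):
    """Fraction of rows whose label the model predicts correctly."""
    return sum(predict(x) == y for x, y in rows) / len(rows)


for name, model in [("1-NN (memorizes)", one_nn), ("threshold rule", threshold)]:
    print(name, "train:", accuracy(model, train), "validation:", accuracy(model, val))
```

    A perfect training score paired with a much lower validation score is the pattern to watch for. The simpler rule scores lower on the noisy training rows but holds up on the held-out ones.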
