Introduction to NLP | Bag of Words Model

This is the first video of Introduction to NLP series. In this video, I have explained the concept of the Bag of Words model and how to implement it in python. The next video will be about TF-IDF. Stay tuned for the whole series! #nlp #machinelearning #python
For more videos please subscribe -
bit.ly/normalizedNERD
Support me if you can ❤️
www.paypal.com/paypalme2/suji04
www.buymeacoffee.com/normaliz...
Scikit learn playlist -
• Learn Scikit Learn
Source code -
github.com/Suji04/NormalizedN...
Data source -
www.kaggle.com/marklvl/sentim...
Facebook -
/ nerdywits
Instagram -
/ normalizednerd
Twitter -
/ normalized_nerd

Пікірлер: 61

  • @user-jo7pq2ti7r
    @user-jo7pq2ti7r Жыл бұрын

    I am Bengali as well. I am glad that you are doing this and feeling proud of you brother.

  • @balipavankalyan5008
    @balipavankalyan50082 жыл бұрын

    Rocked in very less time. This can only come with very keen knowledge.

  • @fahdciwan8709
    @fahdciwan87094 жыл бұрын

    excellent vids brother ... keep going. You've compressed months & months of learning into a easy-to-learn videos ... please don't stop

  • @NormalizedNerd

    @NormalizedNerd

    4 жыл бұрын

    Thanks a lot for appreciating the effort :D

  • @rajaroy4731
    @rajaroy47313 жыл бұрын

    thanks a lot man, after two days of such boring videos on youtube finally found a good one to learn NLP, great work man keep going on machine learning and other data science topic, it's really rare to find such a great video.

  • @NormalizedNerd

    @NormalizedNerd

    3 жыл бұрын

    Thanks a lot mate! Keep supporting...

  • @dimaelzein9908
    @dimaelzein99083 жыл бұрын

    I like the video. Simple and straight to the point. Keep it up!

  • @NormalizedNerd

    @NormalizedNerd

    3 жыл бұрын

    Thanks! :D

  • @dien2971
    @dien29712 жыл бұрын

    Thank you for your best explanation!

  • @Easy_way_of_codeing
    @Easy_way_of_codeing Жыл бұрын

    Amazing. W8ing for more video😊

  • @akalankagamage8112
    @akalankagamage81124 жыл бұрын

    Really Amazing and very clear . Keep this up . New subscriber .

  • @NormalizedNerd

    @NormalizedNerd

    4 жыл бұрын

    Thanks for the sub!

  • @vindyaabeygunawardena6222
    @vindyaabeygunawardena62222 жыл бұрын

    excellent. thanks Normalised Nerd

  • @shivanineeli5392
    @shivanineeli53923 жыл бұрын

    My 4 months struggle to learn nlp got done in 23 min. Thanks a ton bro!

  • @NormalizedNerd

    @NormalizedNerd

    3 жыл бұрын

    Glad to hear that bro! ❤

  • @thegamooo7310
    @thegamooo73103 жыл бұрын

    Very nice explanation also covered much things in very less time. Keep it up Man👍

  • @NormalizedNerd

    @NormalizedNerd

    3 жыл бұрын

    Glad to hear that!

  • @sumitchhabra2419
    @sumitchhabra24193 жыл бұрын

    I am at present studying the Machine Learning A-Z course from Udemy. Trust me, the tutor has no Idea how to explain stuff and it is the best selling course on Udemy for Data Science. I scratched my head while I was going through his tutorial, then I came here and all my doubts were cleared. Coding in python is not a difficult task but understanding the concept is the most important thing. And I got that understanding of Bag of words Model from your tutorial. Thanks a lot for your help. P.S. One who understands the concept and has strong Fundamentals has the ability to explain stuff in the simplest manner possible. Please keep it Simple like this in upcoming tutorials also. ALREADY SUBSCRIBED.

  • @NormalizedNerd

    @NormalizedNerd

    3 жыл бұрын

    Thanks a lot @Sumit Chhabra. I'll try my best to maintain this level of simplicity :)

  • @elirhm5926
    @elirhm59263 жыл бұрын

    Thank you so much

  • @shaminmohammed672
    @shaminmohammed6722 жыл бұрын

    Thank you sir 🙏

  • @soumadeepacharya9513
    @soumadeepacharya9513 Жыл бұрын

    Thanks brother❤

  • @prajnapn5946
    @prajnapn59463 жыл бұрын

    Well explained !!!

  • @NormalizedNerd

    @NormalizedNerd

    3 жыл бұрын

    Thanks!!

  • @user-jz3wo1om2c
    @user-jz3wo1om2c3 жыл бұрын

    you are great.

  • @kanchangandhi1200
    @kanchangandhi12003 жыл бұрын

    Very informative video , thank you for uploading the NLP series , having one queries how can we use auto text summary generator in other human lanaguages text like(Japanese , chinese & Korean)????? your reply would be very helpfull.

  • @NormalizedNerd

    @NormalizedNerd

    3 жыл бұрын

    Unfortunately there's no library for that can summarize every language. However, you'll find many github repos where people have built text summarizers for other languages using the same method!

  • @christinavalavani1561
    @christinavalavani15612 жыл бұрын

    Please make a video or two about neural machine translation. With an example.

  • @doyourealise
    @doyourealise4 жыл бұрын

    amazing keep posting videos :)

  • @NormalizedNerd

    @NormalizedNerd

    4 жыл бұрын

    Will do 😁

  • @thetensordude
    @thetensordude4 жыл бұрын

    chalie jao dada!!!

  • @NormalizedNerd

    @NormalizedNerd

    4 жыл бұрын

    Thank you bhai!

  • @thetensordude

    @thetensordude

    4 жыл бұрын

    Can you make some videos about seaborn?

  • @oludotunode
    @oludotunode2 жыл бұрын

    Shouldn't you initialize the regular expression outside the loop?

  • @sumitchhabra2419
    @sumitchhabra24193 жыл бұрын

    What does CountVectorizer do in this model ? Does it convert the words in the instances/Document into 0/1 please suggest.... Thanks for your help in advance.

  • @NormalizedNerd

    @NormalizedNerd

    3 жыл бұрын

    It'll generate the feature matrix that I started drawing at 7:06

  • @apoorvshrivastava3544
    @apoorvshrivastava35444 жыл бұрын

    Great, New subscriber

  • @NormalizedNerd

    @NormalizedNerd

    4 жыл бұрын

    Thanks a lot :D

  • @assiaben1220
    @assiaben12202 жыл бұрын

    Hello Normalize Nerd, i've got an error in y(target) y = data.as_matrix(['Review_class'], AttributeError: 'DataFrame' object has no attribute 'as_matrix', By the way, thanks 💚 the tutorial is very clear and well explained . 👏Bravo

  • @vindyaabeygunawardena6222

    @vindyaabeygunawardena6222

    2 жыл бұрын

    change it into data.(['Review_class'].to_numpy().

  • @elirhm5926
    @elirhm59263 жыл бұрын

    how can we go back to the original sentence from X_test? I mean how can I see what sentences the algorithm doesn't classify correctly?

  • @lpereira95

    @lpereira95

    3 жыл бұрын

    Compare y_pred and y_test. The indices where they don't match are the mistaken samples. Then use that indices to access the sentences in X_test. I hope it helps.

  • @roopeshroope5988
    @roopeshroope59883 жыл бұрын

    We first split the data and do Preprocessing right why you perform on whole dataset

  • @NormalizedNerd

    @NormalizedNerd

    3 жыл бұрын

    Yes, ideally we should first split then preprocess. Here, the text preprocessing will remain the same for both train and test set so I did them together. However, I also formed the BOW model on the whole data; which is not the correct way. We should build it only on the training set then apply it on the test set. I did it just to make things a little easier.

  • @user-or7ji5hv8y
    @user-or7ji5hv8y3 жыл бұрын

    Is X_train the bag of words?

  • @NormalizedNerd

    @NormalizedNerd

    3 жыл бұрын

    X_train is the feature matrix.

  • @dhusor9633
    @dhusor96333 жыл бұрын

    plz a make tutorial how nlp is work on Bangla text datasets.

  • @NormalizedNerd

    @NormalizedNerd

    3 жыл бұрын

    Sure I'll...stay tuned!

  • @apoorvshrivastava3544
    @apoorvshrivastava35444 жыл бұрын

    why we use deimitor as '/t'

  • @NormalizedNerd

    @NormalizedNerd

    4 жыл бұрын

    In the .txt file, the values are separated by tab('/t') just like the values are separated by a comma in a .csv file. In pandas we have the function read_csv(reads .csv files by default). We need to pass the parameter to read tab-separated files.

  • @user-or7ji5hv8y
    @user-or7ji5hv8y3 жыл бұрын

    How can we access the text files used?

  • @NormalizedNerd

    @NormalizedNerd

    3 жыл бұрын

    I've provide the link in the video description

  • @sumitchhabra2419
    @sumitchhabra24193 жыл бұрын

    One suggestion here. Please ZOOM-IN your screen while you are explaining the coding part. Press Ctrl and scroll up from your mouse, it will zoom in. It puts strain on our eyes and understanding the coding part becomes a punishment.

  • @NormalizedNerd

    @NormalizedNerd

    3 жыл бұрын

    Point noted. Thanks for the feedback.

  • @rakeshmanathana
    @rakeshmanathana3 жыл бұрын

    Is it 'Lov' or 'Love' the root?

  • @NormalizedNerd

    @NormalizedNerd

    3 жыл бұрын

    Good question. The thing is...stemming should give us 'lov'. But, Porter stemmer gives us 'love'. I guess the reason lies in the details of Porter stemmer's implementation.

  • @thamizhansudip6644
    @thamizhansudip66443 жыл бұрын

    powerpoint could have been used for nice presentation

  • @xritzx
    @xritzx4 жыл бұрын

    I would have loved it if it was good. 😀😅Is this positive?

  • @sujandutta8981

    @sujandutta8981

    4 жыл бұрын

    Such complex shit ;__;

  • @mdarifulislam2736
    @mdarifulislam27364 жыл бұрын

    ভাই আমিও বাঙালি.! ❤❤❤

  • @NormalizedNerd

    @NormalizedNerd

    4 жыл бұрын

    বেশ ভালো লাগলো আপনার কমেন্টটি পেয়ে। চ্যানেলটিকে আপনার পরিচিতদের মধ্যে শেয়ার করার অনুরোধ রইল। ❤️