dplyr: Joins

Ғылым және технология

Joins let you combine two data tables together based on a shared column that uniquely identifies the records, also known as a key column. When your data is spread out across multiple tables, you may need to perform one or more joins to get it all into one big table before doing other data cleaning and analysis tasks.
Link to the Kaggle Notebook code used for this video series:
www.kaggle.com/hamelg/dplyr-in-r
View the whole dplyr in R playlist here:
• dplyr: Getting Started
dplyr cheat sheet from RStudio:
www.rstudio.com/wp-content/up...
dplyr documentation:
cran.r-project.org/web/packag...
Follow DataDaft on social media for news and updates:
Twitter: / datadaft
Join the DataDaft Discord to discuss all things data science:
/ discord
#dplyr #rprogramming #datascience

Пікірлер: 21

  • @j7andrew
    @j7andrew2 жыл бұрын

    This is awesome! You have a gift for teaching. Thank you

  • @michaelhudson2375
    @michaelhudson23752 жыл бұрын

    Thank you! Excellent explanation and coverage of the joins in dplyr. I learnt a lot from coding along with you. I hope you make more videos to share your knowledge of R, and data science.

  • @svendavidsson
    @svendavidsson3 жыл бұрын

    Straight to the point! Very helpful. cheers

  • @mariamaroni8104
    @mariamaroni81043 жыл бұрын

    Thank you so much! This was really helpful. Hugs from Argentina!

  • @bukolaadebayo1891
    @bukolaadebayo1891 Жыл бұрын

    Thank you so much!! The join multiple columns was really helpful.

  • @data_kom
    @data_kom2 жыл бұрын

    Great video on joins. Thanks

  • @lichaccakaven146
    @lichaccakaven1464 жыл бұрын

    very basic ,very good.

  • @simonebellisario3682
    @simonebellisario3682 Жыл бұрын

    THANK YOU

  • @amrsalaheldinabdallahhammo663
    @amrsalaheldinabdallahhammo663 Жыл бұрын

    Thank you genius :)

  • @KirtiTewari
    @KirtiTewari2 жыл бұрын

    Very helpful

  • @carolwrezende
    @carolwrezende3 жыл бұрын

    FINALLY A VIDEO ABOUT THIS SUBJECT IN AMERICAN ENGLISH

  • @nitufahmidakhalique2698
    @nitufahmidakhalique26983 жыл бұрын

    thank you

  • @Timska
    @Timska4 жыл бұрын

    Thank you for making these videos on dplyr and R studio, its really helping me out! If also may ask why is it that the full_join is the only command of the joins that uses by=c( ) instead of by=

  • @DataDaft

    @DataDaft

    3 жыл бұрын

    When joining on only one column, you can pass in that single column name using --> by = "join_col" In the full_join example where we are joining on two columns, we have to pass a vector of columns so we use --> by = c("join_col1", "join_col2")

  • @joankettfield6194
    @joankettfield61942 жыл бұрын

    Thank you very clear presentation. Can dplyr replace SQL?

  • @shantileitch
    @shantileitch2 жыл бұрын

    awesome

  • @jamesleleji6984
    @jamesleleji6984 Жыл бұрын

    which IDE are you using to run the codes? Thanks

  • @walterbondzie2464
    @walterbondzie24643 жыл бұрын

    Please what if you have two datasets with different colnames and rows but you are asked to merge the two

  • @DataDaft

    @DataDaft

    3 жыл бұрын

    If you have two columns with different names but that contain the same information/unique identifiers, such as "P_ID" in one data set and "Patient_ID" in the other, you can use the argument by = c("P_ID" = "Patient_ID") to join on that column despite the different column names. If the data sets don't have any variables in common (regardless of whether they actually have the column names) I'm not sure how they can be joined/merged in a meaningful way.

  • @jhonataoliveira2175
    @jhonataoliveira21753 жыл бұрын

    i fucking love u you saved me

  • @jamesleleji6984
    @jamesleleji6984 Жыл бұрын

    which IDE are you using to run the codes? Thanks

Келесі