Combine Two Data Frames with Different Variables by Rows in R (Example)
In this tutorial you’ll learn how to join two data frames with a different set of variable names in R.
The table of content is structured like this:
It’s time to dive into the example…
Example Data
Have a look at the following example data:
data1 <- data.frame(x1 = 1:5, # First data frame x2 = letters[1:5]) data1 # x1 x2 # 1 1 a # 2 2 b # 3 3 c # 4 4 d # 5 5 e
The previous output of the RStudio console shows that our first example data frame has five rows and two columns. The variables of our first data frame are called x1 and x2.
Let’s create a second data frame in R:
data2 <- data.frame(x2 = LETTERS[11:15], # Second data frame x3 = 777) data2 # x2 x3 # 1 K 777 # 2 L 777 # 3 M 777 # 4 N 777 # 5 O 777
The second data frame also consists of two columns. However, the column names of this data frame are different compared to the first data frame (i.e. x2 and x3 instead of x1 and x2).
Example: Merging Data Frames with Unequal Column Names Using bind_rows() of dplyr Package
The rbind function of the basic installation is not able to merge data frames with different column names.
Fortunately, the dplyr package provides this possibility.
First, we need to install and load the dplyr add-on package:
install.packages("dplyr") # Install & load dplyr library("dplyr")
Now, we can apply the bind_rows function provided by the dplyr package to bin the rows of our two data frame in R:
bind_rows(data1, data2) # Applying bind_rows function # x1 x2 x3 # 1 1 a NA # 2 2 b NA # 3 3 c NA # 4 4 d NA # 5 5 e NA # 6 NA K 777 # 7 NA L 777 # 8 NA M 777 # 9 NA N 777 # 10 NA O 777
As you can see based on the previous output of the RStudio console, the bind_rows function inserts NA values in columns that didn’t exist is both data frames.
Video & Further Resources
Do you need further info on the R programming code of this tutorial? Then I can recommend watching the following video of my YouTube channel. In the video, I’m explaining the examples of this article in a live session.
Please accept YouTube cookies to play this video. By accepting you will be accessing content from YouTube, a service provided by an external third party.
If you accept this notice, your choice will be saved and the page will refresh.
In addition, you may read some of the other tutorials of this homepage. I have published numerous other articles on related topics such as ggplot2, variables, graphics in r, and merging.
- bind_rows & bind_cols R Functions of dplyr Package
- rbind in R
- Merge Two Unequal Data Frames & Replace NA with 0
- Insert New Column Between Two Data Frame Variables
- All R Programming Examples
In summary: At this point you should have learned how to combine data frame rows where the column names are different and add the second data frame at the end or bottom of the first data frame in R programming.
Tell me about it in the comments section below, in case you have additional questions. Besides that, don’t forget to subscribe to my email newsletter for updates on the newest articles.
Statistics Globe Newsletter