Draw Multiple Variables as Lines to Same ggplot2 Plot in R (2 Examples)

 

In this tutorial you’ll learn how to plot two or more lines to only one ggplot2 graph in R programming.

The article is structured as follows:

You’re here for the answer, so let’s get straight to the exemplifying R syntax.

 

Example Data, Packages & Default Plot

The following data is used as basement for this R programming tutorial:

set.seed(6532465)                    # Create example data
data <- data.frame(x = 1:50,
                   y1 = sort(rnorm(50)),
                   y2 = sort(rnorm(50, 0.5)))
head(data)                           # Head of example data
#   x        y1         y2
# 1 1 -2.233737 -0.9549823
# 2 2 -1.836179 -0.9039053
# 3 3 -1.828040 -0.7433467
# 4 4 -1.691616 -0.6736192
# 5 5 -1.522380 -0.6325588
# 6 6 -1.437409 -0.6307781

Have a look at the previous output of the RStudio console. It shows that our example data consists of three columns. The variable x ranges from 1 to 50 and represents the x-axis values of our plot. The variables y1 and y2 represent the y-axis values of two different lines we will draw in this tutorial.

If we want to create a plot of our data with the ggplot2 package, we also have to install and load ggplot2:

install.packages("ggplot2")          # Install ggplot2 package
library("ggplot2")                   # Load ggplot2 package

Now, we can move on to the plotting of our data.

 

Example 1: Plotting Two Lines in Same ggplot2 Graph Using geom_line() Multiple Times

In this Example, I’ll illustrate how to draw two lines to a single ggplot2 plot using the geom_line function of the ggplot2 package. For this, we have to specify our x-axis values within the aes of the ggplot function. The values for the y-axis are specified within the two geom_line commands:

ggp1 <- ggplot(data, aes(x)) +       # Create ggplot2 plot
  geom_line(aes(y = y1), color = "red") +
  geom_line(aes(y = y2), color = "blue")
ggp1                                 # Draw ggplot2 plot

 

r graph figure 1 draw multiple lines same ggplot2 r

 

The output of the previous R programming syntax is shown in Figure 1: It’s a ggplot2 line graph showing multiple lines.

 

Example 2: Plotting Two Lines in Same ggplot2 Graph Using Data in Long Format

In Example 1 you have learned how to use the geom_line function several times for the same graphic. However, this methodology is not convenient for a large number of lines, since we would have to specify the geom_line function for each line we want to draw.

The following syntax shows a more general approach for the plotting of multiple lines for each group in a ggplot2 plot by reshaping our data frame from wide to long format.

We will use the functions of the reshape2 package to transform our data from wide to long format. Hence, we first have to install and load the reshape2 package:

install.packages("reshape2")         # Install & load reshape2 package
library("reshape2")

Now, we can convert our data from wide to long format as shown below:

data_long <- melt(data, id = "x")    # Convert data to long format
head(data_long)                      # Head of long data
#   x variable     value
# 1 1       y1 -2.233737
# 2 2       y1 -1.836179
# 3 3       y1 -1.828040
# 4 4       y1 -1.691616
# 5 5       y1 -1.522380
# 6 6       y1 -1.437409

Finally, we can use our long data to draw a ggplot2 graph containing multiple lines as shown below:

ggp2 <- ggplot(data_long,            # Create ggplot2 plot
               aes(x = x,
                   y = value,
                   color = variable)) +
  geom_line()
ggp2                                 # Draw ggplot2 plot

 

r graph figure 2 draw multiple lines same ggplot2 r

 

As shown in Figure 2, the previous R programming syntax created a similar ggplot2 plot as in Example 1. However, this time the R code is more general and can easily be applied to large data sets.

 

Video & Further Resources

In case you need further info on the R programming code of this article, you may have a look at the following video of my YouTube channel. In the video, I show the topics of this page.

 

 

Besides the video, you may want to read the related articles on this website. A selection of tutorials on related topics such as dates, graphics in r, regression models, and lines can be found below.

 

In this R tutorial you learned how to create a ggplot2 plot containing multiple lines with multiple groupings for the geom_line function. Don’t hesitate to let me know in the comments, in case you have further questions or comments. Besides that, please subscribe to my email newsletter for updates on new tutorials.

 

Subscribe to the Statistics Globe Newsletter

Get regular updates on the latest tutorials, offers & news at Statistics Globe.
I hate spam & you may opt out anytime: Privacy Policy.


22 Comments. Leave new

  • Do I see it correctly?
    January 17, 2021 10:35 pm

    I copied the procedure in your Example 1, and noticed my plots had color problems. I returned to your site, and see that your results have the same problems I did. Your seem to call for y1 to be red; however, the y1 line in the chart appears to be blue-green (not really even “blue”). Likewise, the other line appears switched and the legend appears to be mixed up as well. I am not sure yet what causes the problems.

    Reply
    • Wow, thanks a lot for this! I didn’t notice this error in my code when I was creating the tutorial.

      I have just adopted your code and have changed the graphic.

      Thanks again!

      Joachim

      Reply
  • Do I see it correctly?
    January 17, 2021 10:46 pm

    I think I found the correct program:

    ggp1 <- ggplot(data, aes(x)) + # Create ggplot2 plot
    geom_line(aes(y = y1), color = "red") +
    geom_line(aes(y = y2), color = "blue")
    ggp1 # Draw ggplot2 plot

    Reply
  • How do you deal with NA in your time series?

    Reply
  • Sonwil Yakubu
    August 19, 2021 12:02 pm

    I want to determine structural breaks in mean and variance at the end of sample period that is outside the admissible region of (0.15T, 0.85T).

    I want to determine the size of the breaks as well and estimate breaks size on forecasting as well. Any help?

    Reply
  • This is a nice tutorial and has been very helpful. The only thing missing is how to plot a subsection of the y values. For example my long data set has six outcome variables which I want to look at in pairs rather than all at once. How would I specify only two rather than the whole data set?

    Reply
    • Hey Daniel,

      Thank you, glad you like the tutorial!

      Regarding your question: One way to do this is to subset your data frame accordingly. For instance, you could extract all rows with the groups y1 and y2 from the long data set: data_long[data_long$variable %in% c(“y1” , “y2”), ]

      Regards,
      Joachim

      Reply
  • Un excelente aporte, muchas gracias por brindar ayuda. Saludos

    Reply
  • I appreciate your help
    I have a data set which contains 41 columns. First column is date and rest of the column represents places. I want to draw multiple lines(40 lines) using ggplt. I used the following code

    library(reshape2) #
    em <- melt(data, id = "date")

    plot1 <- qplot(date, value, data = em, geom = "line", group = variable) +
    facet_grid(variable ~ ., scale = "free_y")
    ggsave("plot1.jpg",plot1, width = 6.5, height =6.5 )
    But the resulting graph is too small. It does not give any information.
    Can u give me some information how can I better my graph?

    Reply
  • Yes, the size of the graph is too small that I can’t get any information from it.

    Reply
  • Hallo. I have this data that I want to produce four graphs. That is a graph for every five timesteps. (This is just a sample. The data is very huge.) My problem is I can only produce the graphs if I change the data format and then change it to long data. Is there a way I can use loops and ggplot to do it?
    Timestep Rate
    0 -0.00061
    1 -0.006777
    2 -0.004195
    3 -0.000712
    4 0.003213
    5 0.006641
    6 0.009079
    7 0.010617
    8 0.011552
    9 0.012175
    10 0.012618
    11 0.012815
    12 0.012762
    13 0.012649
    14 0.012477
    15 0.012268
    16 0.012184
    17 0.012238
    18 0.012422
    19 0.01265
    20 0.012922

    Reply
  • Hi! I am currently trying to examine a dataset regarding which distribution fits it the best. Here I want to plot the histogram with the data, plot the density curve AND the fitted distribution. But when plotting the exponential distribution on top of the histogram and density curve it just lays in the bottom of the plot.. I guess it is because of the scale of the rate, but the rate is the one I get from using the fitdistr… However, I am not sure how to get around this problem.. do you have any suggestions?

    Reply

Leave a Reply

Your email address will not be published. Required fields are marked *

Fill out this field
Fill out this field
Please enter a valid email address.

Top