Using ggplot2
: Solutions¶
In [3]:
suppressPackageStartupMessages(library(tidyverse))
Warning message:
“Installed Rcpp (0.12.12) different from Rcpp used to build dplyr (0.12.11).
Please reinstall dplyr to avoid random crashes or undefined behavior.”Warning message:
“package ‘dplyr’ was built under R version 3.4.1”
In [4]:
head(iris)
Sepal.Length | Sepal.Width | Petal.Length | Petal.Width | Species |
---|---|---|---|---|
5.1 | 3.5 | 1.4 | 0.2 | setosa |
4.9 | 3.0 | 1.4 | 0.2 | setosa |
4.7 | 3.2 | 1.3 | 0.2 | setosa |
4.6 | 3.1 | 1.5 | 0.2 | setosa |
5.0 | 3.6 | 1.4 | 0.2 | setosa |
5.4 | 3.9 | 1.7 | 0.4 | setosa |
In [5]:
options(repr.plot.width=4, repr.plot.height=3)
Map data attributes to aesthetics¶
In [6]:
g <- ggplot(iris, aes(x=Sepal.Length, y=Sepal.Width, color=Species))
g
data:image/s3,"s3://crabby-images/a8b9a/a8b9a90e9f20305dffcda4295ee9e11b41526afa" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_5_1.png"
Use different geometries for the same mapping¶
In [7]:
g + geom_point()
data:image/s3,"s3://crabby-images/0185a/0185a2b493b79dedb95bbe4330226b24ff0e1ec6" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_7_1.png"
In [8]:
g + geom_density2d()
data:image/s3,"s3://crabby-images/2030c/2030c5c38ccc05ad82bce1aee9cf91efdc35ac57" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_8_1.png"
In [9]:
g + geom_smooth()
`geom_smooth()` using method = 'loess'
data:image/s3,"s3://crabby-images/86e80/86e80406bdbf6772646d44d5da6f766ec8d606d0" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_9_2.png"
Combine geometries¶
In [10]:
g + geom_point() + geom_smooth()
`geom_smooth()` using method = 'loess'
data:image/s3,"s3://crabby-images/3c88b/3c88bf98a5630e87776906bdde8da21e973a94f8" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_11_2.png"
Changing labels¶
In [11]:
g + geom_point() + geom_smooth() +
labs(x="Sepal Length", y="Sepa; Width", title="Iris", subtitle="Plotted with ggplot")
`geom_smooth()` using method = 'loess'
data:image/s3,"s3://crabby-images/eb1a3/eb1a3468f8c564dbf07bcb0c2cd0c96a723c6860" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_13_2.png"
Changing scales¶
In [12]:
g +
geom_jitter(width = 0.2) +
scale_y_log10()
data:image/s3,"s3://crabby-images/43890/4389065146501ccc9d05847fdb1c3e25d81e6b55" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_15_1.png"
Group plots using facets
¶
In [13]:
g + geom_point() + geom_smooth() +
facet_wrap(~ Species)
`geom_smooth()` using method = 'loess'
data:image/s3,"s3://crabby-images/f8514/f851461fe9ccddde4ead10ca4a607d86a5ee0d5e" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_17_2.png"
In [14]:
g + geom_point() + geom_smooth() +
facet_wrap(~ Species, ncol = 1)
`geom_smooth()` using method = 'loess'
data:image/s3,"s3://crabby-images/ad5a5/ad5a552e616e7b8cf322e894bea6abd29d1f939f" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_18_2.png"
Turning guides
on and off¶
In [15]:
g + geom_point() + geom_smooth() +
facet_wrap(~ Species) +
guides(color=FALSE)
`geom_smooth()` using method = 'loess'
data:image/s3,"s3://crabby-images/d65a9/d65a9bee92677198aaee49661fb2dcf15577b12d" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_20_2.png"
Reshaping data.frames
with gather
and plotting¶
In [16]:
iris %>% gather(measure, value, -Species) %>% head
Species | measure | value |
---|---|---|
setosa | Sepal.Length | 5.1 |
setosa | Sepal.Length | 4.9 |
setosa | Sepal.Length | 4.7 |
setosa | Sepal.Length | 4.6 |
setosa | Sepal.Length | 5.0 |
setosa | Sepal.Length | 5.4 |
In [17]:
options(repr.plot.width=6, repr.plot.height=4)
In [18]:
g2 <- ggplot(iris %>% gather(measure, value, -Species),
aes(x=Species, y=value, fill=Species, color=Species))
In [19]:
g2 + facet_wrap(~ measure) + geom_boxplot()
data:image/s3,"s3://crabby-images/5cc13/5cc13e07c9b716383b7d107683405159a6a2e73e" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_25_1.png"
In [20]:
g2 + facet_wrap(~ measure) + geom_jitter()
data:image/s3,"s3://crabby-images/c424e/c424e5b957dc8d4aaadfb8155de8cdea24a2fc1f" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_26_1.png"
In [21]:
g2 + facet_wrap(~ measure) + geom_bar(stat="identity")
data:image/s3,"s3://crabby-images/ad437/ad43761309001094e4704dd2fba27074ca9a4569" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_27_1.png"
Allowing different scales for each plot¶
In [22]:
g2 + facet_wrap(~ measure, scales="free") + geom_boxplot()
data:image/s3,"s3://crabby-images/ad6bd/ad6bdfb6f960caa760d6a486f2f229096c8fe6d7" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_29_1.png"
Change coordinates¶
In [23]:
g2 +
geom_boxplot() +
coord_flip()
data:image/s3,"s3://crabby-images/e480e/e480e827f793b6da1b75f77c664e3b60f98bdf52" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_31_1.png"
In [24]:
options(repr.plot.width=6, repr.plot.height=6)
In [25]:
polar <- g2 +
facet_wrap(~ measure) +
geom_jitter(width = 0.2, size=1) +
coord_polar() +
guides(color=FALSE, fill=FALSE) +
labs(x="", y="")
In [26]:
polar
data:image/s3,"s3://crabby-images/fd018/fd018eabe401b81108d36966e1d45e1e333f16fb" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_34_1.png"
Transparency¶
In [27]:
ggplot(iris, aes(x=Sepal.Length, fill=Species)) +
geom_density(alpha=0.5)
data:image/s3,"s3://crabby-images/74c45/74c45180b77868c3b475b03f6d9e3f440fb72ccc" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_36_1.png"
Themes¶
In [28]:
ggplot(iris, aes(x=Sepal.Length, fill=Species)) +
geom_density(alpha=0.5) +
theme_bw()
data:image/s3,"s3://crabby-images/b78b2/b78b2ac15c846ad2132312c84cfadf36b7fb01bb" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_38_1.png"
In [30]:
ggplot(iris, aes(x=Sepal.Length, fill=Species)) +
geom_density(alpha=0.5) +
theme_classic()
data:image/s3,"s3://crabby-images/b25e7/b25e7fcdd2058d26530004497949cd63ad248d94" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_39_1.png"
In [31]:
ggplot(iris, aes(x=Sepal.Length, fill=Species)) +
geom_density(alpha=0.5) +
theme_dark() +
theme(axis.text.x = element_text(colour = 'red', size=20),
axis.text.y = element_text(color = 'blue', size=20))
data:image/s3,"s3://crabby-images/daf4a/daf4a6cc2a9b98bfda8ed4b3ca105bf752431143" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_40_1.png"
Using color scales¶
In [32]:
options(repr.plot.width=4, repr.plot.height=3)
Discrete colors or fills¶
In [33]:
g3 <- ggplot(iris %>% gather(measure, value, -Species),
aes(x=Species, y=value, fill=Species)) +
geom_bar(stat="identity")
In [34]:
g3
data:image/s3,"s3://crabby-images/df0b9/df0b945bc4badc08a7a77773453d495c3c778b4c" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_45_1.png"
In [35]:
g3 + scale_fill_brewer(type='seq')
data:image/s3,"s3://crabby-images/187d0/187d059bb9cfca9f4fa10c9a1d0589b4ba27874b" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_46_1.png"
In [36]:
g3 + scale_fill_brewer(type='div')
data:image/s3,"s3://crabby-images/2b79a/2b79ae0aa1399480a4c2c5031d62468151c3d16b" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_47_1.png"
In [37]:
g3 + scale_fill_brewer(type='qual')
data:image/s3,"s3://crabby-images/aadd2/aadd2a8bed75e358fd1515a1ef5df516acdfa139" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_48_1.png"
In [38]:
g3 + scale_fill_brewer(type='seq', palette = 'Reds')
data:image/s3,"s3://crabby-images/58565/585656ad6afeb57fd3a79fc98a2a5c3eb1f655ce" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_49_1.png"
In [39]:
g3 + scale_fill_brewer(type='seq', palette = 'Reds', direction = -1)
data:image/s3,"s3://crabby-images/24d51/24d51ff66bd8d0d69b107b01a80def5501df3a38" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_50_1.png"
Continuous colors or fills¶
In [40]:
suppressPackageStartupMessages(library(genefilter))
In [41]:
n <- 20
m <- 50000
EXPRS <- matrix(rnorm(m * 2 * n), m, 2*n)
rownames(EXPRS) <- paste('g', 1:m, sep='')
colnames(EXPRS) <- paste('pt', 1:(2*n), sep='')
grp <- as.factor(rep(c("Control", "Treated"), each=n))
In [42]:
p.values <- rowttests(EXPRS, grp)$p.value
ii <- order(p.values)
TOPEXPRS <- EXPRS[ii[1:100], ]
In [43]:
M <- data.frame(t(TOPEXPRS)) %>% rownames_to_column("pid") %>% gather(gene, expression, -pid)
In [44]:
head(M)
pid | gene | expression |
---|---|---|
pt1 | g36200 | 0.537766165 |
pt2 | g36200 | -0.447619190 |
pt3 | g36200 | 1.262114958 |
pt4 | g36200 | 0.205835640 |
pt5 | g36200 | 0.292946391 |
pt6 | g36200 | -0.007104385 |
In [45]:
options(repr.plot.width=6, repr.plot.height=4)
In [46]:
g4 <- ggplot(M, aes(gene, pid, fill=expression)) +
geom_tile(colour='white') +
theme(axis.text.x = element_blank(),
axis.text.y = element_blank(),
axis.ticks.x = element_blank(),
axis.ticks.y = element_blank())
In [47]:
g4
data:image/s3,"s3://crabby-images/d8cc1/d8cc1d4faed8e74873bbd439a5ef6cfdecb4f394" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_59_1.png"
In [48]:
g4 + scale_fill_gradient(low = "white", high="red")
data:image/s3,"s3://crabby-images/f8792/f8792596c75cda848252cb14011971d7af2121e3" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_60_1.png"
In [49]:
g4 + scale_fill_gradient2(low = "darkgreen", high="darkblue")
data:image/s3,"s3://crabby-images/ad284/ad2847cdd8ecc7758c627c5ac6160a2ccf032c50" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_61_1.png"
Check saved plot¶
data:image/s3,"s3://crabby-images/ecb8d/ecb8d3eb84f686928ebe5fb4b75f44e48dc14a6f" alt=""
Exercise¶
Hint: If nothing is plotted, wrap the entire R expression in a
print()
statement to see the error message.
In [51]:
head(mtcars)
mpg | cyl | disp | hp | drat | wt | qsec | vs | am | gear | carb | |
---|---|---|---|---|---|---|---|---|---|---|---|
Mazda RX4 | 21.0 | 6 | 160 | 110 | 3.90 | 2.620 | 16.46 | 0 | 1 | 4 | 4 |
Mazda RX4 Wag | 21.0 | 6 | 160 | 110 | 3.90 | 2.875 | 17.02 | 0 | 1 | 4 | 4 |
Datsun 710 | 22.8 | 4 | 108 | 93 | 3.85 | 2.320 | 18.61 | 1 | 1 | 4 | 1 |
Hornet 4 Drive | 21.4 | 6 | 258 | 110 | 3.08 | 3.215 | 19.44 | 1 | 0 | 3 | 1 |
Hornet Sportabout | 18.7 | 8 | 360 | 175 | 3.15 | 3.440 | 17.02 | 0 | 0 | 3 | 2 |
Valiant | 18.1 | 6 | 225 | 105 | 2.76 | 3.460 | 20.22 | 1 | 0 | 3 | 1 |
1. Make a scatter plot with y=mpg
and x=wt
In [52]:
g <- ggplot(mtcars, aes(wt, mpg)) + geom_point()
In [53]:
g
data:image/s3,"s3://crabby-images/0ad62/0ad6271f6d17e7ad8a1a23e8084cd37e3c45d923" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_69_1.png"
2 Add a linear regression curve.
In [54]:
g + geom_smooth(method='lm')
data:image/s3,"s3://crabby-images/30128/30128d49df73500ef38808c4c0ab22a5be230836" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_71_1.png"
3. Add a title ‘Fuel efficiency decreases with weight’, and rename the x and y axis to ‘Weight’ and ‘Miles per gallon’.
In [55]:
g + geom_smooth(method='lm') +
labs(x="Weight", y="Miles per gallon",
title="Fuel efficiency decreases with weight")
data:image/s3,"s3://crabby-images/90ec2/90ec25ea5367683463bafa1b273f7104b49735e5" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_73_1.png"
4. Change the color of the scatter points to salmon
.
In [56]:
g + geom_point(color='salmon') +
geom_smooth(method='lm') +
labs(x="Weight", y="Miles per gallon",
title="Fuel efficiency decreases with weight")
data:image/s3,"s3://crabby-images/8976c/8976c241d09d7b863c28ab752ec0f55a76090491" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_75_1.png"
4. Change the color of the scatter points to represent the
horsepower hp
.
In [57]:
g + geom_point(aes(color=hp)) +
geom_smooth(method='lm') +
labs(x="Weight", y="Miles per gallon",
title="Fuel efficiency decreases with weight")
data:image/s3,"s3://crabby-images/65063/65063b3334ef63a7842a02dd5bcb0aa610e829a9" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_77_1.png"
5. Use color brewer to set the scale in Q4 with the Oranges
seqeuntial palette for the cyl
variable.
In [58]:
g + geom_point(aes(color=as.factor(cyl))) +
geom_smooth(method='lm') +
scale_color_brewer(type='seq', palette = 'Reds') +
labs(x="Weight", y="Miles per gallon",
title="Fuel efficiency decreases with weight")
data:image/s3,"s3://crabby-images/70411/70411424ad8d00de332137c3ade4ea6bda0ca479" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_79_1.png"
7. Make a density plot of mpg
and fill by the factor cyl
,
and set the transparecny to 0.5.
In [59]:
ggplot(mtcars, aes(mpg, fill=as.factor(cyl))) +
geom_density(alpha=0.5)
data:image/s3,"s3://crabby-images/839dd/839dd1d2d3a5072c0ef43f1b59169f6220ee7401" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_81_1.png"
8. Repeat Q7, but use 3 separate plots. Remove the legend.
In [60]:
ggplot(mtcars, aes(mpg, fill=as.factor(cyl))) +
facet_wrap(~ cyl) +
geom_density(alpha=0.5) +
guides(fill=FALSE)
data:image/s3,"s3://crabby-images/660d3/660d3ddd9e91907e5ad33bb9753c666308cfc7f3" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_83_1.png"
9. Create a scatter plot -log(p value) on the y-axis and SNP
location on the x-axis, coloring by chromosome number. This is known as
a Manhattan plot. Use the code below to simulate data for the plot. Use
the Set3
palette of qual
type in scale_color_brewer
for the
color scheme.
n <- 10000 # number of genes
position <- 1:n
chromosome <- factor(rep(1:10, each=n/10))
p.value <- runif(n)
df <- data.frame(position=position, chromosome=chromosome, p.value=p.value)
In [61]:
n <- 10000 # number of genes
position <- 1:n
chromosome <- factor(rep(1:10, each=n/10))
p.value <- runif(n)
df <- data.frame(position=position, chromosome=chromosome, p.value=p.value)
In [62]:
ggplot(df %>% mutate(log.p.value = -log(p.value)),
aes(x=position, y=log.p.value, color=chromosome)) +
geom_point(size=0.5) +
scale_color_brewer(type='qual', palette='Set3')
data:image/s3,"s3://crabby-images/1d9ce/1d9ceb3d85d8d2835ff110f1d21701fc240a45ef" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplo2_Solutions_86_1.png"
In [ ]: