Using ggplot2
¶
In [1]:
suppressPackageStartupMessages(library(tidyverse))
Warning message:
“Installed Rcpp (0.12.12) different from Rcpp used to build dplyr (0.12.11).
Please reinstall dplyr to avoid random crashes or undefined behavior.”Warning message:
“package ‘dplyr’ was built under R version 3.4.1”
In [2]:
head(iris)
Sepal.Length | Sepal.Width | Petal.Length | Petal.Width | Species |
---|---|---|---|---|
5.1 | 3.5 | 1.4 | 0.2 | setosa |
4.9 | 3.0 | 1.4 | 0.2 | setosa |
4.7 | 3.2 | 1.3 | 0.2 | setosa |
4.6 | 3.1 | 1.5 | 0.2 | setosa |
5.0 | 3.6 | 1.4 | 0.2 | setosa |
5.4 | 3.9 | 1.7 | 0.4 | setosa |
In [3]:
options(repr.plot.width=4, repr.plot.height=3)
Map data attributes to aesthetics¶
In [4]:
g <- ggplot(iris, aes(x=Sepal.Length, y=Sepal.Width, color=Species))
g
data:image/s3,"s3://crabby-images/4de80/4de803a3e9fb9fde0fbabc7da0ebc7e75aaaffa8" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_5_1.png"
Use different geometries for the same mapping¶
In [5]:
g + geom_point()
data:image/s3,"s3://crabby-images/ea79d/ea79dedb1516262c5205e790f57aee9adee7c40e" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_7_1.png"
In [6]:
g + geom_density2d()
data:image/s3,"s3://crabby-images/40f83/40f837dc1e0359428b8cb5352fb3b29686582c5f" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_8_1.png"
In [7]:
g + geom_smooth()
`geom_smooth()` using method = 'loess'
data:image/s3,"s3://crabby-images/0a05f/0a05f69a620a918ce7e78c3551bb6c9abae401e1" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_9_2.png"
Combine geometries¶
In [8]:
g + geom_point() + geom_smooth()
`geom_smooth()` using method = 'loess'
data:image/s3,"s3://crabby-images/d4b3a/d4b3afbe26f3e5e2f53780ab3b1b49ebb0d0a92e" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_11_2.png"
Changing labels¶
In [9]:
g + geom_point() + geom_smooth() +
labs(x="Sepal Length", y="Sepa; Width", title="Iris", subtitle="Plotted with ggplot")
`geom_smooth()` using method = 'loess'
data:image/s3,"s3://crabby-images/240fd/240fd3bbd5c99d5a685d384be53c45aac01e2ff0" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_13_2.png"
Changing scales¶
In [10]:
g +
geom_jitter(width = 0.2) +
scale_y_log10()
data:image/s3,"s3://crabby-images/5a45c/5a45c0e131aa2719c6c0c45a9202527d5fb1e181" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_15_1.png"
Group plots using facets
¶
In [11]:
g + geom_point() + geom_smooth() +
facet_wrap(~ Species)
`geom_smooth()` using method = 'loess'
data:image/s3,"s3://crabby-images/51015/510155a461ecc834c8d14aa2732866024471e99b" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_17_2.png"
In [12]:
g + geom_point() + geom_smooth() +
facet_wrap(~ Species, ncol = 1)
`geom_smooth()` using method = 'loess'
data:image/s3,"s3://crabby-images/39d99/39d99f24c9d3c15f354308262288ec0e332481b0" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_18_2.png"
Turning guides
on and off¶
In [13]:
g + geom_point() + geom_smooth() +
facet_wrap(~ Species) +
guides(color=FALSE)
`geom_smooth()` using method = 'loess'
data:image/s3,"s3://crabby-images/5bcfc/5bcfc9accfb878d5d2c98c24b3b5592eb461ea97" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_20_2.png"
Reshaping data.frames
with gather
and plotting¶
In [14]:
iris %>% gather(measure, value, -Species) %>% head
Species | measure | value |
---|---|---|
setosa | Sepal.Length | 5.1 |
setosa | Sepal.Length | 4.9 |
setosa | Sepal.Length | 4.7 |
setosa | Sepal.Length | 4.6 |
setosa | Sepal.Length | 5.0 |
setosa | Sepal.Length | 5.4 |
In [15]:
options(repr.plot.width=6, repr.plot.height=4)
In [16]:
g2 <- ggplot(iris %>% gather(measure, value, -Species),
aes(x=Species, y=value, fill=Species, color=Species))
In [17]:
g2 + facet_wrap(~ measure) + geom_boxplot()
data:image/s3,"s3://crabby-images/c51f8/c51f819dfa0456dea2f16c737a75dbf18fbbef26" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_25_1.png"
In [18]:
g2 + facet_wrap(~ measure) + geom_jitter()
data:image/s3,"s3://crabby-images/b9814/b981468c2ffc1713be873961b531e16183cfe068" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_26_1.png"
In [19]:
g2 + facet_wrap(~ measure) + geom_bar(stat="identity")
data:image/s3,"s3://crabby-images/80779/80779547ca4b74f274ccf7a255b5d5f7119c1249" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_27_1.png"
Allowing different scales for each plot¶
In [20]:
g2 + facet_wrap(~ measure, scales="free") + geom_boxplot()
data:image/s3,"s3://crabby-images/52356/523565dc6bc1f289318f322cbb85edf957a2a2e8" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_29_1.png"
Change coordinates¶
In [21]:
g2 +
geom_boxplot() +
coord_flip()
data:image/s3,"s3://crabby-images/d1665/d16657a0f2c0e398f2745c4718cd611d87a89eb4" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_31_1.png"
In [22]:
options(repr.plot.width=6, repr.plot.height=6)
In [23]:
polar <- g2 +
facet_wrap(~ measure) +
geom_jitter(width = 0.2, size=1) +
coord_polar() +
guides(color=FALSE, fill=FALSE) +
labs(x="", y="")
In [24]:
polar
data:image/s3,"s3://crabby-images/1625b/1625b0dc0e2475dceb5034803345465128f8a9f6" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_34_1.png"
Transparency¶
In [25]:
ggplot(iris, aes(x=Sepal.Length, fill=Species)) +
geom_density(alpha=0.5)
data:image/s3,"s3://crabby-images/a59c1/a59c11255f7f1dd7de69a781b732df8ad1e47ba8" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_36_1.png"
Themes¶
In [26]:
ggplot(iris, aes(x=Sepal.Length, fill=Species)) +
geom_density(alpha=0.5) +
theme_bw()
data:image/s3,"s3://crabby-images/fec21/fec214af1905279bd207fa53c46d207f4f0c2a11" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_38_1.png"
In [28]:
ggplot(iris, aes(x=Sepal.Length, fill=Species)) +
geom_density(alpha=0.5) +
theme_classic()
data:image/s3,"s3://crabby-images/19606/19606f161c6f71655a5d71c0efe403eb45fe07c6" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_39_1.png"
In [247]:
ggplot(iris, aes(x=Sepal.Length, fill=Species)) +
geom_density(alpha=0.5) +
theme_dark() +
theme(axis.text.x = element_text(colour = 'red', size=20),
axis.text.y = element_text(color = 'blue', size=20))
data:image/s3,"s3://crabby-images/91de2/91de2bf9b50376b2a29fedd11f46475fcb55c20e" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_40_1.png"
Using color scales¶
In [62]:
options(repr.plot.width=4, repr.plot.height=3)
Discrete colors or fills¶
In [63]:
g3 <- ggplot(iris %>% gather(measure, value, -Species),
aes(x=Species, y=value, fill=Species)) +
geom_bar(stat="identity")
In [64]:
g3
data:image/s3,"s3://crabby-images/4fa60/4fa608853810bcb5391b1aee0cdf7f098a8dee34" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_45_1.png"
In [69]:
g3 + scale_fill_brewer(type='seq')
data:image/s3,"s3://crabby-images/e5b7f/e5b7f7be8700474fa51f275ed51a2389ebc34d5e" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_46_1.png"
In [67]:
g3 + scale_fill_brewer(type='div')
data:image/s3,"s3://crabby-images/f477d/f477d0290fca190d05205a7d9d377a2e41e36a1b" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_47_1.png"
In [68]:
g3 + scale_fill_brewer(type='qual')
data:image/s3,"s3://crabby-images/da28f/da28fa3b1014141fa0c25268a2458c5d791dbccc" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_48_1.png"
In [71]:
g3 + scale_fill_brewer(type='seq', palette = 'Reds')
data:image/s3,"s3://crabby-images/1c29e/1c29e5443bdcc7eca60e3b9d4335fbab2342b385" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_49_1.png"
In [76]:
g3 + scale_fill_brewer(type='seq', palette = 'Reds', direction = -1)
data:image/s3,"s3://crabby-images/6e786/6e7860e849eb29455230d64c9de291e3a459e114" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_50_1.png"
Continuous colors or fills¶
In [110]:
suppressPackageStartupMessages(library(genefilter))
In [116]:
n <- 20
m <- 50000
EXPRS <- matrix(rnorm(m * 2 * n), m, 2*n)
rownames(EXPRS) <- paste('g', 1:m, sep='')
colnames(EXPRS) <- paste('pt', 1:(2*n), sep='')
grp <- as.factor(rep(c("Control", "Treated"), each=n))
In [117]:
p.values <- rowttests(EXPRS, grp)$p.value
ii <- order(p.values)
TOPEXPRS <- EXPRS[ii[1:100], ]
In [119]:
M <- data.frame(t(TOPEXPRS)) %>% rownames_to_column("pid") %>% gather(gene, expression, -pid)
In [120]:
head(M)
pid | gene | expression |
---|---|---|
pt1 | g156 | -0.4724457 |
pt2 | g156 | 0.3118261 |
pt3 | g156 | -0.4587830 |
pt4 | g156 | -1.1015636 |
pt5 | g156 | -0.3718592 |
pt6 | g156 | -1.0262855 |
In [165]:
options(repr.plot.width=6, repr.plot.height=4)
In [176]:
g4 <- ggplot(M, aes(gene, pid, fill=expression)) +
geom_tile(colour='white') +
theme(axis.text.x = element_blank(),
axis.text.y = element_blank(),
axis.ticks.x = element_blank(),
axis.ticks.y = element_blank())
In [177]:
g4
data:image/s3,"s3://crabby-images/0c51b/0c51bb5e70adfbdb2dc75f48f8e459fe60b9a481" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_59_1.png"
In [178]:
g4 + scale_fill_gradient(low = "white", high="red")
data:image/s3,"s3://crabby-images/f52eb/f52eb03ba84d7df10d3382eab68de62f97a89562" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_60_1.png"
In [188]:
g4 + scale_fill_gradient2(low = "darkgreen", high="darkblue")
data:image/s3,"s3://crabby-images/45be5/45be5dc24f5149086037994dc82465be60c8819f" alt="../../_images/Computation_Wk4_Day1_AM_Using_ggplot2_61_1.png"
Check saved plot¶
data:image/s3,"s3://crabby-images/ecb8d/ecb8d3eb84f686928ebe5fb4b75f44e48dc14a6f" alt=""
Exercise¶
Hint: If nothing is plotted, wrap the entire R expression in a
print()
statement to see the error message.
In [199]:
head(mtcars)
mpg | cyl | disp | hp | drat | wt | qsec | vs | am | gear | carb | |
---|---|---|---|---|---|---|---|---|---|---|---|
Mazda RX4 | 21.0 | 6 | 160 | 110 | 3.90 | 2.620 | 16.46 | 0 | 1 | 4 | 4 |
Mazda RX4 Wag | 21.0 | 6 | 160 | 110 | 3.90 | 2.875 | 17.02 | 0 | 1 | 4 | 4 |
Datsun 710 | 22.8 | 4 | 108 | 93 | 3.85 | 2.320 | 18.61 | 1 | 1 | 4 | 1 |
Hornet 4 Drive | 21.4 | 6 | 258 | 110 | 3.08 | 3.215 | 19.44 | 1 | 0 | 3 | 1 |
Hornet Sportabout | 18.7 | 8 | 360 | 175 | 3.15 | 3.440 | 17.02 | 0 | 0 | 3 | 2 |
Valiant | 18.1 | 6 | 225 | 105 | 2.76 | 3.460 | 20.22 | 1 | 0 | 3 | 1 |
1. Make a scatter plot with y=mpg
and x=wt
In [ ]:
2. Add a linear regression curve.
In [ ]:
3. Add a title ‘Fuel efficiency decreases with weight’, and rename the x and y axis to ‘Weight’ and ‘Miles per gallon’.
In [ ]:
4. Change the color of the scatter points to salmon
.
In [ ]:
5. Change the color of the scatter points to represent the
horsepower hp
.
In [ ]:
6. Use color brewer to set the scale in Q4 with the Oranges
seqeuntial palette for the cyl
variable.
In [ ]:
7. Make a density plot of mpg
and fill by the factor cyl
,
and set the transparecny to 0.5.
In [ ]:
8. Repeat Q7, but use 3 separate plots. Remove the legend.
In [ ]:
9. Create a scatter plot -log(p value) on the y-axis and SNP
location on the x-axis, coloring by chromosome number. This is known as
a Manhattan plot. Use the code below to simulate data for the plot. Use
the Set3
palette of qual
type in scale_color_brewer
for the
color scheme.
n <- 10000 # number of genes
position <- 1:n
chromosome <- factor(rep(1:10, each=n/10))
p.value <- runif(n)
df <- data.frame(position=position, chromosome=chromosome, p.value=p.value)
In [ ]: