《拟时序分析》6.monocle3的拟时序分析

经过前两次课程的讲解,我们不断的将monocle3与monocle2、Seurat进行对比,相信大家已经能熟练掌握monocle3的语法与功能([《拟时序分析》5.monocle3的降维、分群、聚类(http://mp.weixin.qq.com/s?__biz=MzAwMzIzOTk5OQ==&mid=2247489494&idx=1&sn=44f0a44857c12fcf9a1ab84e4f2cbebf&chksm=9b3f6e86ac48e79026321de2fdca81a139d104d655884ffe87a68e0a1551ddffa5af7919d19e&scene=21#wechat_redirect))。

单细胞拟时序系列课程会继续更新,请大家持续关注:


image.png

视频教程已上传至B站,新做了一个片头,欢迎大家来三联:
https://www.bilibili.com/video/BV1br4y1x7Hf?p=10

图文教程

在我看来monocle3与monocle2最大的特点无非就是这俩:
1、可以交互式地选择拟时序的起点
2、可以采取3D的形式展示轨迹图

3.1.预处理数据

rm(list = ls());gc()#清除镜像重来
##            used  (Mb) gc trigger   (Mb)  max used   (Mb)
## Ncells 11383036 608.0   27726538 1480.8  27726538 1480.8
## Vcells 21313689 162.7   65077689  496.6 198600815 1515.3
#老一套、读入并创建cds对象expression_matrix <- readRDS('author.pro/expression_matrix.rds')cell_metadata <- readRDS('author.pro/cell_metadata.rds')gene_annotation <- readRDS('author.pro/gene_annotation.rds')cds <- new_cell_data_set(expression_matrix,                         cell_metadata = cell_metadata,                         gene_metadata = gene_annotation)cds <- preprocess_cds(cds, num_dim = 50)#预处理cds <- align_cds(cds, alignment_group = "plate",                 residual_model_formula_str = '~Size_Factor')

residual_model_formula_str可以指定连续变量进行去批次,例如某一基因/基因集的表达值,否则这些变量以分类变量的形式参与去批次会将每一个值视为一个批次从而产生极大的冗余计算量

降维、聚类

cds <- reduce_dimension(cds)
## No preprocess_method specified, and aligned coordinates have been computed previously. Using preprocess_method = 'Aligned'
plot_cells(cds, label_groups_by_cluster=FALSE,  color_cells_by = "cao_cell_type")
## No trajectory to plot. Has learn_graph() been called yet?
## Warning: Removed 1 rows containing missing values (geom_text_repel).
image.png
#再次演示一下monocle版的Seurat::FeaturePlot()ciliated_genes <- c("che-1",                    "hlh-17",                    "nhr-6",                    "dmd-6",                    "ceh-36",                    "ham-1")plot_cells(cds,           genes=ciliated_genes,           label_cell_groups=FALSE,           show_trajectory_graph=FALSE)
## Warning: `guides(<scale> = FALSE)` is deprecated. Please use `guides(<scale> =
## "none")` instead.
image.png
#感觉这个配色很丑,但是又没有参数去修改

我们monocle2那里谈过,虽然从计算学的角度来说细胞可以连续地从一种状态过渡到下一种状态,它们之间没有离散的边界,但从生物学上来说并不是所有细胞类型之间都能发生转化,所以Monocle3并不假设数据集中的所有细胞都来自一个共同的转录“祖先”,大家实际拿到的数据中可能有多个不同的轨迹。例如,在应对感染的组织中,组织常驻免疫细胞和基质细胞会有非常不同的初始转录组,对感染的反应也会非常不同,所以它们应该分属于不用的轨迹。因此,monocle3在进行拟时序分析时并不采取单起点的方式。

cds <- cluster_cells(cds)#聚类之后,每一个cluster会自成一个”拟时序轨迹“plot_cells(cds, color_cells_by = "partition")
image.png

这个图里黑色的线就是拟时序走向的背景,带有数字的灰色圈圈代表的是拟时序分支的leaf,他们被黑色的branch_points所隔开

3.2.拟时序分析

cds <- learn_graph(cds)#对每个cluster进行主成分分析,这步以后拟时序图谱就已经初步产生了
plot_cells(cds,           color_cells_by = "cao_cell_type",           label_groups_by_cluster=FALSE,           label_leaves=FALSE,           label_branch_points=FALSE)
## Warning: Removed 1 rows containing missing values (geom_text_repel).
image.png
#轨迹过多,稍微有点让人不适,还是建议大家在Seurat中处理好了之后拿一些特定的细胞类型来做拟时序分析plot_cells(cds,           color_cells_by = "cao_cell_type",           label_cell_groups=FALSE,           label_leaves=TRUE,#展示分支           label_branch_points=TRUE,#展示分支节点           graph_label_size=1.5)
image.png
cds <- order_cells(cds)
plot_cells(cds,           color_cells_by = "pseudotime",           label_cell_groups=FALSE,           label_leaves=FALSE,           label_branch_points=FALSE,           graph_label_size=1.5)
image.png

我们可以看到这个图中有许多灰色的点,这是因为这些单独的轨迹没有被选择root,所以产生了无效的拟时间值,也就是说,如果手动选择,那么每个拟时序的轨迹都需要选择一次root

说实话这个自动弹出交互式窗口的功能让我在写Rmarkdown的时候很抓狂,这个带有交互功能函数目前没法写在html文件中,所以,在我研读了一下源码之后,通过以下这种方式可以编程性选择拟时序的root,下面这个函数会选择最接近你选择细胞的节点作为拟时序的root

names(colData(cds))
## [1] "plate"         "cao_cluster"   "cao_cell_type" "cao_tissue"   
## [5] "Size_Factor"
colData(cds)[,'cao_cell_type'] %>% unique()
##  [1] "Unclassified neurons"     "Germline"                
##  [3] "Intestinal/rectal muscle" "Vulval precursors"       
##  [5] "Coelomocytes"             NA                        
##  [7] "Ciliated sensory neurons" "Failed QC"               
##  [9] "Seam cells"               "Non-seam hypodermis"     
## [11] "Pharyngeal epithelia"     "Touch receptor neurons"  
## [13] "Body wall muscle"         "Cholinergic neurons"     
## [15] "Distal tip cells"         "Other interneurons"      
## [17] "GABAergic neurons"        "Am/PH sheath cells"      
## [19] "Pharyngeal muscle"        "Pharyngeal neurons"      
## [21] "Oxygen sensory neurons"   "Somatic gonad precursors"
## [23] "flp-1(+) interneurons"    "Canal associated neurons"
## [25] "Unclassified glia"        "Pharyngeal gland"        
## [27] "Sex myoblasts"            "Excretory cells"         
## [29] "Dopaminergic neurons"     "Socket cells"            
## [31] "Rectum"
colData(cds)[,'cao_cell_type'] %>% table()
## .
##       Am/PH sheath cells         Body wall muscle Canal associated neurons 
##                      421                    10508                      239 
##      Cholinergic neurons Ciliated sensory neurons             Coelomocytes 
##                     1015                      842                     1358 
##         Distal tip cells     Dopaminergic neurons          Excretory cells 
##                      129                       70                      155 
##                Failed QC    flp-1(+) interneurons        GABAergic neurons 
##                     3483                      224                      400 
##                 Germline Intestinal/rectal muscle      Non-seam hypodermis 
##                     5144                      338                     1268 
##       Other interneurons   Oxygen sensory neurons     Pharyngeal epithelia 
##                      443                      305                      747 
##         Pharyngeal gland        Pharyngeal muscle       Pharyngeal neurons 
##                      271                      332                      314 
##                   Rectum               Seam cells            Sex myoblasts 
##                      121                     3523                      302 
##             Socket cells Somatic gonad precursors   Touch receptor neurons 
##                      358                      355                      334 
##        Unclassified glia     Unclassified neurons        Vulval precursors 
##                      208                     2639                      488
get_earliest_principal_node <- function(cds, my_select="Am/PH sheath cells"){  cell_ids <- which(colData(cds)[, "cao_cell_type"] == my_select)  closest_vertex <-  cds@principal_graph_aux[["UMAP"]]$pr_graph_cell_proj_closest_vertex  closest_vertex <- as.matrix(closest_vertex[colnames(cds), ])  root_pr_nodes <-  igraph::V(principal_graph(cds)[["UMAP"]])$name[as.numeric(names  (which.max(table(closest_vertex[cell_ids,]))))]    root_pr_nodes}cds <- order_cells(cds, root_pr_nodes=get_earliest_principal_node(cds))myselect <- function(cds,select.classify,my_select){  cell_ids <- which(colData(cds)[,select.classify] == my_select)  closest_vertex <-  cds@principal_graph_aux[["UMAP"]]$pr_graph_cell_proj_closest_vertex  closest_vertex <- as.matrix(closest_vertex[colnames(cds), ])  root_pr_nodes <-  igraph::V(principal_graph(cds)[["UMAP"]])$name[as.numeric(names  (which.max(table(closest_vertex[cell_ids,]))))]    root_pr_nodes}cds <- order_cells(cds,                    root_pr_nodes=myselect(cds,select.classify = 'cao_cell_type',                                          my_select = "Body wall muscle")                   )#没问题,行得通

下图可以看的出来,我们制定了”Body wall muscle”为起点后,这群细胞便是紫色的”零值”

plot_cells(cds,           color_cells_by = "pseudotime",           label_cell_groups=FALSE,           label_leaves=FALSE,           label_branch_points=FALSE,           graph_label_size=1.5)|plot_cells(cds,           color_cells_by = "cao_cell_type",           label_cell_groups=FALSE,           label_leaves=FALSE,           label_branch_points=FALSE,           graph_label_size=1.5)
image.png

拟时序中的基因展示

plot_genes_in_pseudotime(cds[1:5,],                         color_cells_by="cao_cell_type",                         min_expr=0.5)
image.png

还有3D版的拟时序,来试试吧

cds_3d <- reduce_dimension(cds, max_components = 3)
## No preprocess_method specified, and aligned coordinates have been computed previously. Using preprocess_method = 'Aligned'
cds_3d <- cluster_cells(cds_3d)
cds_3d <- learn_graph(cds_3d)
## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead

## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead

## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead

## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead

## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead

## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead

## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead

## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead

## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead

## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead

## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead

## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead

## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead

## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead

## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead

## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead

## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead

## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead
## Warning in louvain_clustering(data, pd[row.names(data), ], k = k, weight = weight, : RANN counts the point itself, k must be smaller than
## the total number of points - 1 (all other points) - 1 (itself)!
## Warning in igraph::graph.dfs(stree_ori, root = root_cell, neimode = "all", :
## Argument `neimode' is deprecated; use `mode' instead
cds <- order_cells(cds,                    root_pr_nodes=myselect(cds,select.classify = 'cao_cell_type',                                          my_select = "Body wall muscle")                   )cds_3d_plot_obj <- plot_cells_3d(cds_3d, color_cells_by="cao_cell_type")
## Warning in RColorBrewer::brewer.pal(N, "Set2"): n too large, allowed maximum for palette Set2 is 8
## Returning the palette you asked for with that many colors

遗憾的是这个函数不能通过拟时间染色

cds_3d_plot_obj

![image.png](https://upload-images.jianshu.io/upload_images/28196887-9df635947263e119.png?imageMogr2/auto-orient/strip%7CimageView2/2/w/1240)

欢迎关注同名公众号~







©著作权归作者所有,转载或内容合作请联系作者
  • 序言:七十年代末,一起剥皮案震惊了整个滨河市,随后出现的几起案子,更是在滨河造成了极大的恐慌,老刑警刘岩,带你破解...
    沈念sama阅读 215,294评论 6 497
  • 序言:滨河连续发生了三起死亡事件,死亡现场离奇诡异,居然都是意外死亡,警方通过查阅死者的电脑和手机,发现死者居然都...
    沈念sama阅读 91,780评论 3 391
  • 文/潘晓璐 我一进店门,熙熙楼的掌柜王于贵愁眉苦脸地迎上来,“玉大人,你说我怎么就摊上这事。” “怎么了?”我有些...
    开封第一讲书人阅读 161,001评论 0 351
  • 文/不坏的土叔 我叫张陵,是天一观的道长。 经常有香客问我,道长,这世上最难降的妖魔是什么? 我笑而不...
    开封第一讲书人阅读 57,593评论 1 289
  • 正文 为了忘掉前任,我火速办了婚礼,结果婚礼上,老公的妹妹穿的比我还像新娘。我一直安慰自己,他们只是感情好,可当我...
    茶点故事阅读 66,687评论 6 388
  • 文/花漫 我一把揭开白布。 她就那样静静地躺着,像睡着了一般。 火红的嫁衣衬着肌肤如雪。 梳的纹丝不乱的头发上,一...
    开封第一讲书人阅读 50,679评论 1 294
  • 那天,我揣着相机与录音,去河边找鬼。 笑死,一个胖子当着我的面吹牛,可吹牛的内容都是我干的。 我是一名探鬼主播,决...
    沈念sama阅读 39,667评论 3 415
  • 文/苍兰香墨 我猛地睁开眼,长吁一口气:“原来是场噩梦啊……” “哼!你这毒妇竟也来了?” 一声冷哼从身侧响起,我...
    开封第一讲书人阅读 38,426评论 0 270
  • 序言:老挝万荣一对情侣失踪,失踪者是张志新(化名)和其女友刘颖,没想到半个月后,有当地人在树林里发现了一具尸体,经...
    沈念sama阅读 44,872评论 1 307
  • 正文 独居荒郊野岭守林人离奇死亡,尸身上长有42处带血的脓包…… 初始之章·张勋 以下内容为张勋视角 年9月15日...
    茶点故事阅读 37,180评论 2 331
  • 正文 我和宋清朗相恋三年,在试婚纱的时候发现自己被绿了。 大学时的朋友给我发了我未婚夫和他白月光在一起吃饭的照片。...
    茶点故事阅读 39,346评论 1 345
  • 序言:一个原本活蹦乱跳的男人离奇死亡,死状恐怖,灵堂内的尸体忽然破棺而出,到底是诈尸还是另有隐情,我是刑警宁泽,带...
    沈念sama阅读 35,019评论 5 340
  • 正文 年R本政府宣布,位于F岛的核电站,受9级特大地震影响,放射性物质发生泄漏。R本人自食恶果不足惜,却给世界环境...
    茶点故事阅读 40,658评论 3 323
  • 文/蒙蒙 一、第九天 我趴在偏房一处隐蔽的房顶上张望。 院中可真热闹,春花似锦、人声如沸。这庄子的主人今日做“春日...
    开封第一讲书人阅读 31,268评论 0 21
  • 文/苍兰香墨 我抬头看了看天上的太阳。三九已至,却和暖如春,着一层夹袄步出监牢的瞬间,已是汗流浃背。 一阵脚步声响...
    开封第一讲书人阅读 32,495评论 1 268
  • 我被黑心中介骗来泰国打工, 没想到刚下飞机就差点儿被人妖公主榨干…… 1. 我叫王不留,地道东北人。 一个月前我还...
    沈念sama阅读 47,275评论 2 368
  • 正文 我出身青楼,却偏偏与公主长得像,于是被迫代替她去往敌国和亲。 传闻我的和亲对象是个残疾皇子,可洞房花烛夜当晚...
    茶点故事阅读 44,207评论 2 352

推荐阅读更多精彩内容