1。首先去下面的这个网站
http://software.broadinstitute.org/gsea/msigdb/collections.jsp#C3
2.下载tft文件(entrez ID那个)
3.用clusterprofilter做富集,首先要读进去该文件
library(clusterProfiler)
library(GSEABase)
data(geneList, package="DOSE")
gene <- names(geneList)
c5 <- read.gmt("c3.tft.v6.2.entrez.gmt")
egmt <- enricher(gene, TERM2GENE=c5)
head(egmt)
egmt2=data.frame(egmt)
egmt3 <- GSEA(geneList, TERM2GENE=c5, verbose=FALSE)
head(egmt3)
barplot(egmt, drop=TRUE, showCategory=12)
两个破折号中间的为转录因子,一共就三种形式,红框为TF,unkonwn去除就行,
unkown表示有些motif 未对应已知转录因子啊