Tutorial: http://asia.ensembl.org/info/docs/tools/vep/script/vep_download.html
docker pull ensemblorg/ensembl-vep
docker run -ti ensemblorg/ensembl-vep ./vep (test that the image was downloaded and runs)
mkdir /home/vep_data
chmod a+rwx /home/vep_data
docker run -tiv /home/vep_data:/opt/vep/.vep ensemblorg/ensembl-vep perl INSTALL.pl (follow the prompts to download the cache and so on; far too long-winded, gave up!)
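While that container is still running, open a new terminal and take a look at the structure of the image
docker ps (look up the container_id)
docker exec -it container_id bash (go inside the container and look at what is under examples)
docker stop container_id (shut it down)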
docker run -t -i -v /home/vep_data:/opt/vep/.vep ensemblorg/ensembl-vep perl INSTALL.pl -a cfp -s homo_sapiens -y GRCh37 -g all (downloads everything automatically, but it is far too slow! Gave up.)
(-a (or --AUTO) takes the values c (cache), f (FASTA) and p (plugins). To include the VEP plugins, add 'p' to the -a flag and also pass the --PLUGINS (or -g) flag.)
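Local folder layout: under the data path shared with the container, create input, output and Plugins folders (the names and paths are only suggestions, use whatever you like)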
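The layout above could be created in one go. This is only a minimal sketch: make_vep_layout is a hypothetical helper name, and the fasta folder is included only because a later step unpacks the FASTA data there.

```shell
# make_vep_layout DIR: create the suggested sub-folders under the shared data path.
# (make_vep_layout is a hypothetical helper, not part of VEP or Docker.)
make_vep_layout() {
    data_dir="$1"
    # input/output/Plugins are the suggested names; fasta is used by a later step
    mkdir -p "$data_dir/input" "$data_dir/output" "$data_dir/Plugins" "$data_dir/fasta"
    # the container writes its results to output, so keep it world-writable
    # like the parent folder
    chmod a+rwx "$data_dir" "$data_dir/output"
}
```

For the paths used in these notes that would be: make_vep_layout /home/vep_data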
1) Download the cache: ftp://ftp.ensembl.org/pub/release-96/variation/VEP/homo_sapiens_vep_96_GRCh38.tar.gz
Unpack it under /home/vep_data to get the homo_sapiens folder: /home/vep_data/homo_sapiens/96_GRCh38
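On the command line, the unpacking part of step 1 might look like the sketch below. It assumes tar is installed and that the downloaded cache is a gzipped tar archive; unpack_vep_cache is a hypothetical helper, not something VEP provides.

```shell
# unpack_vep_cache ARCHIVE DIR: unpack a downloaded VEP cache archive under DIR.
# (unpack_vep_cache is a hypothetical helper; ARCHIVE is assumed to be a gzipped tar.)
unpack_vep_cache() {
    archive="$1"
    data_dir="$2"
    mkdir -p "$data_dir"
    tar -xzf "$archive" -C "$data_dir" || return 1
    # the human cache should now sit under DIR/homo_sapiens/<release>_<assembly>;
    # ls fails (and so does the function) if nothing was unpacked there
    ls -d "$data_dir"/homo_sapiens/*
}

# Typical use, after fetching the archive from the URL in step 1 (e.g. with curl -O):
#   unpack_vep_cache "$downloaded_archive" /home/vep_data
```

If everything went well, the function prints the unpacked cache folder, which for this release should be /home/vep_data/homo_sapiens/96_GRCh38.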
2) Download the FASTA data
Unpack it under /home/vep_data/fasta
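A sketch of the unpacking in step 2, under stated assumptions: the notes do not give the FASTA URL, so none is filled in here; the download is assumed to be a plain gzip file (.fa.gz); and unpack_fasta is a hypothetical helper. The VEP command further down expects the result to be named Homo_sapiens.GRCh38.dna.primary_assembly.fa.

```shell
# unpack_fasta GZ_FILE FASTA_DIR: decompress a downloaded .fa.gz into FASTA_DIR,
# leaving the original archive in place. (unpack_fasta is a hypothetical helper.)
unpack_fasta() {
    gz_file="$1"
    fasta_dir="$2"
    mkdir -p "$fasta_dir"
    # strip the directory and the .gz suffix to get the output file name
    out="$fasta_dir/$(basename "$gz_file" .gz)"
    gzip -dc "$gz_file" > "$out" || return 1
    # print the path of the decompressed file
    echo "$out"
}
```

For example: unpack_fasta Homo_sapiens.GRCh38.dna.primary_assembly.fa.gz /home/vep_data/fasta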
3) Download the plugins from GitHub, https://github.com/Ensembl/VEP_plugins, for example G2P.pm.
See https://asia.ensembl.org/info/docs/tools/vep/script/vep_plugins.html#g2p and download the G2P .csv files from https://www.ebi.ac.uk/gene2phenotype/downloads
Put G2P.pm and the two G2P .csv files into /home/vep_data/Plugins.
Try it out:
docker run -u root -t -i -v /home/vep_data:/opt/vep/.vep ensemblorg/ensembl-vep \
./vep --cache --offline --format vcf --force_overwrite \
--dir_cache /opt/vep/.vep/ \
--hgvs --fasta /opt/vep/.vep/fasta/Homo_sapiens.GRCh38.dna.primary_assembly.fa \
--input_file /opt/vep/.vep/input/input.vcf \
--output_file /opt/vep/.vep/output/output.vcf \
--dir_plugins /opt/vep/.vep/Plugins \
--plugin G2P,file=/opt/vep/.vep/Plugins/CancerG2P.csv,html_report=/opt/vep/.vep/output/report.html,txt_report=/opt/vep/.vep/output/report.txt \
--assembly GRCh38
(--vcf writes the output in VCF format.)
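Because the run uses --offline, every file the command names has to be in place under the shared folder before the container starts. A quick pre-flight check could look like the sketch below; check_vep_inputs is a hypothetical helper, and the relative paths simply mirror the ones in the command above.

```shell
# check_vep_inputs DIR: report anything the VEP command expects under DIR
# that is not there yet. (check_vep_inputs is a hypothetical helper.)
check_vep_inputs() {
    data_dir="$1"
    missing=0
    for path in \
        homo_sapiens/96_GRCh38 \
        fasta/Homo_sapiens.GRCh38.dna.primary_assembly.fa \
        input/input.vcf \
        Plugins/G2P.pm \
        Plugins/CancerG2P.csv \
        output
    do
        if [ ! -e "$data_dir/$path" ]; then
            echo "missing: $data_dir/$path" >&2
            missing=1
        fi
    done
    # 0 if everything was found, 1 otherwise
    return "$missing"
}
```

Running check_vep_inputs /home/vep_data before the docker run command prints each missing path to stderr and returns a non-zero status if anything is absent.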
report.txt looks like this:
report.html looks like this: