【病毒组tips】在服务器上下载IMG/VR数据库

  1. 首先打开IMG/VR数据库地址,注册一个自己的账户;
  2. 获取自己的账户cookies到当前下载的目录
#把自己的账号密码替换一下
curl 'https://signon.jgi.doe.gov/signon/create' --data-urlencode 'login=【自己的账号】' --data-urlencode 'password=【自己的密码】' -c cookies > $PWD
  1. 使用自己的cookies进行下载(核心蛋白文件,核酸序列,分类表,宿主信息)
curl -C - -b cookies -o IMGVR_all_proteins-high_confidence.faa.gz 'https://genome.jgi.doe.gov/portal/ext-api/downloads/get_tape_file?blocking=true&url=/IMG_VR/download/_JAMO/63a22c8a3b5d0133c73fb0a2/IMGVR_all_proteins-high_confidence.faa.gz'
curl -C - -b cookies -o IMGVR_all_nucleotides-high_confidence.fna.gz 'https://genome.jgi.doe.gov/portal/ext-api/downloads/get_tape_file?blocking=true&url=/IMG_VR/download/_JAMO/63a22c8a3b5d0133c73fb0a0/IMGVR_all_nucleotides-high_confidence.fna.gz'
curl -C - -b cookies -o IMGVR_all_Sequence_information-high_confidence.tsv 'https://genome.jgi.doe.gov/portal/ext-api/downloads/get_tape_file?blocking=true&url=/IMG_VR/download/_JAMO/63a22c8a3b5d0133c73fb0a4/IMGVR_all_Sequence_information-high_confidence.tsv'
curl -C - -b cookies -o IMGVR_all_Host_information-high_confidence.tsv 'https://genome.jgi.doe.gov/portal/ext-api/downloads/get_tape_file?blocking=true&url=/IMG_VR/download/_JAMO/63a22c8a3b5d0133c73fb0a6/IMGVR_all_Host_information-high_confidence.tsv'
#公共服务器建议删掉cookies,自己的服务器无所谓
rm cookies

4.可以对比一下MD5信息

md5sum *
File_name MD5
IMGVR_all_Host_information-high_confidence.tsv 71b54d0f5c186d813f058bf0379dfd24
IMGVR_all_nucleotides-high_confidence.fna.gz 83301c9c6dfefea3305a53ee2a41bac3
IMGVR_all_proteins-high_confidence.faa.gz 19e266b87ec7ca96fe586aed172438fe
IMGVR_all_Sequence_information-high_confidence.tsv 3c516db128082fa29dc2c2f60520da1b

PS:服务器似乎不支持断点再续,和多线程下载,若网络问题重新下载需要删除源文件,建议白天下载,晚上下载速度较慢。

©著作权归作者所有,转载或内容合作请联系作者
平台声明:文章内容(如有图片或视频亦包括在内)由作者上传并发布,文章内容仅代表作者本人观点,简书系信息发布平台,仅提供信息存储服务。

推荐阅读更多精彩内容