云计算百科
云计算领域专业知识百科平台

【病毒组tips】在服务器上下载IMG/VR数据库

  • 首先打开IMG/VR数据库地址,注册一个自己的账户;
  • 获取自己的账户cookies到当前下载的目录
  • #把自己的账号密码替换一下
    curl 'https://signon.jgi.doe.gov/signon/create' –data-urlencode 'login=【自己的账号】' –data-urlencode 'password=【自己的密码】' -c cookies > $PWD

  • 使用自己的cookies进行下载(核心蛋白文件,核酸序列,分类表,宿主信息)
  • curl -C – -b cookies -o IMGVR_all_proteins-high_confidence.faa.gz 'https://genome.jgi.doe.gov/portal/ext-api/downloads/get_tape_file?blocking=true&url=/IMG_VR/download/_JAMO/63a22c8a3b5d0133c73fb0a2/IMGVR_all_proteins-high_confidence.faa.gz'
    curl -C – -b cookies -o IMGVR_all_nucleotides-high_confidence.fna.gz 'https://genome.jgi.doe.gov/portal/ext-api/downloads/get_tape_file?blocking=true&url=/IMG_VR/download/_JAMO/63a22c8a3b5d0133c73fb0a0/IMGVR_all_nucleotides-high_confidence.fna.gz'
    curl -C – -b cookies -o IMGVR_all_Sequence_information-high_confidence.tsv 'https://genome.jgi.doe.gov/portal/ext-api/downloads/get_tape_file?blocking=true&url=/IMG_VR/download/_JAMO/63a22c8a3b5d0133c73fb0a4/IMGVR_all_Sequence_information-high_confidence.tsv'
    curl -C – -b cookies -o IMGVR_all_Host_information-high_confidence.tsv 'https://genome.jgi.doe.gov/portal/ext-api/downloads/get_tape_file?blocking=true&url=/IMG_VR/download/_JAMO/63a22c8a3b5d0133c73fb0a6/IMGVR_all_Host_information-high_confidence.tsv'
    #公共服务器建议删掉cookies,自己的服务器无所谓
    rm cookies

    4.可以对比一下MD5信息

    md5sum *

    File_nameMD5
    IMGVR_all_Host_information-high_confidence.tsv 71b54d0f5c186d813f058bf0379dfd24
    IMGVR_all_nucleotides-high_confidence.fna.gz 83301c9c6dfefea3305a53ee2a41bac3
    IMGVR_all_proteins-high_confidence.faa.gz 19e266b87ec7ca96fe586aed172438fe
    IMGVR_all_Sequence_information-high_confidence.tsv 3c516db128082fa29dc2c2f60520da1b

    PS:服务器似乎不支持断点再续,和多线程下载,若网络问题重新下载需要删除源文件,建议白天下载,晚上下载速度较慢。

    赞(0)
    未经允许不得转载:网硕互联帮助中心 » 【病毒组tips】在服务器上下载IMG/VR数据库
    分享到: 更多 (0)

    评论 抢沙发

    评论前必须登录!