[TOC]
安装环境
Ubuntu18.10
这是这个工具的官网https://www.ncbi.nlm.nih.gov/IEB/ToolBox/CPP_DOC/lxr/source/src/app/dustmasker/
在官网的介绍
DustMasker is a program that identifies and masks out low complexity parts of a genome using a new and improved DUST algorithm. The main advantages of the new algorithm are symmetry with respect to taking reverse complements, context insensitivity, and much better performance The new DUST algorithm is described in [1]. Please cite this paper in any publication that uses DustMasker.
试用
dustmasker -in sample.fa -out result.txt
输入
DH10BmutV2_3989116_1_0_0_1_1_456191
CGCATTAAGCATGATCATCATCAGCGGTGATTCCGCCGCCAGCTCAAACTTCAGGCCGCGCGGTGTCTGGTGGTGCGGTTTATCCAGCAGCAGGTTGCTCATTTGCATCCACTCCGCCAGATTTGCTCCATGCTGACCATCAGCGGTGTGGGCGACAGACCCAGCGGTGAGTTCACAAAGAGCGGGTCGTCAAACCTCGCTCTTAACT
DH10BmutV2_3775598_0_0_1_2_2_346057
ACGCGTGGCAAATGCACCATCGTGACTTCGGGAATATCCGGTGGATTGCCAAGAATGATGTCATCCGCGTGTACATGGGCGCCACGGCACCCAGCTGCGCCAGCCAGTTTCTTGTTTGCCGCTCCCACACCACGCCCAGATCGCCAAGGATAGCGAACGGACGATTCAAATCCGCAGAGACTTTCTGGTTGATAGTCGGTTTGA
DH10BmutV2_1886214_1_0_3_0_1_158464
AACTGAATGAGGTTGAGTTCCGTGCAGAAGCGAATCCGGCACTGCATCCGGGGCCAATCCACAGCGATTTATCTGAAAGGTGAACGTATTGGTGTTTGTTGCGGGTTGTTCATCCTGAACTGGAACGTAAACTGGATCTTAACGGTCGCACTCTGGTGTTCGAACTGGAGTGGAACAAG
DH10BmutV2_1517991_1_0_0_0_1_91241
TGATAATGACTCATGATGCCAGTTCGCTTACTGAGCCAGCAGAACGCATAACAACAACAGGGTAATATTTTCCAGATGTTGCACCTGCAGGAGCGTTAACCCGCACATAACGCATGCCACGCTTATCAACAAAGTCTGTTTTACTGACCGCGTTAATGTTGTTCAGGAAGCA
pSFO157_102209_1_0_1_1_0_21456
CGGTGAAAGAATGTCTTTCGGCTTGTTCATGAATGACTCTGTGATGGTTTTCCGGTTTCACCGGTCGCCCATTCCGTGGTGCCGTCATGTTTTCAGGCCGCCATCCCCGCCGGCAGTCACGCCACCGTAATATTTTGTCTCGTCCGGATTCGCGTTATAGCCGGGGATACTCTCCTGTGGCTTAAAGCCCT
DH10BmutV2_2623800_1_0_2_0_1_387962
TCCTTAACTGTATGAAATTGGGATACAACAGGTAGCATACCCGCTCACAGAATATGCGGAAGTAAGGATTTAGCATATCTATATACAGAAGGGAAATAATGACATGCAAGATGGAATAAGGGGCGGCATAAGCCACCACCTGTTTCACACAAACGGTTTACTAATA
输出
DH10BmutV2_3989116_1_0_0_1_1_456191
DH10BmutV2_3775598_0_0_1_2_2_346057
DH10BmutV2_1886214_1_0_3_0_1_158464
DH10BmutV2_1517991_1_0_0_0_1_91241
pSFO157_102209_1_0_1_1_0_21456
DH10BmutV2_2623800_1_0_2_0_1_387962