Installing and Trying Out Trimmomatic




Installation Environment

Ubuntu 18.10
Trimmomatic Version 0.38: binary


Installation

Download the Trimmomatic binary from http://www.usadellab.org/cms/index.php?page=trimmomatic and unzip it to obtain trimmomatic-0.38.jar.
Every operation is run by invoking this jar with java.
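Since everything depends on the jar and a working java, a quick pre-flight check can save a confusing error later. The following is a minimal sketch; `check_trimmomatic` is a hypothetical helper name, not something shipped with Trimmomatic.

```shell
# Hypothetical helper: report whether the pieces needed to run Trimmomatic are present.
# Argument: path to the Trimmomatic jar (for this post, trimmomatic-0.38.jar).
check_trimmomatic() {
    jar="$1"
    # The jar must exist as a regular file.
    if [ ! -f "$jar" ]; then
        echo "missing jar: $jar"
        return 1
    fi
    # java must be reachable through PATH.
    if ! command -v java >/dev/null 2>&1; then
        echo "java not found on PATH"
        return 2
    fi
    echo "ok: $jar"
}
```

Calling `check_trimmomatic ./trimmomatic-0.38.jar` from the unzipped directory prints an `ok:` line when both the jar and java are available.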


Trying Out Trimmomatic

Paired End:

java -jar trimmomatic-0.38.jar PE -phred33 input_forward.fq.gz input_reverse.fq.gz output_forward_paired.fq.gz output_forward_unpaired.fq.gz output_reverse_paired.fq.gz output_reverse_unpaired.fq.gz ILLUMINACLIP:TruSeq3-PE.fa:2:30:10 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36

This will perform the following:

Remove adapters (ILLUMINACLIP:TruSeq3-PE.fa:2:30:10)
Remove leading low quality or N bases (below quality 3) (LEADING:3)
Remove trailing low quality or N bases (below quality 3) (TRAILING:3)
Scan the read with a 4-base wide sliding window, cutting when the average quality per base drops below 15 (SLIDINGWINDOW:4:15)
Drop reads shorter than 36 bases (MINLEN:36)
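The sliding-window step is the least obvious of these, so here is a simplified illustration of the idea behind SLIDINGWINDOW:4:15. It is a sketch written for this post, not Trimmomatic's own code, and the real implementation may differ in how it handles the bases at the cut point. The function name `sliding_window_keep` and the space-separated score input are assumptions made for the example.

```shell
# Simplified illustration of SLIDINGWINDOW:<size>:<threshold>; not Trimmomatic's own code.
# Reads space-separated Phred scores on stdin (one read per line) and prints
# how many leading bases are kept.
sliding_window_keep() {
    awk -v size="$1" -v min="$2" '{
        keep = NF                          # default: no window fails, keep everything
        for (i = 1; i + size - 1 <= NF; i++) {
            sum = 0
            for (j = i; j < i + size; j++) sum += $j
            # Cut at the start of the first window whose average is below the threshold.
            if (sum / size < min) { keep = i - 1; break }
        }
        print keep
    }'
}
```

For example, `echo '30 30 30 30 30 2 2 2' | sliding_window_keep 4 15` prints 4: the windows starting at bases 1 to 4 average 30, 30, 23 and 16, and the window starting at base 5 averages 9, so the read is cut before base 5. A read shorter than the window is kept whole in this sketch.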

Single End:

java -jar trimmomatic-0.38.jar SE -phred33 input.fq.gz output.fq.gz ILLUMINACLIP:TruSeq3-SE.fa:2:30:10 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36

This will perform the same steps, using the single-ended adapter file.
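When several single-end files need the same treatment, it can help to generate the commands first and inspect them before running anything. The sketch below only prints the commands. The `JAR` and `ADAPTERS` paths and the `.trimmed.fq.gz` output suffix are assumptions chosen for illustration, and `print_se_commands` is a hypothetical helper name.

```shell
# Dry-run sketch: print (do not execute) one Trimmomatic SE command per input file.
# JAR and ADAPTERS are assumed paths; adjust them to the local installation.
JAR="trimmomatic-0.38.jar"
ADAPTERS="adapters/TruSeq3-SE.fa"

print_se_commands() {
    for in_file in "$@"; do
        # Strip the directory and the .fq.gz suffix to build the output name.
        base=$(basename "$in_file" .fq.gz)
        echo "java -jar $JAR SE -phred33 $in_file ${base}.trimmed.fq.gz ILLUMINACLIP:${ADAPTERS}:2:30:10 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36"
    done
}
```

Running `print_se_commands data/*.fq.gz` lists one command per file. Once they look right, the output can be piped to `sh` to execute them.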

I tried single-end mode with the following command:
```
java -jar trimmomatic-0.38.jar SE simulatedReads.fastq result.fq.gz ILLUMINACLIP:adapters/TruSeq3-SE.fa:2:30:10 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36
TrimmomaticSE: Started with arguments:
simulatedReads.fastq result.fq.gz ILLUMINACLIP:adapters/TruSeq3-SE.fa:2:30:10 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36
Automatically using 4 threads
Using Long Clipping Sequence: 'AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTA'
Using Long Clipping Sequence: 'AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC'
ILLUMINACLIP: Using 0 prefix pairs, 2 forward/reverse sequences, 0 forward only sequences, 0 reverse only sequences
Quality encoding detected as phred33
Input Reads: 1050000 Surviving: 1050000 (100.00%) Dropped: 0 (0.00%)
TrimmomaticSE: Completed successfully
```
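The line worth checking in this output is the `Input Reads:` summary. Here all 1050000 reads survived, meaning no read was dropped as shorter than MINLEN; the line does not say how many reads were trimmed. A small sketch for pulling the three counts out of a saved log follows; `summarize_trim_log` is a hypothetical helper, and it assumes the single-end summary format shown above.

```shell
# Extract "input surviving dropped" read counts from a Trimmomatic SE summary line.
# Assumes the format: Input Reads: N Surviving: N (p%) Dropped: N (p%)
summarize_trim_log() {
    awk '/^Input Reads:/ { print $3, $5, $8 }'
}
```

With the log saved to a file (the name `trim.log` is only an example), `summarize_trim_log < trim.log` prints `1050000 1050000 0` for the run above.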


Introduction

Trimmomatic is a fast, multithreaded command line tool that can be used to trim and crop Illumina (FASTQ) data as well as to remove adapters. These adapters can pose a real problem depending on the library preparation and downstream application.
There are two major modes of the program: Paired end mode and Single end mode. The paired end mode will maintain correspondence of read pairs and also use the additional information contained in paired reads to better find adapter or PCR primer fragments introduced by the library preparation process.
Trimmomatic works with FASTQ files (using phred+33 or phred+64 quality scores, depending on the Illumina pipeline used). Files compressed using either "gzip" or "bzip2" are supported, and are identified by use of ".gz" or ".bz2" file extensions.
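To make the two conventions in the last paragraph concrete, here is a small sketch. It shows how a quality character maps to a score under an offset of 33 or 64, and how a file name can be classified by its extension. Both function names are hypothetical, and the code illustrates the conventions only, not how Trimmomatic implements them.

```shell
# Decode one FASTQ quality character to its Phred score.
# Arguments: the quality character, then the offset (33 or 64).
phred_score() {
    # printf with a leading quote yields the numeric character code.
    code=$(printf '%d' "'$1")
    echo $(( $code - $2 ))
}

# Classify an input file name by its extension, as the text describes.
compression_of() {
    case "$1" in
        *.gz)  echo gzip ;;
        *.bz2) echo bzip2 ;;
        *)     echo none ;;
    esac
}
```

For example, `phred_score I 33` and `phred_score h 64` both print 40, because the same quality is written as `I` (code 73) in phred+33 and as `h` (code 104) in phred+64.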
