RPM(CPM)/RPKM/FPKM/TPM

RPM/RPKM/FPKM/TPM是我们在定义表达量时常用的几种计算方式,那么究竟有什么区别呢?

RPM/CPM

RPM/CPM: Reads/Counts of exon model per million mapped reads
Calculate Formula:
RPM=Total exon reads/ Mapped reads(Millions)

We can get the decision easily: The longer the gene, the greater the number of reads.
So, we calculate the RPKM to exclude the effect of gene length

RPKM

RPKM: Reads Per Kilobase of exon model per Million mapped reads
Range of Use: Single-end RNA-seq
Calculate Formula:
RPKM=Total exon reads/[Mapped reads(Millions)*Exon length(Kb)]

Example of Calculating RPKM

Gene B is twice as long as gene A, and that might explain why it always gets twice as many reads, regardless of replicate.
Sample3 has way more reads than other replicates, regardless of the gene.
RPKM-Step1:normalize for Read Depth

For the purpose of this 4 gene examples, we’re scaling the total read counts by 10 instead of 1,000,000.
Originally,1,000,000 was picked just because it made the numbers look nice.(i.e. they didn’t require too many decimal places)

RPM-scaled using the ‘per million’ factors.

RPKM-Step2:normalize for gene length

Reads are scaled for depth(M) and gene length(K).

FPKM

RPKM and FPKM-two very closely related terms

RPKM=Reads Per Kilobase Million
FPKM=Fragments per Kilobase Million
RPKM is for single-end RNA-seq.
FPKM is for paired-end RNA-seq.
Differences
针对Single-end RPKM与FPKM基本没有差异
针对Paired-end,如果一对paired-read都比对上那么FPKM计算方法中认为这一对read为一个fragment(RPKM则计为2),如果一对中仅有一个比对上,则将比对上的计为一个fragment.

TPM

TPM is like RPKM and FPKM, except the order of operation is switched.



因此比对TPM和FPKM的公式可以发现,FPKM的分母没有考虑基因长度的影响,所以TPM更加符合我们对相对表达量的定义。

Example of Calculating TPM
TPM-Step1:Normalize for gene length

RPK-scaled by gene length

TPM-Step2:normalize for sequencing depth

TPM-scaled by gene length and sequencing depth(M)

RPKM vs TPM

With TPM, everyone gets the same sized pie

最后编辑于
©著作权归作者所有,转载或内容合作请联系作者
平台声明:文章内容(如有图片或视频亦包括在内)由作者上传并发布,文章内容仅代表作者本人观点,简书系信息发布平台,仅提供信息存储服务。

推荐阅读更多精彩内容