Smart seq 2012

题目:Full-Length mRNA-Seq from single cell levels of RNA and individual circulating tumor cells
期刊:Nat Biotechnol.
通讯作者:Rickard Sandberg

1. Background

The question remained whether single-cell transcriptomes faithfully represent the RNA population before amplification and how technical variation limits the power to find differential expression.
This initial mRNA-Seq method also **preferentially amplified the 3′ ends of mRNAs, and hence the data could only be used to identify distal splicing events. ** Recently, a method for multiplexed single-cell RNA-Seq was introduced that quantifies transcripts through reads mapping to mRNA 5′ ends. Neither of these methods generates read coverage across full transcripts.

2. Gap

Since most mammalian multi-exons genes are subject to alternative RNA processing, there is a need for a single-cell transcriptome method that can both quantify gene expression and provide the coverage for efficient detection of transcript variants and alleles.

3. Aims

In this study, we introduce a single-cell RNA-Sequencing protocol with markedly improved transcriptome coverage, which samples cDNAs from more than just the ends of mRNAs.

4. Approaches

Smart-seq protocol: For Smart-Seq, first we lysed each cell in hypotonic solution and converted poly(A)+ RNA to full-length cDNA using oligo(dT) priming and SMART template switching technology, followed by 12‐18 cycles of PCR preamplification of cDNA. To enable gene and mRNA isoform expression analyses in single cells, a novel full-transcriptome mRNA-Seq protocol (Smart-Seq) was developed. Smart-Seq makes use of SMART™ template switching technology for the generation of full-length cDNAs and only 12 to 18 cycles of PCR following the initial cDNA synthesis steps. The amplified cDNA was used to construct standard Illumina sequencing libraries using either Covaris shearing followed by **ligation of adaptors (PE) or Tn5-mediated “tagmentation” **using the Nextera technology (Tn5). Both of these library preparation methods enable random shotgun sequencing of cDNAs.

workflow

5. Results

5.1 Smart-Seq read coverage across transcripts

Smart-Seq read coverage across transcripts

5.2 Quantitative assessment of single-cell transcriptomics

Aim :Analyses of gene expression from millions of cells using mRNA-Seq is highly reproducible and has low technical variation. So far, no single-cell mRNA-Seq study has measured the technical variation intrinsic to the cDNA pre-amplification components of single-cell methods.
Method: We therefore diluted microgram amounts of reference total RNA down to nano- and picogram levels and applied Smart-Seq to assess sensitivity, technical variability and detection of differentially expressed transcripts of Smart-Seq on low amounts of total RNA. For comparison, standard mRNA-Seq libraries were generated from 100 ng to microgram levels of reference total RNA.

5.2.1 the sensitivity of the method in detecting transcripts present at different expression levels

sensitivity

Starting with 10 ng or 1 ng of total RNA, we found no or minimal decline in sensitivity compared with standard mRNA-Seq. However, lowering the starting amounts to single-cell levels decreased the detection rate of less abundant transcripts (Fig. 2a). Analyses of the twelve cancer cell line cells (four cells each from the LNCaP, PC3 and T24 lines) showed that ~76% of transcripts expressed at 10 RPKM (reads per kilobase exon model and million mappable reads), an expression level that roughly equals the median expression level for detected transcripts, were reproducibly detected in all single-cell profiles (Fig. 2b).
summary:Transcript detection sensitivity is affected by limiting starting amounts of RNA that lead to random loss of low abundance transcripts, but still the majority of low abundance and the vast majority of highly expressed transcripts are reliably detected even in single cells.

5.2.2 the reproducibility in expression levels generated from diluted RNA and individual cells.

expression level estimation with Smart-Seq (lower oocyte to oocyte variability)

Correlation analyses

Correlation analyses between technical replicates of diluted RNA showed increasing concordance with larger amounts of RNA. Comparing the single cells against the RNA dilution, we observed higher correlations (Pearson correlations of 0.75–0.85) among individual cells of the same type than among dilution replicates at 10 pg (Pearson correlations of 0.65–0.75).
variability

Since variability in measurements of expression levels depends on transcript expression levels, we computed the variability as a function of the expression level (Fig. 2c,d). This analysis showed that Smart-Seq on 10 ng total RNA had the same technical variability as standard mRNA-Seq and that Smart-Seq on 1 ng total RNA showed only a modest increase in technical noise (Fig. 2c). When lowering input amounts down to picogram levels, there was a clear increase in technical variability, particularly for less abundantly expressed transcripts (Fig. 2c). The levels of technical variability at picogram levels of total RNA were compared to the biological variation found in comparisons of human brain and UHRR using standard mRNA-Seq (Fig. 2c, green line). Interestingly, analyses of variation between individual cancer cells of different origin revealed extensive biological variation in highly expressed genes (Fig. 2d).

5.2.3 whether pre-amplified single-cell expression profiles were representative of the original expression profiles.

Spearman correlations between standard mRNA-Seq and those estimated from Smart-Seq

Comparing relative gene expression levels (UHRR - brain) estimated using **standard mRNA-Seq to those estimated from Smart-Seq **with different amounts of input RNA, we again found a high concordance (Fig. 2e–g). Starting with 1 ng or 100 pg total RNA, the relative expression in Smart-Seq and standard mRNA-Seq respectively had Spearman correlations of 0.87 and 0.77 (Fig. 2e,f). Comparisons with 10 pg input RNA showed overall good correlation (Fig. 2g), but identified two populations of transcripts with distorted expression in Smart-Seq data from either human brain or UHRR, reflecting stochastic losses, mostly of low abundance transcripts when starting with such minute RNA of levels (Fig. 2g and Fig. 2a).
Analyses of GC and length biases in Smart-Seq and mRNA-Seq data

Pre-amplification of cDNA could also lead to disproportionate amplification of short transcripts, but we found no systematic bias (Supplementary Fig. 7). A previous microarray study analyzed PCR amplified cDNA (from picogram levels) and found the transcriptome overall preserved, but skewed.

Together, these results demonstrated that transcriptome analyses from few or single cells, in general, preserved relative expression level differences for detected transcripts.

5.3 Analyses of transcriptional and post-transcriptional (alternatively spliced exons) differences from single-cells.

Transcriptional and post-transcriptional analyses of cancer cell line cells using Smart-Seq

Conclude that Smart-Seq significantly improves our ability to detect alternative RNA processing in single cells.

5.4 Analyses of circulating tumor cell transcriptomes

Aim: whether global transcriptome analyses of putative circulating tumor cells (CTCs) could reveal their tumor of origin and provide data to support the use of this method for unbiased cancer-specific biomarker identification.
Method: generated transcriptomes from NG2+ putative melanoma circulating tumor cells (CTCs) isolated from peripheral blood drawn from a patient with recurrent melanoma using immunomagnetic purification with a MagSweeper instrument (Illumina Inc.) For comparison, we also generated Smart-Seq libraries from single cells derived from primary melanocytes (PMs, n=2), melanoma cancer cell line (SKMEL5, n=4 and UACC257, n=3) cells and from human embryonic stem cells (ESCs, n=8). Since the NG2+ putative CTCs were isolated from blood, it was important to compare them to blood cells.
**The putative CTCs were distinct from lymphoma cell lines (BL41 and BJAB)13 and immune tissues (lymphnode and white blood cell samples), as well as embryonic stem cells, and instead showed high similarity to PMs and melanoma cell line cells. **
results:
Unsupervised hierarchical clustering and correlation analyses of gene expression levels showed a clear clustering of cells according to cell type of origin;
Further support for the melanocytic origin of the putative melanoma CTCs came from analyses of melanocyte lineage specific markers, as all NG2+ cells expressed high levels of MLANA14, TYR15 and the melanocyte specific m-form of MITF16 but not immune markers such as PTPRC, in contrast to peripheral blood lymphocytes. Furthermore, NG2+ cells expressed high levels of melanoma-associated genes (based on our unbiased selection of the 100 transcripts most strongly associated with melanoma, see Methods), but not immune cell-associated genes selected in a similar manner.
Thus, both their global transcriptomes and expression patterns of melanoma-associated transcripts clearly support a melanocyte origin for the NG2+ cells putative melanoma CTCs.
Smart-Seq enables screening for SNPs and mutations in transcribed regions using only few cells.

6. Novelty and significance

Generating high-coverage transcriptomes from single cells and small numbers of cells.
Importantly, Smart-Seq has significantly improved read coverage across transcripts, which enables detailed analyses of alternative splicing and identification of SNPs and mutations.

7. Problems

The coverage is uneven, preferring the 3‘ end of the transcripts

最后编辑于
©著作权归作者所有,转载或内容合作请联系作者
  • 序言:七十年代末,一起剥皮案震惊了整个滨河市,随后出现的几起案子,更是在滨河造成了极大的恐慌,老刑警刘岩,带你破解...
    沈念sama阅读 205,033评论 6 478
  • 序言:滨河连续发生了三起死亡事件,死亡现场离奇诡异,居然都是意外死亡,警方通过查阅死者的电脑和手机,发现死者居然都...
    沈念sama阅读 87,725评论 2 381
  • 文/潘晓璐 我一进店门,熙熙楼的掌柜王于贵愁眉苦脸地迎上来,“玉大人,你说我怎么就摊上这事。” “怎么了?”我有些...
    开封第一讲书人阅读 151,473评论 0 338
  • 文/不坏的土叔 我叫张陵,是天一观的道长。 经常有香客问我,道长,这世上最难降的妖魔是什么? 我笑而不...
    开封第一讲书人阅读 54,846评论 1 277
  • 正文 为了忘掉前任,我火速办了婚礼,结果婚礼上,老公的妹妹穿的比我还像新娘。我一直安慰自己,他们只是感情好,可当我...
    茶点故事阅读 63,848评论 5 368
  • 文/花漫 我一把揭开白布。 她就那样静静地躺着,像睡着了一般。 火红的嫁衣衬着肌肤如雪。 梳的纹丝不乱的头发上,一...
    开封第一讲书人阅读 48,691评论 1 282
  • 那天,我揣着相机与录音,去河边找鬼。 笑死,一个胖子当着我的面吹牛,可吹牛的内容都是我干的。 我是一名探鬼主播,决...
    沈念sama阅读 38,053评论 3 399
  • 文/苍兰香墨 我猛地睁开眼,长吁一口气:“原来是场噩梦啊……” “哼!你这毒妇竟也来了?” 一声冷哼从身侧响起,我...
    开封第一讲书人阅读 36,700评论 0 258
  • 序言:老挝万荣一对情侣失踪,失踪者是张志新(化名)和其女友刘颖,没想到半个月后,有当地人在树林里发现了一具尸体,经...
    沈念sama阅读 42,856评论 1 300
  • 正文 独居荒郊野岭守林人离奇死亡,尸身上长有42处带血的脓包…… 初始之章·张勋 以下内容为张勋视角 年9月15日...
    茶点故事阅读 35,676评论 2 323
  • 正文 我和宋清朗相恋三年,在试婚纱的时候发现自己被绿了。 大学时的朋友给我发了我未婚夫和他白月光在一起吃饭的照片。...
    茶点故事阅读 37,787评论 1 333
  • 序言:一个原本活蹦乱跳的男人离奇死亡,死状恐怖,灵堂内的尸体忽然破棺而出,到底是诈尸还是另有隐情,我是刑警宁泽,带...
    沈念sama阅读 33,430评论 4 321
  • 正文 年R本政府宣布,位于F岛的核电站,受9级特大地震影响,放射性物质发生泄漏。R本人自食恶果不足惜,却给世界环境...
    茶点故事阅读 39,034评论 3 307
  • 文/蒙蒙 一、第九天 我趴在偏房一处隐蔽的房顶上张望。 院中可真热闹,春花似锦、人声如沸。这庄子的主人今日做“春日...
    开封第一讲书人阅读 29,990评论 0 19
  • 文/苍兰香墨 我抬头看了看天上的太阳。三九已至,却和暖如春,着一层夹袄步出监牢的瞬间,已是汗流浃背。 一阵脚步声响...
    开封第一讲书人阅读 31,218评论 1 260
  • 我被黑心中介骗来泰国打工, 没想到刚下飞机就差点儿被人妖公主榨干…… 1. 我叫王不留,地道东北人。 一个月前我还...
    沈念sama阅读 45,174评论 2 352
  • 正文 我出身青楼,却偏偏与公主长得像,于是被迫代替她去往敌国和亲。 传闻我的和亲对象是个残疾皇子,可洞房花烛夜当晚...
    茶点故事阅读 42,526评论 2 343

推荐阅读更多精彩内容