1. metabolisHMM: Phylogenomic analysis for exploration of microbial phylogenies and metabolic pathways
Elizabeth A McDaniel, Karthik Anantharaman, Katherine McMahon (University of Wisconsin MAdison)
Advances in high-throughput sequencing technologies and bioinformatic pipelines have exponentially increased the amount of data that can be obtained from uncultivated microbial lineages inhabiting diverse ecosystems. Various annotation tools and databases currently exist for predicting the functional potential of sequenced genomes or microbial communities based upon sequence identity. However, intuitive, reproducible, and user-friendly tools for further exploring and visualizing functional guilds of microbial community metagenomic sequencing datasets remains lacking. Here, we present metabolisHMM, a series of workflows for visualizing the distribution of curated and user-provided Hidden Markov Models (HMMs) to understand metabolic characteristics and evolutionary histories of microbial lineages. metabolisHMM performs functional annotations with a set of curated or user-defined HMMs to 1) construct ribosomal protein and single marker gene phylogenies, 2) summarize the presence/absence of metabolic pathway markers, and 3) create heatmap visualizations of presence/absence summaries. Availability and Implementation: metabolisHMM is freely available on Github at https://github.com/elizabethmcd/metabolisHMM and on PyPi at https://pypi.org/project/metabolisHMM/ under the GNU General Public License v3.0.
2. Massively parallel, time-resolved single-cell RNA sequencing with scNT-Seq
Qi Qiu, Peng Hu, Kiya W Govek, Pablo G Camara, Hao Wu (University of Pennsylvania)
Single-cell RNA sequencing offers snapshots of whole transcriptomes but obscures the temporal dynamics of RNA biogenesis and decay. Here we present single-cell nascent transcript tagging sequencing (scNT-Seq), a method for massively parallel analysis of nascent and pre-existing RNAs from the same cell. This droplet microfluidics-based method enables high-throughput chemical conversion on barcoded beads, efficiently marking metabolically labeled nascent transcripts with T-to-C substitutions. By simultaneously measuring nascent and pre-existing transcriptomes, scNT-Seq reveals neuronal subtype-specific gene regulatory networks and time-resolved RNA trajectories in response to brief (minutes) versus sustained (hours) neuronal activation. Integrating scNT-Seq with genetic perturbation reveals that DNA methylcytosine dioxygenases may inhibit stepwise transition from pluripotent embryonic stem cell state to intermediate and totipotent two-cell-embryo-like (2C-like) states by promoting global nascent transcription. Furthermore, pulse-chase scNT-Seq enables transcriptome-wide measurements of RNA stability in rare 2C-like cells. Time-resolved single-cell transcriptomic analysis thus opens new lines of inquiry regarding cell-type-specific RNA regulatory mechanisms.
3. Ultra-high throughput single-cell RNA sequencing by combinatorial fluidic indexing
Paul Datlinger, André F Rendeiro, Thorina Boenke, Thomas Krausgruber, Daniele Barreca, Christoph Bock (Medical University of Vienna)
Cell atlas projects and single-cell CRISPR screens hit the limits of current technology, as they require cost-effective profiling for millions of individual cells. To satisfy these enormous throughput requirements, we developed “single-cell combinatorial fluidic indexing” (scifi) and applied it to single-cell RNA sequencing. The resulting scifi-RNA-seq assay combines one-step combinatorial pre-indexing of single-cell transcriptomes with subsequent single-cell RNA-seq using widely available droplet microfluidics. Pre-indexing allows us to load multiple cells per droplet, which increases the throughput of droplet-based single-cell RNA-seq up to 15-fold, and it provides a straightforward way of multiplexing hundreds of samples in a single scifi-RNA-seq experiment. Compared to multi-round combinatorial indexing, scifi-RNA-seq provides an easier, faster, and more efficient workflow, thereby enabling massive-scale scRNA-seq experiments for a broad range of applications ranging from population genomics to drug screens with scRNA-seq readout. We benchmarked scifi-RNA-seq on various human and mouse cell lines, and we demonstrated its feasibility for human primary material by profiling TCR activation in T cells.
4. ExpansionHunter Denovo: A computational method for locating known and novel repeat expansions in short-read sequencing dataView Egor Dolzhenko, View ORCID ProfileMark F. Bennett, Phillip A. Richmond, Brett Trost, Sai Chen, Joke J.F.A. van Vugt, Charlotte Nguyen, Giuseppe Narzisi, Vladimir G. Gainullin, Andrew Gross, Bryan Lajoie, Ryan J. Taft, Wyeth W. Wasserman, Stephen W. Scherer, View ORCID ProfileJan H. Veldink, David R. Bentley, R.K.C. Yuen, Melanie Bahlo, Michael A. Eberle (Illumina Inc.,)
Expansions of short tandem repeats are responsible for over 40 monogenic disorders, and undoubtedly many more pathogenic repeat expansions (REs) remain to be discovered. Existing methods for detecting REs in short read sequencing data require predefined repeat catalogs. However recent discoveries have emphasized the need for detection methods that do not require candidate repeats to be specified in advance. To address this need, we introduce ExpansionHunter Denovo, an efficient catalog-free method for genome-wide detection of REs. Analysis of real and simulated data shows that our method can identify large expansions of 41 out of 43 pathogenic repeats, including nine recently reported non-reference REs not discoverable via existing methods. ExpansionHunter Denovo is freely available at https://github.com/Illumina/ExpansionHunterDenovo