Comparison of direct cDNA and PCR-cDNA Nanopore sequencing of RNA from <i>Escherichia coli</i> isolates.
Rodger G., Lipworth S., Barrett L., Oakley S., Crook DW., Eyre DW., Stoesser N.
Whole-transcriptome (long-read) RNA sequencing (Oxford Nanopore Technologies, ONT) holds promise for reference-agnostic analysis of differential gene expression in pathogenic bacteria, including for antimicrobial resistance genes (ARGs). However, direct cDNA ONT sequencing requires large concentrations of polyadenylated mRNA, and amplification protocols may introduce technical bias. Here we evaluated the impact of direct cDNA- and cDNA PCR-based ONT sequencing on transcriptomic analysis of clinical Escherichia coli. Four E. coli bloodstream infection-associated isolates (n=2 biological replicates per isolate) were sequenced using the ONT Direct cDNA Sequencing SQK-DCS109 and PCR-cDNA Barcoding SQK-PCB111.24 kits. Biological and technical replicates were distributed over eight flow cells using 16 barcodes to minimize batch/barcoding bias. Reads were mapped to a transcript reference and transcript abundance was quantified after in silico depletion of low-abundance and rRNA genes. We found there were strong correlations between read counts using both kits and when restricting the analysis to include only ARGs. We highlighted that correlations were weaker for genes with a higher GC content. Read lengths were longer for the direct cDNA kit compared to the PCR-cDNA kit whereas total yield was higher for the PCR-cDNA kit. In this small but methodologically rigorous evaluation of biological and technical replicates of isolates sequenced with the direct cDNA and PCR-cDNA ONT sequencing kits, we demonstrated that PCR-based amplification substantially improves yield with largely unbiased assessment of core gene and ARG expression. However, users of PCR-based kits should be aware of a small risk of technical bias which appears greater for genes with an unusually high (>52%)/low (<44%) GC content.