Mastering Single-Cell Sequencing: Key Choices in Library Prep & Why They Matter

 

   03/04/2025

   Cécile Thion, PhD

   12-minute read

Share this article:

Email
Facebook
LinkedIn
WhatsApp
Print

Single-cell sequencing enables researchers to study cellular diversity, unlocking critical insights into fields like oncology, neurology, and immunology. By analyzing individual cells, researchers can detect rare mutations, track disease progression, and understand cellular heterogeneity with unprecedented precision. However, success in single-cell sequencing depends on the quality of cell isolation and Next-Generation Sequencing (NGS) library preparation. This blog explores how cutting-edge single cell isolation and precision dispensing technologies enhance sequencing accuracy, reduces reagent waste, and streamlines workflows, driving the next generation of single-cell genomic discoveries.

 

The Power of Single Cell Genomics

 

Single-cell sequencing encompasses several fields of single-cell omics, including genomics, transcriptomics, and epigenomics. This article will focus primarily on genomics, where recent applications of single-cell sequencing have led to advances across oncology, neurology, and immunology [1]. For example, a study published in Nature Genetics used CELLENION’s single-cell isolation technology to identify a rare subset of cells exhibiting copy number variations in patients with BRCA1 and BRCA2 mutations, providing insights into tumor development [2]. Such studies are only possible with instruments that facilitate both precise single-cell isolation and dispensing onto tiny targets, while maintaining environmental controls for cell viability, and reagent stability.

Let’s explore three fundamental steps in single-cell whole genome sequencing (scWGS) workflows and how accurate single-cell dispensing enhances insights while addressing common challenges.

 

Step1: Single-Cell Isolation – The Foundation of High-Quality Data

 

Several methods are used for single-cell isolation before sequencing. However, some methods restrict researchers’ ability to sort targeted populations or identify rare cell types while preserving cellular integrity, among other issues.

  • FACS: This method allows researchers to target single cells and specific populations using fluorescence. However, it subjects cells to shear forces, which can lead to reduced viability, and can require a large number of cells as initial input [3]. Moreover, because it is by definition “Fluorescence-Activated”, it requires that cells are fluorescent, i.e., stained if non-natively fluorescent. Fluorescent staining introduces biases and is time and money-consuming.
 
  • Droplet Microfluidics: Droplet microfluidics technology utilizes the natural properties of liquids to encapsulate single cells and reagents in droplets, then merge those cell and reagent droplets. In microfluidics devices, the efficiency of single-cell encapsulation, defined as the ratio of the number of droplets containing cells to the total number of droplets, is determined by the Poisson distribution, which reflects the probability that a cell is randomly “alone” in the volume of liquid which is encapsulated. While effective for large-scale single-cell sequencing projects, it carries a high risk of (i) reagent waste due to cell-free droplets and (ii) sequencing data contamination from cell multiplets, which can confound downstream analyses.
 
  • Manual Micromanipulation: This method involves using a microscope and a micropipette to manually aspirate single cells. While simple (in theory) and relatively inexpensive, this technique suffers from low throughput, very long hands-on time, and has the potential for artificial variability introduced by operator error [4].
 
  • Droplet Dispensing: Single-cell dispensers like CELLENION’s cellenONE utilize drop-on-demand dispensing, of droplets ranging from nanoliter to picoliter volumes, combined with image-based precision, to ensure gentle handling and verifiable single-cell isolation (Fig. 1). While fluorescence is not required for cell isolation, the instrument can incorporate fluorescence-based sorting, providing researchers with greater flexibility in their workflows. For some single-cell dispensers, low numbers of available cells, low sample volumes, or heterogeneity of cell sizes can make it challenging to perform initial isolation. By contrast, cellenONE uses precision droplet dispensing and is compatible with low sample volumes (~3 μL) and any cell size (0.5 to ~80 μm), including in mixtures, providing a versatile and forgiving method for sequencing library generation of rare cells
 
 
Figure 1. The cellenONE combines sciDROP and image-based single-cell isolation technology to isolate single cells from complex mixtures, minimizing doublets and providing images to verify single-cell dispensing. In this unique system, cells are aspirated through a capillary that is driven by high-precision XYZ axes in front of the microscopic camera. By analyzing the trajectory of particles through the capillary, the cellenONE software determines the area of the capillary image that corresponds to the volume of liquid that will be ejected next (below green line). When one, and just one, cell of the desired size, elongation, and fluorescence is detected in this area (left), the capillary is driven to the target consumable, and the droplet containing the cell is ejected. For single-cell sequencing workflows, the target well is prefilled with the first reagent mix of the library prep workflow, often a lysis buffer. If no object (middle), multiple objects, or one object with wrong parameters (right) is detected, the droplet is ejected in the ‘recovery’ tube.

Step 2: Single-cell Sequencing Library Preparation – Turning Cells into NGS-Ready Samples

Library preparation for single-cell omics is the process of converting DNA or RNA samples into a format compatible with NGS platforms. While the typical workflows differ according to the molecule of interest (RNA or DNA), some steps are typically similar to all single-cell sequencing strategies. 

After cell isolation, DNA or RNA needs to be accessed by the library prep reagents: this step is typically performed by lysing the cell and nuclear membranes, while some protocols simply make it permeable to the enzymatic catalysts. Once the molecular material is accessible, it must be amplified to generate enough material for subsequent reactions and their detection by sequencing instruments. A human cell contains about 6 pg of DNA, and an E. coli cell has less than 6 femtograms! Another crucial step in all single-cell omics workflow is the introduction of so-called cell barcodes, i.e., the addition of small nucleic acid sequences, different for each individual cell, allowing the bioinformatics-driven re-attribution of sequencing reads to the exact cell they originate from. Finally, sequencing adapters must be added to the barcoded DNA to enable their reading by the sequencing platform. 

Variations and adaptations of these core library prep principles are infinite and primarily depend on the molecular phenomenon of interest.

 

Single-cell RNA-seq (scRNA-seq)

Single-cell transcriptomics studies the variability of cell transcriptomes at the individual cell level. It is particularly relevant to cancer research, where it can help researchers understand the heterogeneity of cell populations within tumors [5]. There are several approaches to library prep in scRNA-seq, each with unique advantages.

 

Targeting the Transcript

scRNA-seq analysis may focus on different regions of the transcript molecules. Full-length scRNA-seq, mostly used for variant exploration, captures the entire messenger RNA (mRNA) length, preserving complete transcript sequences. 3’ and 5’ scRNA-seq focus on a different end of the mRNA molecules: while the former is used for the exploration of differential gene expression, as just a few dozen base pairs of the transcript 3’ end are needed to identify a gene in well-known targets such as the human genome, the latter focuses on post-transcriptional modifications, mainly operating at 5’ end of messenger molecules. Other scRNA-seq library preparation methods aim at sequencing the full transcriptome of each cell, including non-messenger RNAs, particularly regulatory RNAs. Finally, some targeted scRNA-seq enhances sensitivity and cost-efficiency by focusing on specific gene groups through hybridization probes or amplicon-based methods.

 

Protocol Selection

Overall, the choice of the scRNA-seq library prep protocol is highly dependent on the research question and model of interest. Most scRNA-seq methods implement a reverse-transcription (RT) step, which is necessary to convert RNA into PCR- and NGS-compatible DNA relatively early in the workflow, if not as a first step. This allows the introduction of cell barcodes early in the protocol by adding them onto the RT primers. This way, the libraries of multiple single cells can be quickly pooled into one sample for easier and quicker subsequent handling. cDNA can then be fragmented using different methods depending on the transcript regions of interest. It can also be amplified, typically by PCR, before or during the addition of more classical sequencing adapters. As with any molecular biology workflow, scRNA-seq has biases and limitations, including potential PCR biases, which are proportional to the number of PCR cycles needed to yield enough material for sequencing.

 

Single-Cell Whole Genome Sequencing (scWGS)

scWGS refers to the sequencing of the total DNA material of individual cells and has led to insights such as tracking the progression of melanoma tumors during anti-programmed cell death protein 1 therapy [5]. scWGS library prep requires the amplification and fragmentation of all of the DNA material in a cell.

 

Amplification Method

This step can be performed using a variety of methods, such as high-throughput in-vitro transcription (followed by RT) or rolling circle amplification, amongst which the most popular method is Multiple Displacement Amplification (MDA). While MDA, catalyzed by a bacteriophage-derived DNA polymerase, ensures the highest amplification yield, it also introduces important amplification biases, as certain regions of the genome may become overrepresented or underrepresented compared to others (e.g., loss of GC-rich regions [6]).

 

Fragmentation and Barcoding

Subsequently, the cutting of amplified DNA into NGS-compatible fragments and the addition of the cell barcodes and sequencing adapters generally involve either fragmentation (generally enzymatic) followed by adapter ligation ,or tagmentation, which achieves both steps in a single reaction thanks to engineered transposases. 

 

All of these steps involve very expensive enzymatic reagents, and their efficiency is tied to their ability to “meet” their target molecule in the reagent mix. Therefore, miniaturization represents a significant interest in terms of cost and efficiency.

 

The Precision Microdispensing Advantage

Precision microdispensing provides several advantages for single-cell sequencing library preparation. One advantage is minimizing reagent waste by delivering very small, and very precise, volumes, preventing excess usage and drastically reducing reagent cost, especially when using instruments with low dead volume. Moreover, miniaturized workflows often prove more efficient for single-cell sequencing. DNA content from a single cell falling into a classical 50 µL microplate-based assay would be like pouring a glass of fruit juice into the ocean: it would be very diluted. Why is this an issue? Because most enzymatic reactions, but also sequencing, require minimum target molecule concentration or mass to yield sufficient product, or even to start at all. Library miniaturization increases the chance that substrate and enzyme meet and then react efficiently.

 

Maximizing Efficiency While Minimizing Bias

Precision dispensing maximizes efficiency and minimizes amplification bias in certain contexts. For example, researchers used the cellenONE system to develop a scalable method for single-cell whole-genome sequencing called Direct Library Preparation (DLP+) that allowed single-cell WGS to be scaled to hundreds of thousands of genomes. The cellenONE performs sorting and dispensing of single cells into aluminum chips containing more than 5,000 nanowells prefilled with lysis buffer and individual cell barcodes while removing debris and multiplets.

After cell lysis, DNA is directly tagmented, by transposases that cut the DNA into controlled-size fragments and introduce PCR primer binding sites. PCR is then performed directly in the nanowells, introducing cell barcodes and generating sequencing-ready libraries in a few hundred nanoliters! This method was further improved by incorporating images of the isolated cells generated by the cellenONE, which provided deeper insights into cellular aneuploidy (Fig. 2). This approach enabled the analysis of rare cell populations, the replication status of individual cells, and other important cellular characteristics.

Figure 2. The cellenONE from CELLENION reduces amplification bias but also provides extra imaging information, allowing researchers to generate deeper insights into cellular states by linking cell morphological phenotype and genomic information. Adapted from Laks et al. 2019 (7)

Step 3: Beyond Library Preparation: Sequencing and Analysis of Single-Cell Sequencing Data

 

The choice of sequencing platform is determined in large part by the question being asked. A fundamental parameter is the choice between short- and long-read sequencing. Moreover, sequencing is only part of the process — downstream bioinformatics analyses, from raw data processing to complex computational interpretation, are crucial for extracting meaningful insights from single-cell sequencing experiments.

 

Short-Read vs. Long-Read Sequencing

The decision between long-read and short-read single-cell sequencing depends on the specific research objective. Short-read sequencing, typically performed using Illumina sequencers, is highly accurate for base-calling, making it well-suited for detecting SNPs and analyzing predefined genomic regions. In contrast, long-read sequencing platforms, like PACbio Revio, are more effective at identifying structural variants such as duplications and offer better coverage of highly repetitive genomic regions for single-cell genomics applications [8].

 

The Impact of Library Prep on Single-cell Sequencing Data

Library preparation methods play a crucial role in single-cell whole genome sequencing by influencing read depth, sensitivity, and genome coverage.

 

Genome coverage

Ideally, every base pair or genomic region should be represented in single-cell genomics data, resulting in a “complete” genome. In theory, most NGS platforms generate more than enough information (number of base pairs) to generate complete genomes, even large ones. However, even a high sequencing depth (or read depth—the number of raw reads per cell) may not be sufficient if the reads are unevenly distributed across the genome, resulting in incomplete genome coverage. Read distribution, or evenness, is highly influenced by sequence-specific patterns and library prep quality, including fragment size distribution and amplification biases (PCR and/or MDA) [9].

 

Sensitivity

This refers to the ability to detect low-abundance sequences. In scRNA-seq, low sensitivity introduces a risk that the variations in the least expressed transcripts, which may still have important physiological roles, are obscured by very strong signals from the most expressed ones, which are often less useful for discriminating between cell populations. In single-cell genomics, loss of sensitivity leads to the loss of the least represented information in uneven genome reads (see above).

Single-cell sequencing sensitivity may be influenced by sequence-specific factors, including GC content, sequence repeats, or secondary structures that modulate enzymatic reactions such as amplification by PCR or MDA, or RT. The methodological choices and quality of each step of the library preparation can have a big impact on single-cell sequencing sensitivity. For instance, incomplete lysis compromises access to DNA/RNA, and sub-efficient RT leads to the loss of the lowest abundance signal. Finally, bioinformatics alignment errors, filtering thresholds, and normalization strategies impact sensitivity in detecting true biological signals.

 

Contamination

Library preparation from single cells, more than any other molecular biology applications, are highly sensitive to contamination from foreign DNA. Non-contact dispensing and automation help mitigate these risks by eliminating manual handling, while advanced imaging capabilities virtually ensure precise single-cell dispensing. cellenONE extra-small droplets (only a few hundred picoliters of medium are ejected along with the cell) also minimize contamination via carry-over liquid (Fig. 3). Finally, the reduction of the library volume reduces the “kit-ome” background, which is the inevitable contamination through DNA remaining in the bioengineered reagents themselves, by reducing the volume of these reagents.

 

Figure 3. The cellenONE from CELLENION uses precision dispensing to overcome several common pitfalls in single-cell isolation prior to library preparation.

Precision Dispensing and Single-Cell Isolation Accuracy Empower Single-Cell Genomics Research

 

The rapid advancements in single-cell sequencing are transforming our understanding of cellular diversity and disease mechanisms. By leveraging precision dispensing combined with highly accurate single-cell isolation, like CELLENION’s cellenONE, researchers can enhance sequencing efficiency and minimize reagent waste. This offers seamless and reproducible single-cell isolation workflows, driving new discoveries in oncology, neurology, and immunology. As the field continues to evolve, adopting cutting-edge tools will be essential for unlocking deeper insights and pushing the boundaries of genomics research.

Contact the CELLENION team today to see how you can leverage the latest in single-cell dispensing for your genomics workflows.

 

References

  1. Evrony GD, Hinch AG, Luo C. Applications of Single-Cell DNA Sequencing. Annu Rev Genomics Hum Genet. 2021;22:171-197. doi:10.1146/annurev-genom-111320-090436
  2. Williams MJ, Oliphant MUJ, Au V, et al. Luminal breast epithelial cells of BRCA1 or BRCA2 mutation carriers and noncarriers harbor common breast cancer copy number alterations. Nat Genet. 2024;56(12):2753-2762. doi:10.1038/s41588-024-01988-0
  3. Brestoff JR, Frater JL. Contemporary Challenges in Clinical Flow Cytometry: Small Samples, Big Data, Little Time. The Journal of Applied Laboratory Medicine. 2022;7(4):931-944. doi:10.1093/jalm/jfab176
  4. Lee LM, Liu AP. The Application of Micropipette Aspiration in Molecular Mechanics of Single Cells. J Nanotechnol Eng Med. 2014;5(4):0408011-0408016. doi:10.1115/1.4029936
  5. Coulter EM, Bewicke-Copley F, Mossner M, Graham TA, Fitzgibbon J, Okosun J. Defining an Optimized Workflow for Enriching and Analyzing Residual Tumor Populations Using Intracellular Markers. The Journal of Molecular Diagnostics. 2024;26(4):245-256. doi:10.1016/j.jmoldx.2024.01.003
  6. Wang Y, Fan JL, Melms JC, et al. Multimodal single-cell and whole-genome sequencing of small, frozen clinical specimens. Nat Genet. 2023;55(1):19-25. doi:10.1038/s41588-022-01268-9
  7. Laks, E et al. “Clonal decomposition and DNA replication states defined by scaled single-cell genome sequencing.” Cell5 (2019): 1207-1221. doi:10.1016/j.cell.2019.10.026
  8. Logsdon GA, Vollger MR, Eichler EE. Long-read human genome sequencing and its applications. Nat Rev Genet. 2020;21(10):597-614. doi:10.1038/s41576-020-0236-x
  9. Aird D, Ross MG, Chen WS, et al. Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries. Genome Biol. 2011;12(2):R18. doi:10.1186/gb-2011-12-2-r18