Preparation of DNA Sequencing Libraries for Illumina Systems—6 Key Steps in the Workflow

Preparing DNA sequencing libraries for Illumina systems involves multiple steps. In a general workflow, purified DNA is fragmented, end-repaired, and A-tailed; adapters are ligated to the DNA fragments; libraries are amplified if necessary; and the prepared libraries are cleaned, quantitated, and normalized before loading onto a flow cell (Figure 1). Because library preparation plays a critical role in obtaining high-quality data [1], researchers should understand the principles and considerations behind each key step in the workflow.


Figure 1. DNA sequencing library preparation.

1. DNA sequencing methods

Common DNA sequencing methods include whole-genome sequencing, de novo sequencing, targeted sequencing, and exome sequencing, each discussed below (Figure 2). DNA may also be sequenced for epigenetic studies, such as methylation analysis (also known as bisulfite sequencing or Bis-Seq) and DNA–protein interaction sequencing (commonly known as ChIP-Seq); these are not covered in this section. The method of choice depends on the research goals and the biological questions to be addressed [2-4].

Figure 2. Common DNA sequencing methods. Exome and gene panel sequencing are considered targeted methods, since they only include subsets of the whole genome. Some gene panels may include promoter sequences.

a. Whole-genome sequencing

Whole-genome sequencing (WGS) sequences the entire genome of an organism from its total genomic DNA. The WGS data from a sample are then compared with a reference sample or control (for instance, cancer cells compared with normal cells) to identify small and large genetic variations. Examples include single nucleotide polymorphisms (SNPs); single nucleotide variations (SNVs); nucleotide insertions and deletions (indels); structural rearrangements such as inversions, duplications, and translocations; and copy number variations (CNVs) (Figure 3).

Figure 3. Common genetic variations.
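The small variant types listed above can be distinguished by comparing a reference allele with the allele observed in the sample. The Python sketch below is a minimal illustration, not the method of any particular variant-calling tool. It assumes VCF-style allele strings, in which an indel shares its leading base with the reference; the function name `classify_variant` and the returned labels are hypothetical.

```python
def classify_variant(ref: str, alt: str) -> str:
    """Classify a small variant from its reference and alternate alleles.

    Assumption: alleles are VCF-style strings, e.g. ref="A", alt="ATG"
    for an insertion of "TG" after the base "A".
    """
    ref, alt = ref.upper(), alt.upper()
    if not ref or not alt:
        raise ValueError("ref and alt must be non-empty allele strings")
    if ref == alt:
        return "none"  # sample matches the reference at this site
    if len(ref) == 1 and len(alt) == 1:
        # A SNP is generally an SNV that is common in a population;
        # that cannot be decided from a single pair of alleles.
        return "SNV"
    if len(alt) > len(ref):
        return "insertion"
    if len(alt) < len(ref):
        return "deletion"
    return "substitution"  # same length, more than one base changed


if __name__ == "__main__":
    for ref, alt in [("A", "G"), ("A", "ATG"), ("ATG", "A"), ("AT", "GC")]:
        print(ref, alt, classify_variant(ref, alt))
```

Structural rearrangements and CNVs span much larger regions and generally cannot be recognized from a single pair of short alleles like this, so they are left out of the sketch.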

WGS is useful for uncovering genetic mutations in an unbiased and detailed manner. However, it requires a large amount of sample input and involves extensive data processing, especially when analyzing the human genome, which is large and complex.
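To give a sense of the data volume involved, mean sequencing depth is commonly estimated as the number of reads multiplied by the read length and divided by the genome size. The sketch below applies that relationship. The numbers in the usage example are illustrative assumptions that do not come from this article: a haploid human genome of roughly 3.1 billion bases, 150-base reads, and a 30x target depth. The function names are hypothetical.

```python
import math


def mean_coverage(num_reads: int, read_length: int, genome_size: int) -> float:
    """Estimate mean depth as (reads * read length) / genome size."""
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return num_reads * read_length / genome_size


def reads_needed(target_coverage: float, read_length: int, genome_size: int) -> int:
    """Number of reads required to reach a target mean depth (rounded up)."""
    if read_length <= 0:
        raise ValueError("read_length must be positive")
    return math.ceil(target_coverage * genome_size / read_length)


if __name__ == "__main__":
    # Assumed, illustrative values (not from the article).
    HUMAN_GENOME_BP = 3_100_000_000
    READ_LENGTH_BP = 150
    TARGET_DEPTH = 30

    n = reads_needed(TARGET_DEPTH, READ_LENGTH_BP, HUMAN_GENOME_BP)
    print(f"Reads needed: {n:,}")
    print(f"Check depth:  {mean_coverage(n, READ_LENGTH_BP, HUMAN_GENOME_BP):.1f}x")
```

Under these assumptions the estimate is 620 million individual reads (a read pair counts as two). This is an idealized average: it ignores duplicates, unmapped reads, and uneven coverage, so real experiments typically need some margin above it.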


