SeqStudio Genetic Analyzer Applications
Diverse Sanger sequencing and fragment analysis applications with the new SeqStudio Genetic Analyzer
Designed for use by research assistants and scientists, the latest innovation for Sanger sequencing and fragment analysis is a low-throughput, easy-to-use, and convenient benchtop system. The features of the SeqStudio Genetic Analyzer make running capillary electrophoresis (CE) experiments easier with minimal hands-on time due to an all-in-one cartridge, facilitate collaboration through Thermo Fisher Cloud–based sharing and applications, and introduce new opportunities to run both sequencing and fragment analysis samples at one time.
Highlights of several powerful Sanger sequencing and fragment analysis applications:
One of the most common applications of Sanger sequencing is the analysis of inserts subcloned into plasmids. Applied Biosystems BigDye chemistries are widely used for Sanger sequencing and an integral part of plasmid sequencing workflows. Several of the new features on the SeqStudio platform offer benefits to researchers performing basic plasmid sequencing methods. The instrument is preloaded with sequencing modules optimized for short (<300 bp), medium (500 bp), and long (>600 bp) read lengths, and can also be customized on the instrument to meet specific needs. The swappable cartridges can be associated with individual projects and users. The cloud-based Sanger Quality Check application provides an intuitive set of tools to analyze sequencing traces. Finally, the cloud connectivity for remote monitoring, accessing, and sharing sequencing information can help collaborators rapidly analyze the same data sets.
The performance of the SeqStudio instrument for plasmid sequencing was determined by sequencing the pGEM7zf+ plasmid with M13 primers and Applied Biosystems BigDye Terminator v3.1 chemistry. Results were obtained by analyzing the sequencing traces using the Sanger Quality Check module on the Thermo Fisher Cloud (Figure 1). In the example shown, the same plasmid was sequenced in 16 wells and analyzed on the SeqStudio Genetic Analyzer in 4 different injections. Note that the trace score, peak under peak (PUP) values, contiguous read length (CRL), and QV20+ (length with quality values >20) are similar for each sample. Similar results were obtained in traces on the other strand, and in other experiments by using Applied Biosystems BigDye Terminator v1.1 chemistry. These data demonstrate that the SeqStudio platform can generate plasmid sequencing results of very high quality.
Figure 1. Analysis of sequencing quality using the Sanger Quality Check Cloud app. (A) Once a run is completed, the SeqStudio instrument displays the resulting sequence file as well as the quality scores for each base. (B) Sixteen separate pGEM7zf+ sequencing reactions were run on the SeqStudio instrument and the .ab1 files were uploaded to the cloud and analyzed. Note that the sequencing metrics were very similar in the sixteen different reactions. CRL = contiguous read length, QV20+ = number of nucleotides with a quality value >20.
The SeqStudio Genetic Analyzer can be used by clinical researchers to maintain the gold-standard quality for detecting and verifying the presence of mutant alleles in tumor tissue. The SeqStudio system integrates with the following tools to simplify Sanger sequencing workflows:
- The SeqStudio Genetic Analyzer comes preloaded with running modules optimized for fragmented DNA extracted from formalin-fixed, paraffin-embedded tissue.
- The cloud-based NGC module allows investigators to rapidly verify variants identified in next-generation sequencing (NGS) .vcf files using Sanger sequencing traces.
- Allelic variants at frequencies down to 5% can be detected using the Applied Biosystems Minor Variant Finder (MVF) Software and Sanger traces generated by the SeqStudio instrument.
- Applied Biosystems BigDye Direct and BigDye XTerminator chemistries simplify the Sanger sequencing workflow by providing one-tube sequencing and clean-up.
The performance of the SeqStudio Genetic Analyzer for detecting mutant alleles in tumor samples was determined by analyzing genomic DNA extracted from 10 different FFPE tumor samples, and determining variant frequencies at 4 different hotspot regions. The frequency of mutant alleles was determined by NGS using the Ion Torrent Oncomine Oncology Focus Panel, and Sanger sequencing using BigDye Direct/BigDye XTerminator chemistries and MVF Software. The correlation between the frequencies measured by the SeqStudio Genetic Analyzer was excellent when compared to NGS at allele frequencies—from about 9% to about 70% (Figure 1).
The ability of the SeqStudio Genetic Analyzer to analyze variant frequencies was also determined using a 96-well plate containing Sanger sequencing primers that query the most common tumorigenic mutations in KRAS and NRAS. The minor allele frequency analysis of SeqStudio instrument traces accurately measured the allele frequencies in 1 ng of diluted FFPE-extracted DNA (Figure 2A). Therefore, researchers needing to detect rare alleles can be confident that the SeqStudio Genetic Analyzer will produce accurate results on FFPE tissues.
Finally, the cloud-based NGC application simplifies the confirmation of variants identified by NGS by organizing Sanger sequencing traces by amplicons and specimens, and aligning them in the proper orientation to the candidate variant sequences in a .vcf file. To show the utility of the NGC app in an oncology workflow, we confirmed the presence of an NRAS mutation identified using the Oncomine Oncology Focus panel by Sanger sequencing (Figure 2B). The SeqStudio results verified that the mutation in NRAS (p.Ala59Thr) was present. Therefore, focused and rapid examination of the most meaningful portions of sequencing traces by the NGC app facilitates NGS variant confirmation.
Figure 2. Analysis and confirmation of variants by SeqStudio Genetic Analyzer and the NGC application, respectively. (A) Eight different FFPE samples with mutations at known RAS hotspots were diluted to 5% allele frequency, then analyzed using a 96-well plate containing Sanger sequencing primers that query the most common tumorigenic mutations in KRAS and NRAS, and using the SeqStudio Genetic Analyzer. Each of the allele queries accurately measured the allele frequencies; deviations from 5% reflected slight inconsistencies in starting concentration of the samples. Yellow line is 5% frequency. Similar results were seen with 10% and 50% dilutions. (B) Confirmation of variants identified by NGS. From a .vcf file generated using Ion Reporter Software, Sanger sequencing primers targeting loci of interest were ordered from Primer Designer, samples were sequenced on the SeqStudio instrument, and variants common to the .vcf file and the Sanger sequencing traces were highlighted using the NGC cloud app.
Genome editing technologies, including CRISPR-Cas9– mediated editing events, are rapidly becoming accessible to a majority of biological science researchers, and are poised to revolutionize all fields of biology and health care. Thermo Fisher Scientific offers all the tools necessary for a genome editing project. As an integral part of such a project, the features of the SeqStudio Genetic Analyzer facilitate Sanger sequencing analyses and fit well within a genome editing workflow. In particular, the data generated are compatible with Tracking of Indels by Decomposition (TIDE) software, a widely available tool for analyzing the efficiency of genome editing events.
The utility of the SeqStudio Genetic Analyzer in a genome editing project was shown by obtaining whole-cell lysates from HEK293 cells that were edited to introduce random deletions around a targeted site in the HPRT or the relA locus. To confirm the position of the edit, the Sanger sequencing traces were uploaded to the cloud and analyzed using the Sanger Variant Analysis module (Figure 1). Note that the position of the edit is clearly indicated and can be visualized by the abundant mixed- base peaks downstream of the break. The efficiency of the edits in this mixed primary culture was determined by analyzing these trace files using the TIDE software. In each case, the spectrum and frequencies of deletions at each locus was nearly identical using the data generated in the forward and reverse directions (Figure 2). These frequencies confirm results obtained using Invitrogen TOPO cloning and followed by Sanger sequencing results of the same edited cell populations.
Figure 2. Analysis of two different genome editing events at the HPRT and relA loci using TIDE software and mixed population sequencing traces generated by the SeqStudio instrument. The bars show the proportion of the population having the indicated number of nucleotides deleted or inserted. For (A) HPRT, the overall efficiency of the edit was around 80%, whereas the overall efficiency at the (B) relA locus was around 20%.
The study of development of human diseases relies heavily on the analysis of dissociated human cell lines grown in culture. However, an increasingly acknowledged problem is that cells grown in vitro can be misidentified or contaminated with other unrelated cell lines. The identity of cell lines can be verified by analysis of a highly specific genetic “fingerprint” of highly variable short tandem repeats (STRs). The SeqStudio platform integrates well with the Thermo Fisher Scientific cell line authentication solution. The Applied Biosystems Identifiler Plus and Identifiler Direct kits can be used on purified and crude DNA preparations, respectively, for analyzing 16 highly variable human STR loci commonly used for verifying cell line authenticity. The Applied Biosystems GeneMapper Software, used for analyzing alleles identified by Identifiler kits, is compatible with data produced by the SeqStudio instrument, and the results can be used to query ATCC or other STR databases to verify authenticity.
To demonstrate the utility of the SeqStudio instrument in a cell line authentication workflow, allelic information on STRs was obtained from five different, commonly used human cell lines. The identity of the cell lines was confirmed even with as little as 300 pg of gDNA. To show the ability to detect contaminating cells, a population of M4A4GFP cells was spiked with varying amounts of HeLa cells and analyzed using the Identifiler Direct kit. HeLa cell–specific alleles could be detected even if only 10% of the population had HeLa cells (Figure 1). Therefore, when coupled with the Identifiler kits, the SeqStudio instrument can be a central component for a cell line authentication solution.
Figure 1. Analysis of cell line contamination on the SeqStudio instrument. HeLa cells and M4A4GFP cell suspensions were diluted to 5 x 105 cells/mL, mixed in the indicated proportions, and spotted onto NUCLEIC-CARD Sample Collection Device. Contaminating HeLa cells can be detected with high confidence on the SeqStudio instrument if they make up approximately 20% of a population; however, some alleles unique to HeLa can be detected if they make up as little as 10% of a population.
One widely used method for studying inherited human diseases arising from variations in copy number of a locus is multiplex ligation–dependent probe amplification. This method, developed and commercialized by MRC Holland, can analyze up to 50 multiplexed pairs of adjacently located probes hybridizing to the loci of interest. The high dynamic range, sizing precision, and peak-height fidelity necessary for analyzing MLPA probe amplicons make the SeqStudio system an ideal platform for performing MLPA analyses. Results obtained on the SeqStudio instrument are compatible with MRC Holland’s Coffalyzer.Net software for analyzing MLPA data.
MLPA on the SeqStudio instrument was used to analyze a DNA sample from a probe that is known to carry a duplication of exons 2–30 in the Duchenne muscular dystrophy (DMD) gene and a normal sample using the P034 DMD assay set from MRC Holland. The peak heights and relative sizes of these samples can readily be translated into an accurate detection of the region containing the duplication (Figure 1). Similar results were obtained using probes for large and small deletions. Therefore, the SeqStudio instrument can be an integrated tool for MLPA investigations of regions containing CNVs.
The ready availability of genomic data opens the opportunity to identify species in an unknown sample by sequencing DNA of “fingerprint” loci. The Applied Biosystems family of kits, for example, the MicroSEQ kit, has simplified the identification of prokaryotes and fungi by Sanger sequencing ribosomal DNA (rDNA) sequences. Similarly, eukaryotic organisms can be identified using the mitochondrial-specific loci as the identifying locus. This strategy has been exploited in the Barcode of Life project (barcodeoflife.org), providing a means for rapidly establishing the identity of unknown eukaryotic samples.
To illustrate the performance of the SeqStudio Genetic Analyzer for microbial identification, we obtained genomic DNA samples from ATCC for a variety of microorganisms, and sequenced them using the Applied Biosystems MicroSEQ 500 PCR kit and the SeqStudio instrument. The resulting sequences were queried against the BLAST database. For each sequencing reaction, the correct organism was identified with the highest BLAST confidence. Similarly, using primers for fish mitochondrial sequences (CO1 gene) and fish samples, the fish species was correctly identified as the top BLAST hit. The accurate identification of the species queried with BLAST illustrates how well the SeqStudio platform can be used for species identification.
|Number of organisms||Number of queries||Percent correct|
Table 1. Analysis of species ID using the SeqStudio Genetic Analyzer. Samples of microorganism DNA or genomic DNA extracted from fish were sequenced using primers for 16s rDNA and the MicroSEQ kit (BigDye Terminator v1.1 chemistry), or using primers for fish mitochondrial CO1 sequences and BigDye Terminator v3.1 chemistry.
The ability to detect single-nucleotide polymorphisms (SNPs) plays a critical role in understanding how the genome influences biological phenotypes. To analyze SNP variants, the Applied Biosystems SNaPshot Multiplex System was developed. Customizable, color-coded fragments of differing sizes, corresponding to specific alleles, are analyzed by fragment analysis. The SeqStudio system includes new features that facilitate SNaPshot analysis, including built-in reporting of fragment analysis results of size and peak area. Additionally, the ability to mix fragment analysis and sequencing reactions on one plate enables investigators to perform SNP profiling and Sanger sequencing on a single run.
To illustrate the functional utility of the SeqStudio instrument in SNaPshot workflows, genomic DNA from FFPE-preserved tumor slices was collected and analyzed using probes targeting KRAS G12X and G13X alleles using the SNaPshot multiplex reagent kit. The SeqStudio instrument produced results that clearly showed the presence and accurate calls of the different alleles at this position (Figure 1). Note that although the detection of the alleles was accurate on SeqStudio instrument, the absolute migration of all peaks will differ slightly when compared to that in other platforms due to the different chemical nature of the different polymers. Therefore, to associate a peak with an allele without an ambiguity, a calibration with known alleles should be performed before undertaking a large-scale analysis.
- Introduction to SeqStudio applications
- Extended RAS Research Assay on the SeqStudio (Oncology Research)
- Enabling neurological disease research via DNA fragment analysis on the SeqStudio
- Genome editing workflow facilitated by the Thermo Fisher Scientific portfolio solution
- A Complete Workflow for Human Cell Line Authentication
SeqStudio Customer Spotlights
- SeqStudio Enables Community STEM Outreach at Pittsburgh’s Citizen Science Lab
- SeqStudio Speed and Accuracy for Inherited Disease Research
- SeqStudio for Translational Research at the University Hospital of Basel
- Shedding Light on Missing Heritability: SeqStudio Fragment Analysis for Neurological Disease Research