How to Select Endogenous Controls for Real-Time PCR

Selecting appropriate endogenous controls, also known as reference or housekeeping genes, is an important step in the design of reliable and accurate real-time quantitative PCR (qPCR) experiments. These controls are same for normalizing the expression levels of target genes, thereby compensating for potential variations in sample quantity, RNA quality, and efficiency of the reverse transcription and amplification processes. The stability and consistent expression of endogenous controls across different experimental conditions, cell types, and treatments are necessary to help ensure the validity of the qPCR results.

The process of selecting suitable endogenous controls is important, as the expression of commonly used housekeeping genes can vary under different experimental settings. This introduction aims to guide researchers through the key considerations and best practices for selecting and verifying endogenous controls, helping ensure reliable and reproducible gene expression data from qPCR research experiments.

On this page:

What are endogenous controls, and why are they used as qPCR controls?

Endogenous controls, also known as internal controls or housekeeping genes, are genes that are consistently expressed at stable levels across various experimental conditions, cell types, and treatments. These genes are typically involved in basic cellular functions necessary for cell survival and maintenance, such as metabolism, structure, or cell cycle regulation.

In qPCR, endogenous controls, or internal controls, are used as reference genes to normalize the expression levels of target genes. This normalization is crucial for several reasons:

Product photo of TaqMan assay control gene

Correction for sample variability: Endogenous controls help to account for variations in the amount of starting material, differences in RNA quality, and efficiency of the reverse transcription and PCR processes. By comparing the expression levels of target genes to those of the endogenous controls, researchers can correct for these sources of variability.
Accurate quantification: Normalizing against stable endogenous controls allows for more accurate and reliable quantification of gene expression. It help ensures that observed changes in target gene expression are due to experimental conditions rather than technical artifacts.
Consistency across samples: Using endogenous controls help ensures consistency and comparability of results across different samples and experiments. This is particularly important in experiments involving multiple conditions, time points, or biological replicates.

Common endogenous controls include genes such as GAPDH (glyceraldehyde-3-phosphate dehydrogenase), ACTB (beta-actin), and B2M (beta-2-microglobulin). Selection of appropriate endogenous controls is important and should be validated for each specific experimental setup to help ensure they remain stable under the given conditions.

qPCR internal controls are integral to the accuracy and reliability of qPCR experiments, providing a means to normalize data and help ensure that the results reflect true biological differences rather than technical variations.

How to measure fold change using housekeeping genes in qPCR

qPCR is the method of choice for studying how a change in the conditions under which a gene is expressed—such as the addition of a treatment—affects the amount of mRNA it produces. This is usually quoted in terms of ‘fold change’, e.g., if the treated sample produces twice as much mRNA as the untreated sample, the result is a ‘fold change’ of 2.

Variations in the quantity of DNA produced in a qPCR assay are not due solely to variations in gene expression. The extent of DNA amplification is influenced heavily by the experimental conditions, particularly the initial amount of cDNA. To get a valid result, you need to start with exactly the same amount of cDNA in the treated and untreated samples, which is difficult to achieve unless you include a second gene known to be unaffected by the treatment in each sample. Thus, any difference in the mRNA detected will be the result of changes in starting cDNA concentration. Compare the patterns of gene expression between the second gene and the gene of interest to work out the ‘true’ fold change. This second gene can be termed a housekeeping gene, endogenous control, normalizer, reference gene, or internal control gene.

Example

Suppose you test one gene under two conditions and end up with C_T values of 28.5 in the ‘treated’ sample and 27.5 in the ‘untreated’ sample. This gives a measured difference of 1 between these values (delta C_T). If you knew that the amount of cDNA in each sample was exactly the same, you could calculate the fold change as 2^(delta C_T), and that 2^1=2. You could then conclude that the expression level in the treated sample was twice that in the untreated sample.

But you still can’t tell whether this is a ‘true’ fold change because of differences in sample input, and this is where the endogenous control comes in. You select a control gene that is expressed consistently across all samples in your study, measure its expression level under each condition, and come up with C_T values of 19.5 and 18.5 for the treated and untreated samples, respectively. Here, the delta C_T value for the control would also be 1. The control—which has stable expression values—has given you the same delta C_T as your gene of interest. This could imply that the measured two-fold difference in expression levels is caused by a two-fold difference in the initial amount of cDNA in the samples and is not treatment-related at all.

Therefore, in relative gene expression, expression level changes are measured as the difference between delta C_T for the tested gene and delta C_T for the endogenous control: delta delta C_T.

In this example: delta delta C_T = (28.5 – 27.5) – (19.5 – 18.5) = 0. You can conclude from this that the treatment has made no difference to the level of gene expression.

How do you choose an appropriate endogenous control gene?

In the example above, we assume that the endogenous control gene is expressed at a consistent level in all studied conditions, so any change in control gene expression between the treated and untreated samples will be measured in that gene’s delta C_T value and will contribute to the calculated delta delta C_T. For reliable results, you need to select the correct control.

An endogenous control gene must have stable expression in all samples tested, i.e., the control should not change its expression between treatments, time points, or other test conditions. In practice, zero variation is very rare and endogenous control genes are allowed small differences in C_T values of up to 0.5 C_T. Differences at the top end of this range will introduce imprecisions. From our equation, a difference of 0.5 C_T will equate to a fold change of 2^0.5 or 1.41. However, if we tried a control gene with a difference of 2 C_T between samples, this would equate to a four-fold change in expression levels, making the gene inappropriate as a control.

Top

Choosing and verifying an endogenous control

It is not possible to predict exactly how any gene will behave under a given range of conditions. The most effective way of selecting the most appropriate control gene for a relative qPCR experiment is to select some candidate genes and determine their expression levels across the range of experimental conditions and treatments. The genes that are most stably expressed across these conditions will be the most appropriate controls.

1. Identify candidate endogenous control genes

Identify candidate endogenous genes

It is best practice to evaluate several candidate genes, as the ideal control for each experiment will depend on many variables, including the cell or tissue types involved and the range of conditions to be tested. Certain housekeeping genes for qPCR that encode proteins required for basic cellular function are typically expressed at constitutive levels in a range of cell types and conditions, including disease states. Although these housekeeping genes can be best candidates for endogenous controls, and are worth considering, the expression of some classical housekeeping genes, like beta-actin (β-actin) and glyceraldehyde 3-phosphate dehydrogenase (GAPDH), varies considerably between tissue types [1]. It is essential to test housekeeping genes for variability in expression before using them as endogenous controls in gene expression studies.

Genes that code for ribosomal RNA (rRNA) molecules, rather than proteins, are also stably expressed in almost all cell types and can serve as endogenous control candidates.

Thermo Fisher Scientific supplies TaqMan gene expression assays for human and other eukaryotic rRNA and housekeeping genes for use as endogenous controls. If you are working with human samples, your first port of call should be the TaqMan endogenous control plate. This standard 96-well plate includes triplicates of 32 stably expressed human genes known to be good control candidates; you are likely to find a control among these that is appropriate for your applications. For a wider variety of assays involving other species, go to TaqMan Endogenous Control Assays, select ‘Gene Expression’, ‘Controls’ and your species of interest (or ‘All’), and then click 'Search'.

2. Look up reported endogenous gene expression levels

Look up reported endogenous gene expression levels in different tissues

There are online resources, such as the National Center for Biotechnology Information (NCBI) database , that help you check if gene expression results for a specific tissue type are in line with results from other researchers. Consulting the NCBI database is quick and easy, and helps you avoid laborious literature search. Accessing database information could give you more confidence in your results or it might help you to optimize your experiment or experimental design. This reference material could also help you choose an appropriate control tissue to validate your experiment.

3. Test your candidate genes under a range of conditions

Test your candidate genes under a range of appropriate conditions

Once you have selected your candidate control genes, test each one for stable expression under your study conditions. You should make certain the methodology you use is exactly the same in each case. We recommend following these steps:

Select experimental conditions that are representative of your study (e.g., a specific range of cell types), treatments, or time points.
Purify the RNA from all your samples across different test conditions using the same method.
Quantify the RNA and use the same amount and method for cDNA synthesis.
Test the same volume of cDNA from each candidate control gene across the different experimental conditions in at least triplicate qPCR reactions.
Assess the variability in measured C_T values for each control gene under your chosen conditions, by measuring their standard deviation (SD).

4. Select the appropriate control genes

Select the appropriate control genes for your experiment

The well-suited control gene exhibits stable expression with the least variation in Ct values. This is determined by measuring the SD of the replicate Ct values. The most suitable candidates will be those genes with the lowest SD across all tested conditions.

If your assay reveals several candidate control genes with low variability, choose a control gene with roughly similar expression to your test genes. A significant difference in expression between the test and control genes will lead to poor results in relative gene expression analysis by qPCR.

When is it necessary to use multiple endogenous controls?

It is possible that no single endogenous gene will fit your requirements; in this case, use two or more genes in parallel for optimal results. This approach has been extensively documented in the literature. One example is a study by Schmid et al. of gene expression in renal biopsies from patients with different kidney diseases [2]. The researchers noted that regulation of housekeeping genes in this tissue made any single one of these genes unreliable as a control and suggested that relating expression to 18S rRNA and cyclophilin A in parallel would yield more reliable results.

Multiple controls are also extensively used in studies of gene expression in cancer. This technique helps classify tumors into subtypes defined by gene expression patterns, furthering the research into the morphology of the tumor. Lossos et al. published an optimization of qPCR parameters for differential diagnosis of non-Hodgkin’s lymphomas in which two optimum controls were selected from a panel of 11 housekeeping genes [3]. A later study by Ayakannu et al. on endometrial carcinomas [4] selected three different control genes from a similar but expanded gene panel.

It is clear from even these few examples that there is no "one size fits all" solution to choosing a control. Unless you can find a reliable report in the literature of the exact study you are planning, it is good to cast your net broadly and test a large panel of candidates. For human studies, the TaqMan Array Human Endogenous Control Panel is an excellent place to start.

Top

References

Stylesheet for Classic Wide Template adjustments

For Research Use Only. Not for use in diagnostic procedures.