What Is Next Generation Sequencing (NGS) And How Is It Used In Drug Development?

NGS methodologies produce high-throughput sequence data. Combined with appropriate computational analyses, these data facilitate variant identification and have proved extremely valuable in the pharmaceutical industry and in clinical practice for developing drug molecules that inhibit disease progression. By providing a comprehensive profile of an individual’s variome (particularly the clinically relevant part consisting of pathogenic variants), NGS helps in identifying new disease genes. Pharma companies can use this information on genetic variations and target disease genes to develop drugs that counter these variants and their disease-causing effects. However simple this may sound, determining genetic mutations and drug target genes requires population-scale NGS analyses, as well as focused analyses of clinical trials to identify biomarkers for drug efficacy or safety.

Diverse NGS methods such as Whole Genome Sequencing (WGS), Whole Exome Sequencing (WES), transcriptome sequencing, and targeted sequencing can be used to perform molecular profiling and to discover novel drug biomarkers or targets. Among these methods, WGS detects mutations across the whole genome, whereas WES focuses on only 1-2% of the genome yet covers > 95% of the exons, typically at a greater depth over those exons than WGS provides for a comparable sequencing effort. Transcriptome sequencing is used to profile mRNA expression and to detect non-coding RNAs. Thanks to its high sequencing depth and exon coverage, targeted sequencing is useful in detecting rare variants.

In addition to the aforementioned sequencing methods, bisulfite sequencing, ribosome profiling, and ChIP-sequencing have proved to be of vital importance for drug development and clinical practice (a comprehensive and strategic explanation of various sequencing methods is available in the “ExSeq program”). Epigenetic mechanisms such as chromatin remodeling, DNA methylation, and histone modification, which several of these methods probe, are very useful in gene and non-coding RNA expression profiling, which plays a vital role in epigenetic drug development. Additionally, many cancer therapies are based on epigenetic drugs. Epigenetic drugs have a few advantages, such as:

a) many diseases are not mutation dependent, but rely on altered levels of expression of epigenetic modulators.

b) these drugs have relatively low toxicity as long as critical thresholds are not crossed.

1. NGS target identification

NGS can generate enormous amounts of sequence data from disease samples. These data have the potential to uncover mutations associated with genetic diseases and to determine target genes for drug development. To identify these target genes, an Electronic Health Record (EHR) approach coupled with NGS can be used. There are two ways to execute this methodology:

a) By leveraging the rich phenotype information in the EHR, the association between variants in candidate drug targets and selected phenotypes of interest can be examined. Applying NGS to well-phenotyped population data in the EHR system is useful in revealing phenotype-specific drug targets for multiple diseases or phenotypic traits simultaneously. This approach can also be called “EHR-based phenotyping and target selection.” Its advantage is that it relies on the population already in the healthcare system and does not require separate data collection for specific phenotypes. Target genes belonging to multiple phenotypes can also be determined concurrently. Moreover, it enables the discovery of pathogenic and likely pathogenic germline mutations from population-wide studies.

b) Another approach is to choose the desired phenotypic trait based on its frequency and relevance. NGS is performed on a cohort with specific phenotypes of therapeutic relevance, and highly penetrant genetic targets for the trait of interest are then identified. Although this approach demands initial effort in defining the cohort, it offers a strategic way to determine highly targeted genes and variants that could be used for personalized drug development in pharmacogenomics studies.

Thus, NGS can significantly expedite the identification of individuals who carry mutations in the gene of interest for clinical trials, which in turn helps care providers and pharma companies identify patients who are suitable for specific drugs.
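As a rough illustration of approach (a), the association between carrying a variant in a candidate gene and having a phenotype recorded in the EHR can be tested with a 2x2 contingency table. The sketch below is only an assumption about how such a check might look: the record layout, the gene label GENE_X, and the phenotype label PHENO_1 are hypothetical, and Fisher's exact test is one common choice rather than a method prescribed by this article.

```python
from math import comb

def contingency_table(records, gene, phenotype):
    """Count carriers and non-carriers of a variant in `gene` against `phenotype`.

    Returns (a, b, c, d):
      a = carriers with the phenotype,     b = carriers without it,
      c = non-carriers with the phenotype, d = non-carriers without it.
    """
    a = b = c = d = 0
    for rec in records:
        carrier = gene in rec["variant_genes"]
        affected = phenotype in rec["phenotypes"]
        if carrier and affected:
            a += 1
        elif carrier:
            b += 1
        elif affected:
            c += 1
        else:
            d += 1
    return a, b, c, d

def odds_ratio(a, b, c, d):
    # Undefined when b * c is zero; reported as infinity in this sketch.
    return float("inf") if b * c == 0 else (a * d) / (b * c)

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test p-value for the table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x affected carriers with fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum over all tables that are no more likely than the observed one.
    return sum(p for p in (prob(x) for x in range(lo, hi + 1))
               if p <= p_obs * (1 + 1e-9))

# Hypothetical EHR-linked records: 4 carriers of a GENE_X variant, 4 non-carriers.
records = (
    [{"variant_genes": {"GENE_X"}, "phenotypes": {"PHENO_1"}}] * 3
    + [{"variant_genes": {"GENE_X"}, "phenotypes": set()}] * 1
    + [{"variant_genes": set(), "phenotypes": {"PHENO_1"}}] * 1
    + [{"variant_genes": set(), "phenotypes": set()}] * 3
)

table = contingency_table(records, "GENE_X", "PHENO_1")
print("table:", table)
print("odds ratio:", odds_ratio(*table))
print("p-value:", round(fisher_exact_two_sided(*table), 4))
```

With a cohort this small the test has little power; the value of the EHR-based approach comes from running the same check across many genes and phenotypes in a large population, in which case a multiple-testing correction would also be needed.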

2. NGS in pharmacogenomics

Pharmacogenomics is the study of how an individual responds to a drug and the extent to which that response varies among individuals based on their genomic content. It also explores how genotype-phenotype information can be used in personalized medicine. Biomedical research has reported relevant variants of both somatic and germline origin.

However, it is important to understand that the discovery of novel variants alone does not capture the variability in an individual’s disease management. The realistic application of genomic findings goes beyond variant identification and variation in clinical trials. A comprehensive set of stages for integrating next-generation sequencing into clinical pharmacogenomics is needed to better understand disease management and to envision personalized medicine.

The stages are:

(i) identification of pharmacogenomic gene targets and their validation in controlled studies with independent population cohorts. 

(ii) replication and understanding of the drug-gene(s) association mechanism and demonstration of utility in at-risk patients.

(iii) development of clinical diagnostic tests and their regulatory approval.

(iv) assessment of the clinical impact and cost-effectiveness of the pharmacogenomic gene targets. 

(v) involvement of stakeholders in clinical execution.

Genotyping And Data Analysis In Pharmacogenomics

With the plunging cost of genotyping and sequencing, more and more academic institutions and private organizations are engaging in collaborative programs focused on NGS of disease-case genomes, such as cancer genomes, aiming to describe the architecture of disease-specific alterations and to aid clinicians in disease management. At the beginning of this article, we touched on the various types of NGS methods and their implications in identifying drug targets.

Although every sequencing methodology has its advantages in identifying variants characterized by their phenotypic and clinical variability, in pharmacogenomics, targeted gene sequencing seems to be more relevant than other sequencing methods such as WES. The reason is its ability to capture genetic variants located outside the exons, such as in intronic and untranslated regions, which can lead to a substantial reduction of drug-metabolizing enzyme activity. These rare pharmacovariants are of utmost importance in personalized drug therapy, as they provide information that helps avoid adverse drug reactions and lack of response. Furthermore, compared to Sanger sequencing, NGS yields more accurate quantitative results at a higher throughput.

Variant Data Prioritization And Interpretation

Once the variant data are available, the next step is the prioritization and interpretation of the discovered variants. This is an integral part of pharmacogenomics that is not yet well incorporated in clinical settings, which contributes to the slow uptake of pharmacogenomics in clinical labs. The salient aspects that influence the translation of pharmacogenomics into clinical practice are:

  • interpretation of published variant data results, and
  • interpretation of reported genetic variant results. 

Considering the rate at which pharmacogenomics is implemented in clinical labs, it is evident that, although a majority of researchers and clinicians acknowledge the effect of genetic variants on drug response, only a limited number are adequately informed about how to interpret pharmacogenomics data. The way to close this gap and further the integration of variant data interpretation in labs is to collaborate on and accumulate NGS data, since large sample sizes help maximize the clinical benefit by allowing retrospective analysis of large patient cohorts. This strategy will prove useful in:

  • determining common and rare variants
  • validation
  • diagnosis and decision-making from accumulated variant data
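To make the benefit of pooling concrete, the following minimal sketch combines allele counts from several cohorts and labels each variant as common or rare. The variant IDs, the counts, and the 1% minor allele frequency cutoff are illustrative assumptions, not values taken from this article.

```python
from collections import defaultdict

RARE_MAF_THRESHOLD = 0.01  # assumed cutoff; studies choose different values

def pooled_allele_frequencies(cohorts):
    """Pool allele counts across cohorts.

    Each cohort maps a variant ID to a tuple
    (alternate_allele_count, total_alleles_genotyped).
    """
    alt = defaultdict(int)
    total = defaultdict(int)
    for cohort in cohorts:
        for variant, (allele_count, allele_number) in cohort.items():
            alt[variant] += allele_count
            total[variant] += allele_number
    return {v: alt[v] / total[v] for v in alt if total[v] > 0}

def classify(frequency):
    """Label a variant by its minor allele frequency."""
    maf = min(frequency, 1 - frequency)
    return "rare" if maf < RARE_MAF_THRESHOLD else "common"

# Hypothetical cohorts from two collaborating labs.
cohort_1 = {"var_A": (3, 2000), "var_B": (400, 2000)}
cohort_2 = {"var_A": (1, 1000), "var_B": (150, 1000)}

frequencies = pooled_allele_frequencies([cohort_1, cohort_2])
for variant, freq in sorted(frequencies.items()):
    print(variant, round(freq, 4), classify(freq))
```

A variant seen only a handful of times in one lab's cohort gives a very noisy frequency estimate; pooling the counts is what makes the rare versus common call more stable.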

Efficient assembly, mining, and analysis of the accumulated multi-faceted data are necessary to make clinically significant diagnoses. To facilitate the decision-making process, a collaborative web-based support platform that combines computer algorithms with human expertise can be used. Such platforms can be called “Clinical Decision Support (CDS)” tools. An example of such a tool is Agilent Technologies’ “Alissa Interpret,” a clinical informatics platform for molecular pathology and clinical genetics labs that standardizes and automates variant triage, review, classification, and reporting on clinical NGS and CGH data, and eventually assists in making clinical diagnoses.

Accreditation And Consultation

Applying pharmacogenomics to drug or personalized medicine development requires proper accreditation of genome data quality and NGS assay design because of their vital influence on medicine and biomedical research.

To ensure the quality of medical products and services, regulatory agencies have been set up in many countries. For example, the European Medicines Agency (EMA) has defined regulatory frameworks for:

  • Good Clinical Practice (GCP) compliance 
  • Good Laboratory Practice (GLP) compliance 
  • Good Manufacturing Practice (GMP) 
  • Good Distribution Practice (GDP) 
  • Good Pharmacogenomics Practice (GPP)

The guidelines in these frameworks stress the importance of every step in the NGS protocol, from DNA isolation to variant identification, annotation, and interpretation. For example, the guidelines state that the minimum sequencing coverage for germline pharmacovariants should be 30x, whereas for rare variants it should be higher so that they are also detected.
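For orientation, mean sequencing coverage is commonly estimated as (number of reads × read length) / size of the sequenced target. The sketch below applies that arithmetic to the 30x figure; the 1 Mb panel size and 150 bp read length are hypothetical, and the estimate ignores duplicates, off-target reads, and uneven coverage, all of which raise the number of reads needed in practice.

```python
import math

def mean_coverage(num_reads, read_length, target_size):
    """Average depth: total sequenced bases divided by target size (in bases)."""
    return num_reads * read_length / target_size

def reads_required(depth, read_length, target_size):
    """Minimum number of reads needed to reach `depth` on average."""
    return math.ceil(depth * target_size / read_length)

# Hypothetical targeted panel of 1,000,000 bp sequenced with 150 bp reads.
panel_size = 1_000_000
read_length = 150

needed = reads_required(30, read_length, panel_size)
print("reads for 30x:", needed)
print("resulting mean coverage:", mean_coverage(needed, read_length, panel_size))
```

Because this is an average, individual positions can fall well below 30x, which is one reason a higher target depth is chosen when rare variants must be detected reliably.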

Next comes the consultation, where regulatory compliance is used to eliminate uncertainty about how pharmacogenomics results can be translated into clinical care decisions by government agencies. As per United States Food and Drug Administration (FDA) policy, all relevant information available for a pharmacogenomics product should be made available to ensure that patient care is not compromised.

Consequently, PharmGKB was established. It is a curated database containing information on drug properties, pathway diagrams, and related publications. Users can query for drugs, genes, or diseases to obtain relevant data. In addition to PharmGKB, the Dutch Pharmacogenetics Working Group (DPWG) and the Clinical Pharmacogenetics Implementation Consortium (CPIC) were also established to assist healthcare professionals in interpreting pharmacogenomic test results and in making efficient diagnoses.

The clinical pharmacogenomics workflow can thus be described in the following steps:

  1. NGS or genotyping using WGS, WES, or targeted sequencing performed by accredited laboratories.
  2. Variant discovery in pharmacogenes and other genes involved in or related to drug metabolism.
  3. Pharmacogenomic variant data prioritization, where the identified variant is given a score based on certain criteria such as:
    • Novelty of variant
    • Nature of the variant (frameshift, non-synonymous, synonymous, etc.)
    • Variant’s frequency (common or rare)
    • Existing evidence of the variant in drug metabolism from databases like PharmGKB and CPIC.  
  4. Variant interpretation based on the scientific literature, databases, and algorithms in association with the recommended databases like CPIC and PharmGKB.
  5. Pharmacogenomics consultation with qualified experts to provide appropriate advice on the drug choice to avoid adverse reactions.
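Step 3 of the workflow above can be sketched as a simple additive score. Everything specific in this example is an assumption made for illustration: the weights, the 1% frequency cutoff, the field names, and the variant IDs are hypothetical, and a real pipeline would take its evidence from curated sources such as PharmGKB and CPIC rather than from a boolean flag.

```python
from dataclasses import dataclass

# Hypothetical weights, chosen only to illustrate the idea of scoring.
CONSEQUENCE_WEIGHT = {"frameshift": 3, "non-synonymous": 2, "synonymous": 0}
RARE_FREQUENCY = 0.01  # assumed cutoff between rare and common variants

@dataclass
class Variant:
    variant_id: str
    consequence: str             # nature of the variant
    allele_frequency: float      # variant's frequency
    is_novel: bool               # novelty of the variant
    has_database_evidence: bool  # existing evidence in drug metabolism

def priority_score(v: Variant) -> int:
    """Sum the contributions of the four criteria listed in step 3."""
    score = CONSEQUENCE_WEIGHT.get(v.consequence, 1)  # unknown types: neutral weight
    if v.allele_frequency < RARE_FREQUENCY:
        score += 1
    if v.is_novel:
        score += 1
    if v.has_database_evidence:
        score += 2
    return score

def prioritize(variants):
    """Return variants ordered from highest to lowest priority."""
    return sorted(variants, key=priority_score, reverse=True)

variants = [
    Variant("var_1", "synonymous", 0.20, False, False),
    Variant("var_2", "frameshift", 0.001, False, True),
    Variant("var_3", "non-synonymous", 0.002, True, False),
]

for v in prioritize(variants):
    print(v.variant_id, priority_score(v))
```

The ranking only orders variants for review; interpretation (step 4) and expert consultation (step 5) still decide what, if anything, is reported.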

Proper implementation of the aforementioned workflow for next-generation sequencing-based pharmacogenomic testing and validation is only possible with the synergy of stakeholders in accepting and implementing the current technological advances in the field of NGS. 

Figure 1: An overview of different drug development stages.

Having covered the pharmacogenomics workflow, let’s move on to the application areas of pharmacogenomics and NGS in clinical trials, which use strategic approaches to understand the efficacy of drugs.

A special application area of pharmacogenomics is:

1) Companion diagnostics

These diagnostic tests are used to assess the expected effectiveness of a drug treatment before the drug is prescribed. In other words, these tests are the “companions” of specific drugs and help understand the drug’s expected efficacy. The FDA publishes a list of cleared or approved companion diagnostic devices, and NGS assays are among the most frequently approved devices on it.

Next come the clinical trial designs that use these diagnostic tests and strategies to further understand drug therapies:

1) Basket/bucket clinical trials

In this type of clinical trial, diagnostic tests are performed on patients who have different types of cancer that share the same mutation or biomarker but show distinct clinical phenotypes. These patients receive the same drug treatment, targeting the specific mutation diagnosed as causing the cancer. Because all patients are placed under the same treatment, this trial design is called a “basket” clinical trial. These clinical trials can also be used to study rare genetic aberrations.
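The enrollment logic of a basket trial can be pictured with a few lines of code. This is only a toy sketch: the patient records, tumor types, and biomarker labels (MUT_A, MUT_B) are hypothetical.

```python
from collections import Counter

def enroll_basket(patients, biomarker):
    """Select every patient carrying `biomarker`, regardless of tumor type."""
    return [p for p in patients if biomarker in p["biomarkers"]]

def tumor_type_breakdown(basket):
    """Count how many enrolled patients come from each tumor type."""
    return Counter(p["tumor_type"] for p in basket)

# Hypothetical patients whose tumors were profiled with an NGS assay.
patients = [
    {"id": "P1", "tumor_type": "lung", "biomarkers": {"MUT_A"}},
    {"id": "P2", "tumor_type": "colorectal", "biomarkers": {"MUT_A", "MUT_B"}},
    {"id": "P3", "tumor_type": "lung", "biomarkers": {"MUT_B"}},
    {"id": "P4", "tumor_type": "thyroid", "biomarkers": {"MUT_A"}},
]

basket = enroll_basket(patients, "MUT_A")
print([p["id"] for p in basket])
print(dict(tumor_type_breakdown(basket)))
```

All patients in the basket receive the same drug, while the breakdown by tumor type is kept so that response can later be compared across the distinct clinical phenotypes.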

2) Genetically stratified clinical trials

These clinical trials include only patients bearing a specific type of mutation that makes them more likely to respond to the tested drug. These mutations could be in the same gene or in different genes. An advantage of this method is that it improves the power of trials with a fixed sample size, since it recruits only the subset of patients whose genotypes make them most likely to respond to the drug treatment.

A simple workflow of stratified clinical trials can be described as follows:

Stratification is the process of dividing participants into smaller subgroups. It is used to ensure equal allocation of subgroups to each experimental condition based on age, gender, or other demographic factors. Randomization is the process of randomly assigning participants to separate groups that receive different treatments. In such a study, there are usually two groups: the investigational group and the control group. The investigational group receives the new treatment, whereas the control group receives the standard therapy. At the end of the clinical trial, a comparison is made to examine the efficacy of the treatments. Randomization helps to prevent bias.
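The two steps described above can be combined into stratified randomization: participants are first grouped by stratum and then randomly split between the arms within each stratum. The sketch below is a simplified illustration under stated assumptions; the participant fields, the stratification factors, and the shuffle-then-alternate scheme are hypothetical choices, and real trials often use permuted-block schemes managed by dedicated software.

```python
import random
from collections import defaultdict

def stratified_randomize(participants, strata_keys,
                         arms=("investigational", "control"), seed=0):
    """Randomly assign participants to arms, balancing the arms within each stratum."""
    rng = random.Random(seed)  # fixed seed so the allocation is reproducible
    strata = defaultdict(list)
    for p in participants:
        strata[tuple(p[k] for k in strata_keys)].append(p)

    assignment = {}
    for members in strata.values():
        shuffled = members[:]
        rng.shuffle(shuffled)  # random order within the stratum
        for i, p in enumerate(shuffled):
            # Alternating over the shuffled order keeps arm sizes within one
            # participant of each other in every stratum.
            assignment[p["id"]] = arms[i % len(arms)]
    return assignment

# Hypothetical participants stratified by sex and age group.
participants = [
    {"id": f"S{i}",
     "sex": "F" if i % 2 else "M",
     "age_group": "<50" if i % 3 else ">=50"}
    for i in range(12)
]

assignment = stratified_randomize(participants, ("sex", "age_group"), seed=42)
for p in participants:
    print(p["id"], p["sex"], p["age_group"], assignment[p["id"]])
```

Randomizing within strata, rather than across the whole cohort at once, prevents an arm from ending up with, for example, mostly older participants purely by chance.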

Concluding Remarks

The goal of this article is to give a perspective on how NGS can be used in pharmacogenomics to better interpret and diagnose the variants, mutations, and biomarkers identified from patients’ data. Using the technological advances in NGS and the extensive patient data available in EHR systems to determine pharmacogenes could help in conducting efficient clinical trials that improve drug treatment efficacy. In the coming years, with the continuous increase in available sequencing data and the growing focus on pharmacogenes, we will surely see newly developed and marketed drug compounds facilitating personalized drug treatments.

Deepak Kumar, PhD

Deepak Kumar is a Genomics Software Application Engineer (Bioinformatics) at Agilent Technologies. He is the founder of the Expert Sequencing Program (ExSeq) at Cheeky Scientist. The ExSeq program provides a holistic understanding of the Next Generation Sequencing (NGS) field, its intricate concepts, and the computational analysis of sequencing data. He has diverse professional experience in bioinformatics and computational biology and is always keen on formulating computational solutions to biological problems.
