The Ensembl Variant Effect Predictor is a powerful toolset for the analysis, annotation, and prioritization of genomic variants in coding and non-coding regions. It provides access to an extensive collection of genomic annotation, with a variety of interfaces to suit different requirements, and simple options for configuring and extending analysis. It is open source, free to use, and supports full reproducibility of results. The Ensembl Variant Effect Predictor can simplify and accelerate variant interpretation in a wide range of study designs. It determines the effect of our variants (SNPs, insertions, deletions, CNVs or structural variants) on genes, transcripts, and protein sequence, as well as regulatory regions.
Analysis of variant data resulting from genome or exone sequencing is necessary for progress in biology, from basic research to translational genomics in the clinic. It is key for investigating function and for progressing from a system of medical care based on standardized treatment to one targeted to the individual patient.
Uses
The VEP has been used for analysis of traits in farm animals
for patient diagnosis in the clinic
for research on GWAS
used for analysis in many large-scale projects, including the 1000 Genomes and Exome Aggregation Consortium (ExAC)
Find out the genes and transcripts affected by the variants
Know the location of the variants (e.g. upstream of a transcript, in coding sequence, in non-coding RNA, in regulatory regions)
Consequence of our variants on the protein sequence (e.g. stop gained, missense, stop lost, frameshift)
Known variants that match ours, and associated minor allele frequencies from the 1000 Genomes Project
VEP’s annotations are used as input to tools for deep exploration of variant annotation such as GEMINI.
The VEP annotates two broad categories of genomic variant;
sequence variants with specific and well-defined changes
Larger structural variants (greater than 50 nucleotides in length), including those with changes in copy number or insertions and deletions of DNA.
We can input coordinates of any alleles we have identified or variant identifiers. A full list of input files is available. If a variant that we enter as input causes a change in the protein sequence, the VEP will calculate the possible amino acids at that position and the variant would be given a consequence type of missense. Download our results in multiple data formats, easily share our results with others, and integrate our variation data with the powerful Ensembl web browser.
We have three options for uploading our data:
File upload
Paste file
File URL - point the VEP to a file hosted on a publically accessible adderess
The summary panel on the VEP results page gives a brief overview of the VEP job, with some basic statistics about the results.
Statistics include;
Variants processed
Variants remaining after filtering
Novel / known variants
Number of overlapped genes, transcripts and regulatory features
Pie charts are shown detailing the proportion of consequence types called across all variants in the results