AnnotSV is a program designed for annotating Structural Variations (SV). This tool compiles functionally, regulatorily and clinically relevant information and aims to provide annotations useful for i) interpreting the potential pathogenicity of SV and ii) filtering out potential false-positive SV.
Different types of SV exist, including deletions, duplications, insertions, inversions, translocations and more complex rearrangements. They can be either balanced or unbalanced. When unbalanced and resulting in a gain or loss of material, they are called Copy Number Variations (CNV). CNV can be described by coordinates on one chromosome, with the start and end positions of the SV (deletions, insertions, duplications). Complex rearrangements with breakends can arbitrarily be summarized as a set of novel adjacencies, as described in the Variant Call Format Specification VCFv4.3 (Jul 2017).
AnnotSV takes as input a standard BED or VCF file describing the SV coordinates. The output file contains the overlaps of SV with relevant genomic features, where the genes refer to NCBI RefSeq genes. In addition to the gene annotations, numerous additional relevant annotations are provided (OMIM, DGV frequencies, promoters, DECIPHER, GC content, compound heterozygosity, …). All annotations are detailed here.
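To illustrate what "overlap of an SV with a genomic feature" means in terms of coordinates, here is a minimal Python sketch. It is not AnnotSV's implementation: the `Interval` class, the helper functions and the gene names are hypothetical, and BED-style coordinates (0-based start, exclusive end) are assumed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Interval:
    """A genomic interval; BED-style coordinates are assumed (0-based, end exclusive)."""
    chrom: str
    start: int
    end: int
    name: str = ""


def overlap_length(a: Interval, b: Interval) -> int:
    """Number of bases shared by two intervals (0 if on different chromosomes)."""
    if a.chrom != b.chrom:
        return 0
    return max(0, min(a.end, b.end) - max(a.start, b.start))


def overlapping_genes(sv: Interval, genes: list) -> list:
    """Names of the genes that share at least one base with the SV."""
    return [g.name for g in genes if overlap_length(sv, g) > 0]


# Hypothetical data, for illustration only.
sv = Interval("1", 1000, 5000, "DEL")
genes = [
    Interval("1", 500, 1500, "GENE_A"),   # partially overlaps the SV
    Interval("1", 6000, 7000, "GENE_B"),  # same chromosome, no overlap
    Interval("2", 1000, 5000, "GENE_C"),  # same coordinates, other chromosome
]
print(overlapping_genes(sv, genes))
```

A real annotation tool would typically use an indexed structure (for example an interval tree) rather than a linear scan; the scan is kept here for readability.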
This program is described in detail in the README file.
Thank you for taking the time to use AnnotSV; your feedback is greatly appreciated.
AnnotSV supports both the VCF (Variant Call Format) and the commonly used BED (Browser Extensible Data) input formats to describe the SV to annotate. This allows the program to be easily integrated into any bioinformatics pipeline dedicated to NGS analysis.
Given an SV BED file, AnnotSV produces a tab-separated values file which can be easily integrated into bioinformatics pipelines or read directly in a spreadsheet program.
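As a rough sketch of how such files can be handled in a pipeline, the Python snippet below builds a BED line for one SV and parses a tab-separated result with the standard `csv` module. The column names and values in `sample_output` are invented for illustration and are not AnnotSV's actual header; the helper names `to_bed_line` and `read_tsv` are also hypothetical.

```python
import csv
import io


def to_bed_line(chrom: str, start: int, end: int, sv_type: str) -> str:
    """Format one SV as a tab-separated BED line (chrom, start, end, extra field)."""
    return "\t".join([chrom, str(start), str(end), sv_type])


def read_tsv(handle) -> list:
    """Read a tab-separated file with a header row into a list of dicts."""
    return list(csv.DictReader(handle, delimiter="\t"))


# One SV written as BED input (hypothetical coordinates).
bed_line = to_bed_line("1", 1000, 5000, "DEL")

# Made-up tab-separated content standing in for an annotated output file.
sample_output = (
    "SV_chrom\tSV_start\tSV_end\tSV_type\n"
    "1\t1000\t5000\tDEL\n"
)
rows = read_tsv(io.StringIO(sample_output))
for row in rows:
    print(row["SV_chrom"], row["SV_start"], row["SV_end"], row["SV_type"])
```

With a real file, `io.StringIO(sample_output)` would be replaced by `open(path, newline="")`, and the column names would be taken from the header that the tool actually writes.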
There are 2 types of annotations with AnnotSV:
- An annotation on the “full” length of the SV. Every SV is reported, even if it does not cover a gene. This type of annotation gives an estimate of the SV event itself.
- An annotation of the SV “split” by gene. This type of annotation gives an estimate of the gene composition of the corresponding SV and is meant to analyse the consequences more deeply. Thus, in some cases, when an SV spans several genes, the output will contain as many annotation lines as covered genes (cf. example in FAQ). This latter annotation is extremely powerful for speeding up the identification of mutations implicating a specific gene.
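The relationship between the two annotation types can be sketched in a few lines of Python. This is only an illustration of the row layout described above, under the assumption that each SV yields one “full” row plus one “split” row per covered gene; the function name, dictionary keys and gene names are hypothetical and do not correspond to AnnotSV's actual output columns.

```python
def annotation_rows(sv_id: str, gene_names: list) -> list:
    """Build one 'full' row for the SV, then one 'split' row per covered gene."""
    # The full row is always emitted, even when no gene is covered.
    rows = [{"sv_id": sv_id, "mode": "full", "genes": ";".join(gene_names)}]
    # One split row per gene, so each gene can be inspected on its own line.
    rows += [{"sv_id": sv_id, "mode": "split", "genes": g} for g in gene_names]
    return rows


# An SV covering two (hypothetical) genes gives three rows: 1 full + 2 split.
multi = annotation_rows("sv1", ["GENE_A", "GENE_B"])

# An SV covering no gene is still reported, with a single full row.
intergenic = annotation_rows("sv2", [])

for row in multi + intergenic:
    print(row["sv_id"], row["mode"], row["genes"])
```

Filtering on the `mode` key then gives either a per-event view (full rows) or a per-gene view (split rows) of the same call set.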
The annotation columns available in the output file are detailed here and in the README file.