Shannon Human Splicing Pipeline
The splicing mutation pipeline plugin finds and interprets genomic variants that alter mRNA splicing in human exome, targeted sequencing, or complete genome data.
Cytognomix genome-scale human splicing mutation analysis.
Shannon pipeline from Cytognomix was initially created to address the vexing problem of assessing the many variants of unknown significance that are detected in cancer genetic testing. The Shannon human splicing mutation plug-in uses a proven approach to predict mRNA splicing abnormalities based on single nucleotide variant data from human sequencing. The Cytognomix plugin can be deployed on CLC Genomics workbench either as a standalone application or to CLC Genomics server using the Genomics Workbench client.
The Cytognomix Shannon plugin identifies splicing mutations with industry-leading sensitivity and specificity. It is based on a legacy of peer-reviewed literature confirming validated mutation predictions using our patented information theory based technology. Changes in information content in splice sites correspond to actual differences in binding affinity. Input can either be CLC bio SNP objects, in standard VCF, or an indexed simple SNP format. Results are based on sensible default parameters, are sortable and intuitive, and exportable to spreadsheets or genome browsers.
The Shannon pipeline has been used to reanalyze the Breast Cancer Information Core identifying many splicing mutations, most of which were previously unrecognized (Mucaki et al. 2011).
Learn more about the Shannon pipeline plugin in this video from Cytognomix:
Performance of Shannon pipeline plugin on complete genome sequence data
|Number of variants||Complete analysis time|
|211,049||1h 12 m|
|290,589||1h 22 m|
|314,637||1h 27 m|
The pipeline analyzes an average of 3198 variants/min on an I7-based server.
Versioning and Requirements
The Cytognomix Shannon human mRNA splicing plugin runs in standalone mode on CLC Genomics Workbench V5.5 or with both the Workbench and CLC Genomics Server V.4.5 (as a standalone server or running Gridworks). Released for Linux and MacOSX Operating systems supporting Perl and gcc. Installation has been verified with Perl v.5.8.8 and 5.10.1 and gcc v.4.1.2 and v.4.4.3 with the Ubuntu 2.6.32-27 (32 and 64 bit), CentOS 2.6.18-238 (64 bit), and Fedora 16 (32 bit) kernels, and MacOSX (Lion release version 10.7.4; gcc v.4.2.1 and Perl 5.12.3) on hardware equipped with an Intel I7 processor and at least 4Gb RAM.
- Hones in on a limited number of variants that potentially affect mRNA splicing
- Variants categorized by whether they fully or partially inactivate natural splice sites or activate cryptic sites
- Predicts variants missed by other bioinformatic methods
- Changes displayed graphically as Manhattan-like plots or custom tracks
- Mutations are sortable according to information content, proximity to natural splice site, relative strength of cryptic vs natural site, gene, and coordinate
- Variants affecting SNPs are annotated and can be filtered by allele frequency
- Rapid results generated from complete genome-wide, variant analysis of thousands of SNPs