[HTML][HTML] ASEQ: fast allele-specific studies from next-generation sequencing data

A Romanel, S Lago, D Prandi, A Sboner… - BMC medical …, 2015 - Springer
BMC medical genomics, 2015Springer
Background Single base level information from next-generation sequencing (NGS) allows
for the quantitative assessment of biological phenomena such as mosaicism or allele-
specific features in healthy and diseased cells. Such studies often present with
computationally challenging burdens that hinder genome-wide investigations across large
datasets that are now becoming available through the 1,000 Genomes Project and The
Cancer Genome Atlas (TCGA) initiatives. Results We present ASEQ, a tool to perform gene …
Background
Single base level information from next-generation sequencing (NGS) allows for the quantitative assessment of biological phenomena such as mosaicism or allele-specific features in healthy and diseased cells. Such studies often present with computationally challenging burdens that hinder genome-wide investigations across large datasets that are now becoming available through the 1,000 Genomes Project and The Cancer Genome Atlas (TCGA) initiatives.
Results
We present ASEQ, a tool to perform gene-level allele-specific expression (ASE) analysis from paired genomic and transcriptomic NGS data without requiring paternal and maternal genome data. ASEQ offers an easy-to-use set of modes that transparently to the user takes full advantage of a built-in fast computational engine. We report its performances on a set of 20 individuals from the 1,000 Genomes Project and show its detection power on imprinted genes. Next we demonstrate high level of ASE calls concordance when comparing it to AlleleSeq and MBASED tools. Finally, using a prostate cancer dataset we report on a higher fraction of ASE genes with respect to healthy individuals and show allele-specific events nominated by ASEQ in genes that are implicated in the disease.
Conclusions
ASEQ can be used to rapidly and reliably screen large NGS datasets for the identification of allele specific features. It can be integrated in any NGS pipeline and runs on computer systems with multiple CPUs, CPUs with multiple cores or across clusters of machines.
Springer