Mulan: multiple-sequence local alignment and visualization for studying function and evolution

I Ovcharenko, GG Loots, BM Giardine, M Hou… - Genome …, 2005 - genome.cshlp.org
Genome research, 2005genome.cshlp.org
Multiple-sequence alignment analysis is a powerful approach for understanding
phylogenetic relationships, annotating genes, and detecting functional regulatory elements.
With a growing number of partly or fully sequenced vertebrate genomes, effective tools for
performing multiple comparisons are required to accurately and efficiently assist biological
discoveries. Here we introduce Mulan (http://mulan. dcode. org/), a novel method and a
network server for comparing multiple draft and finished-quality sequences to identify …
Multiple-sequence alignment analysis is a powerful approach for understanding phylogenetic relationships, annotating genes, and detecting functional regulatory elements. With a growing number of partly or fully sequenced vertebrate genomes, effective tools for performing multiple comparisons are required to accurately and efficiently assist biological discoveries. Here we introduce Mulan (http://mulan.dcode.org/), a novel method and a network server for comparing multiple draft and finished-quality sequences to identify functional elements conserved over evolutionary time. Mulan brings together several novel algorithms: the TBA multi-aligner program for rapid identification of local sequence conservation, and the multiTF program for detecting evolutionarily conserved transcription factor binding sites in multiple alignments. In addition, Mulan supports two-way communication with the GALA database; alignments of multiple species dynamically generated in GALA can be viewed in Mulan, and conserved transcription factor binding sites identified with Mulan/multiTF can be integrated and overlaid with extensive genome annotation data using GALA. Local multiple alignments computed by Mulan ensure reliable representation of short- and large-scale genomic rearrangements in distant organisms. Mulan allows for interactive modification of critical conservation parameters to differentially predict conserved regions in comparisons of both closely and distantly related species. We illustrate the uses and applications of the Mulan tool through multispecies comparisons of the GATA3 gene locus and the identification of elements that are conserved in a different way in avians than in other genomes, allowing speculation on the evolution of birds. Source code for the aligners and the aligner-evaluation software can be freely downloaded from http://www.bx.psu.edu/miller_lab/.
genome.cshlp.org