Integration of Computers    
   
Table of contents
(Prev) List of semiconductor scale ex ...List of SIP software (Next)

List of sequence alignment software

This list of sequence alignment software is a compilation of software tools and web portals used in pairwise sequence alignment and multiple sequence alignment. See structural alignment software for structural alignment of proteins.

Contents

Database search only

NameDescriptionSequence Type*LinkAuthorsYear
BLASTlocal search with fast k-tuple heuristic (Basic Local Alignment Search Tool)BothNCBI EBI DDBJ DDBJ (psi-blast) GenomeNet PIR (protein only)Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ[1]1990
CS-BLASTsequence-context specific BLAST, more sensitive than BLAST, FASTA, and SSEARCH. Position-specific iterative version CSI-BLAST more sensitive than PSI-BLASTProteinCS-BLAST server downloadAngermueller C, Biegert A, Soeding J[2]2013
CUDASW++GPU accelerated Smith Waterman algorithm for multiple shared-host GPUsProteinhomepage publicationLiu Y, Maskell DL and Schmidt B2009/2010
FASTAlocal search with fast k-tuple heuristic, slower but more sensitive than BLASTBothEBI DDBJ GenomeNet PIR (protein only)  
GGSEARCH / GLSEARCHGlobal:Global (GG), Global:Local (GL) alignment with statisticsProteinFASTA server  
HMMERlocal and global search with profile Hidden Markov models, more sensitive than PSI-BLASTBothdownloadDurbin R, Eddy SR, Krogh A, Mitchison G[3]1998
HHpred / HHsearchpairwise comparison of profile Hidden Markov models; very sensitive, but can only search alignment databases (Pfam, PDB, InterPro...)Proteinserver downloadSöding J[4]2005
IDFInverse Document FrequencyBothdownload  
Infernalprofile SCFG searchRNAdownloadEddy S 
PSI-BLASTposition-specific iterative BLAST, local search with position-specific scoring matrices, much more sensitive than BLASTProteinNCBI PSI-BLASTAltschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ[5]1997
PSI-SearchCombining the Smith-Waterman search algorithm with the PSI-BLAST profile construction strategy to find distantly related protein sequences, and preventing homologous over-extension errors.ProteinEBI PSI-SearchLi W, McWilliam H, Goujon M, Cowley A, Lopez R, Pearson WR[6]2012
ScalaBLASTHighly parallel Scalable BLASTBothScalaBLASTOehmen et al.[7]2011
SequilabLinking and profiling sequence alignment data from NCBI-BLAST results with major sequence analysis servers/servicesNucleotide/peptideserver 2010
SAMlocal and global search with profile Hidden Markov models, more sensitive than PSI-BLASTBothSAMKarplus K, Krogh A[8]1999
SSEARCHSmith-Waterman search, slower but more sensitive than FASTABothEBI DDBJ  
*Sequence Type: Protein or nucleotide

Pairwise alignment

NameDescriptionSequence Type*Alignment Type**LinkAuthorYear
ACANAfast heuristic anchor based pairwise alignmentBothBothdownloadHuang, Umbach, Li2005
AlignMeAlignments for membrane protein sequencesProteinBothdownload,serverM. Stamm, K. Khafizov, R. Staritzbichler, L.R. Forrest2013
Bioconductor Biostrings::pairwiseAlignmentDynamic programmingBothBoth + Ends-freesiteP. Aboyoun2008
BioPerl dpAlignDynamic programmingBothBoth + Ends-freesiteY. M. Chan2003
BLASTZ,LASTZSeeded pattern-matchingNucleotideLocaldownload,downloadSchwartz et al.2004,2009
DNADotWeb-based dot-plot toolNucleotideGlobalserverR. Bowen1998
DOTLETJava-based dot-plot toolBothGlobalappletM. Pagni and T. Junier1998
FEASTPosterior based local extension with descriptive evolution modelNucleotideLocalsiteA. K. Hudek and D. G. Brown2010
G-PASGPU-based dynamic programming with backtrackingBothLocal, SemiGlobal, Globalsite+downloadW. Frohmberg, M. Kierzynka et al.2011
GapMisGapMis is a tool for pairwise sequence alignment with a single gapBothSemiGlobalsiteK. Frousios, T. Flouri, C. S. Iliopoulos, K. Park, S. P. Pissis, G. Tischler2012
GGSEARCH, GLSEARCHGlobal:Global (GG), Global:Local (GL) alignment with statisticsProteinGlobal in queryFASTA serverW. Pearson2007
JAlignerOpen source Java implementation of Smith-WatermanBothLocalJWSA. Moustafa2005
K*SyncProtein sequence to structure alignment that includes secondary structure, structural conservation, structure-derived sequence profiles, and consensus alignment scoresProteinBothRobetta serverD. Chivian & D. Baker2003
LALIGNMultiple, non-overlapping, local similarity (same algorithm as SIM)BothLocal non-overlappingserver FASTA serverW. Pearson1991 (algorithm)
mAlignmodelling alignment; models the information content of the sequencesNucleotideBothdoc codeD. Powell, L. Allison and T. I. Dix2004
matcherWaterman-Eggert local alignment (based on LALIGN)BothLocalPasteurI. Longden (modified from W. Pearson)1999
MCALIGN2explicit models of indel evolutionDNAGlobalserverJ. Wang et al.2006
MUMmersuffix tree basedNucleotideGlobaldownloadS. Kurtz et al.2004
needleNeedleman-Wunsch dynamic programmingBothSemiGlobalEBIPasteurA. Bleasby1999
Ngilalogarithmic and affine gap costs and explicit models of indel evolutionBothGlobaldownloadR. Cartwright2007
PathSmith-Waterman on protein back-translation graph (detects frameshifts at protein level)ProteinLocalserver downloadM. Gîrdea et al.2009
PatternHunterSeeded pattern-matchingNucleotideLocaldownloadB. Ma et al.2002–2004
ProbA (also propA)Stochastic partition function sampling via dynamic programmingBothGlobaldownloadU. Mückstein2002
PyMOL"align" command aligns sequence & applies it to structureProteinGlobal (by selection)siteW. L. DeLano2007
REPutersuffix tree basedNucleotideLocaldownloadS. Kurtz et al.2001
SABERTOOTHAlignment using predicted Connectivity ProfilesProteinGlobaldownload on requestF. Teichert, J. Minning, U. Bastolla, and M. Porto2009
SatsumaParallel whole-genome synteny alignmentsDNALocaldownloadM.G. Grabherr et al.2010
SEQALNVarious dynamic programmingBothLocal or GlobalserverM.S. Waterman and P. Hardy1996
SIM, GAP, NAP, LAPLocal similarity with varying gap treatmentsBothLocal or globalserverX. Huang and W. Miller1990-6
SIMLocal similarityBothLocalserversX. Huang and W. Miller1991
SPA: Super pairwise alignmentFast pairwise global alignmentNucleotideGlobalavailable upon requestShen, Yang, Yao, Hwang2002
SSEARCHLocal (Smith-Waterman) alignment with statisticsProteinLocalEBI FASTA serverW. Pearson1981 (Algorithm)
Sequences StudioJava applet demonstrating various algorithms from [9]Generic sequenceLocal and globalcode appletA.Meskauskas1997 (reference book)
SWIFT suitFast Local Alignment SearchingDNALocalsiteK. Rasmussen, W. Gerlach2005,2008
stretcherMemory-optimized Needleman-Wunsch dynamic programmingBothGlobalPasteurI. Longden (modified from G. Myers and W. Miller)1999
tranalignAligns nucleic acid sequences given a protein alignmentNucleotideNAPasteurG. Williams (modified from B. Pearson)2002
UGENEOpensource Smith-Waterman for SSE/CUDA, Suffix array based repeats finder & dotplotBothBothUGENE siteUniPro2010
waterSmith-Waterman dynamic programmingBothLocalEBIPasteurA. Bleasby1999
wordmatchk-tuple pairwise matchBothNAPasteurI. Longden1998
YASSSeeded pattern-matchingNucleotideLocalserver downloadL. Noe and G. Kucherov2003–2007
NW-alignStandard Needleman-Wunsch dynamic programming algorithmProteinGlobalserver and downloadY Zhang2012
*Sequence Type: Protein or nucleotide. **Alignment Type: Local or global

Multiple sequence alignment

NameDescriptionSequence Type*Alignment Type**LinkAuthorYearLicense
ABAA-Bruijn alignmentProteinGlobaldownloadB.Raphaelet al.2004Proprietary, without charge for educational, research and non profit.
ALEmanual alignment  some software assistanceNucleotidesLocaldownloadJ. Blandy and K. Fogel1994 (latest version 2007)GPL2
AMAPSequence annealingBothGlobalserverA. Schwartz and L. Pachter2006 
anon.fast, optimal alignment of three sequences using linear gap costsNucleotidesGlobalpaper softwareD. Powell, L. Allison and T. I. Dix2000
BAli-PhyTree+Multi alignment  Probabilistic/Bayesian  Joint EstimationBothGlobalWWW+downloadBD Redelings and MA Suchard2005 (latest version 2010)
Base-By-BaseJava-based multiple sequence alignment editor with integrated analysis toolsBothLocal or GlobaldownloadR. Brodie et al.2004Free, requires registration.
CHAOS/DIALIGNIterative alignmentBothLocal (preferred)serverM. Brudno and B. Morgenstern2003
ClustalWProgressive alignmentBothLocal or Globaldownload EBI DDBJ PBIL EMBNet GenomeNetThompson et al.1994GNU Lesser GPL
CodonCode AlignerMulti alignment; ClustalW & Phrap supportNucleotidesLocal or GlobaldownloadP. Richterich et al.2003 (latest version 2009)
CompassCOmparison of Multiple Protein sequence Alignments with assessment of Statistical SignificanceProteinGlobaldownload and serverR.I. Sadreyev, et al.2009
DIALIGN-TX and DIALIGN-TSegment-based methodBothLocal (preferred) or Globaldownload and serverA.R.Subramanian2005 (latest version 2008)
DNA AlignmentSegment-based method for intraspecific alignmentsBothLocal (preferred) or GlobalserverA.Roehl2005 (latest version 2008)
DNA Baser Sequence AssemblerMulti alignment; Automatic batch alignmentNucleotidesLocal or Globalwww.DnaBaser.comHeracle BioSoft2012
FSASequence annealingBothGlobaldownload and serverR. K. Bradley et al.2008
GeneiousProgressive/Iterative alignment; ClustalW pluginBothLocal or GlobaldownloadA.J. Drummond et al.2005 (latest version 2009)
KalignProgressive alignmentBothGlobalserverEBI MPItoolkitT. Lassmann2005
MAFFTProgressive/iterative alignmentBothLocal or GlobalGenomeNet MAFFTK. Katoh et al.2005
MARNAMultiple Alignment of RNAsRNALocalserver downloadS. Siebert et al.2005
MAVIDProgressive alignmentBothGlobalserverN. Bray and L. Pachter2004
MSADynamic programmingBothLocal or GlobaldownloadD.J. Lipman et al.1989 (modified 1995)
MSAProbsDynamic programmingProteinGlobaldownloadY. Liu, B. Schmidt, D. Maskell2010
MULTALINDynamic programming/clusteringBothLocal or GlobalserverF. Corpet1988
Multi-LAGANProgressive dynamic programming alignmentBothGlobalserverM. Brudno et al.2003
MUSCLEProgressive/iterative alignmentBothLocal or GlobalserverR. Edgar2004
OpalProgressive/iterative alignmentBothLocal or GlobaldownloadT. Wheeler and J. Kececioglu2007
PecanProbabilistic/consistencyDNAGlobaldownloadB. Paten et al.2008
PhyloA human computing framework for comparative genomics to solve multiple alignmentNucleotidesLocal or GlobalsiteMcGill Bioinformatics2010
PralineProgressive/iterative/consistency/hom ology-extended alignment with pre-profiling and secondary structure predictionProteinGlobalserverJ. Heringa1999 (latest version 2009)
POAPartial order/hidden Markov modelProteinLocal or GlobaldownloadC. Lee2002
ProbalignProbabilistic/consistency with partition function probabilitiesProteinGlobalserverRoshan and Livesay2006
ProbConsProbabilistic/consistencyProteinLocal or GlobalserverC. Do et al.2005
PROMALS3DProgressive alignment/hidden Markov model/Secondary structure/3D structureProteinGlobalserverJ. Pei et al.2008
PRRN/PRRPIterative alignment (especially refinement)ProteinLocal or GlobalPRRP PRRNY. Totoki (based on O. Gotoh)1991 and later
PSAlignAlignment preserving non-heuristicBothLocal or GlobaldownloadS.H. Sze, Y. Lu, Q. Yang.2006
RevTransCombines DNA and Protein alignment, by back translating the protein alignment to DNA.DNA/Protein (special)Local or GlobalserverWernersson and Pedersen2003 (newest version 2005)
SAGASequence alignment by genetic algorithmProteinLocal or GlobaldownloadC. Notredame et al.1996 (new version 1998)
SAMHidden Markov modelProteinLocal or GlobalserverA. Krogh et al.1994 (most recent version 2002)
Se-AlManual alignmentBothLocaldownloadA. Rambaut2002
StatAlignBayesian co-estimation of alignment and phylogeny (MCMC)BothGlobaldownloadA. Novak et al.2008
StemlocMultiple alignment and secondary structure predictionRNALocal or GlobaldownloadI. Holmes2005GPLv3 (parte de DART)
T-CoffeeMore sensitive progressive alignmentBothLocal or Globalserver downloadC. Notredame et al.2000 (newest version 2008)GPL2
UGENESupports multiple alignment with MUSCLE, KAlign, Clustal and MAFFT pluginsBothLocal or GlobaldownloadUGENE team2010 (newest version 2012)GPL2
*Sequence Type: Protein or nucleotide. **Alignment Type: Local or global

Genomics analysis

NameDescriptionSequence Type*Link
ACT (Artemis Comparison Tool)Synteny and comparative genomicsNucleotideserver
AVIDPairwise global alignment with whole genomesNucleotideserver
BLATAlignment of cDNA sequences to a genome.Nucleotide 
GMAPAlignment of cDNA sequences to a genome. Identifies splice site junctions with high accuracy.Nucleotidehttp://research-pub.gene.com/gmap
SplignAlignment of cDNA sequences to a genome. Identifies splice site junctions with high accuracy. Able to recognize and separate gene duplications.Nucleotidehttp://www.ncbi.nlm.nih.gov/sutils/sp lign
MauveMultiple alignment of rearranged genomes (also available inside Geneious)Nucleotidedownload
MGAMultiple Genome AlignerNucleotidedownload
MulanLocal multiple alignments of genome-length sequencesNucleotideserver
MultizMultiple alignment of genomesNucleotidedownload
PLAST-ncRNASearch for ncRNAs in genomes by partition function local alignmentNucleotideserver
SequeromeProfiling sequence alignment data with major servers/servicesNucleotide/peptideserver
SequilabProfiling sequence alignment data from NCBI-BLAST results with major servers/servicesNucleotide/peptideserver
Shuffle-LAGANPairwise glocal alignment of completed genome regionsNucleotideserver
SIBsim4 / Sim4A program designed to align an expressed DNA sequence with a genomic sequence, allowing for intronsNucleotidedownload
SLAMGene finding, alignment, annotation (human-mouse homology identification)Nucleotideserver
*Sequence Type: Protein or nucleotide



Motif finding

NameDescriptionSequence Type*Link
PMSMotif search and discoveryBothserver server
FMMMotif search and discovery (can get also positive & negative sequences as input for enriched motif search)Nucleotideserver
BLOCKSUngapped motif identification from BLOCKS databaseBothserver
eMOTIFExtraction and identification of shorter motifsBothservers
Gibbs motif samplerStochastic motif extraction by statistical likelihoodBothserver server
HMMTOPPrediction of transmembrane helices and topology of proteinsProteinhomepage & download
I-sitesLocal structure motif libraryProteinserver
JCoilsPrediction of Coiled coil and Leucine ZipperProteinhomepage & download
MEME/MASTMotif discovery and searchBothserver
CUDA-MEMEGPU accelerated MEME (v4.4.0) algorithm for GPU clustersBothhomepage
MERCIDiscriminative motif discovery and searchBothhomepage & download
PHI-BlastMotif search and alignment toolBothPasteur
PhyloscanMotif search toolNucleotideserver
PRATTPattern generation for use with ScanPrositeProteinserver
ScanPrositeMotif database search toolProteinserver
TEIRESIASMotif extraction and database searchBothserver
*Sequence Type: Protein or nucleotide



Benchmarking

NameLinkAuthors
BAliBASEdownloadThompson, Plewniak, Poch
HOMSTRADdownloadMizuguchi
OxbenchdownloadRaghava, Searle, Audley, Barber, Barton
PFAMdownload 
PREFABdownloadEdgar
SABmarkdownloadVan Walle, Lasters, Wyns
SMARTdownloadLetunic, Copley, Schmidt, Ciccarelli, Doerks, Schultz, Ponting, Bork

Alignment Viewers/Editors

Please see the List of alignment visualization software.

Short-Read Sequence Alignment

NameDescriptionpaired-end optionUse FASTQ qualityGappedMulti-threadedLicenseLink
BarraCUDAA GPGPU accelerated Burrows-Wheeler transform (FM-index) short read alignment program based on BWA, supports alignment of indels with gap openings and extensions.YesNoYesYes (POSIX Threads and CUDA)GPLlink
BFASTExplicit time and accuracy tradeoff with a prior accuracy estimation, supported by indexing the reference sequences. Optimally compresses indexes. Can handle billions of short reads. Can handle insertions, deletions, SNPs, and color errors (can map ABI SOLiD color space reads). Performs a full Smith Waterman alignment.   Yes (POSIX Threads)GPLlink
BLASTNBLAST's nucleotide alignment program, slow and not accurate for short reads, and uses a sequence database (EST, sanger sequence) rather than a reference genome.    link
BLATMade by Jim Kent. Can handle one mismatch in initial alignment step.   Yes (client/server).Free for academic and non-commercial use.link
BowtieUses a Burrows-Wheeler transform to create a permanent, reusable index of the genome; 1.3 GB memory footprint for human genome. Aligns more than 25 million Illumina reads in 1 CPU hour. Supports Maq-like and SOAP-like alignment policies   Yes (POSIX Threads)Artistic Licenselink
BWAUses a Burrows-Wheeler transform to create an index of the genome. It's a bit slower than bowtie but allows indels in alignment.YesNoYesYesGPLlink
CASHXQuantify and manage large quantities of short-read sequence data. CASHX pipeline contains a set of tools that can be used together or as independent modules on their own. This algorithm is very accurate for perfect hits to a reference genome.   NoFree for academic and non-commercial use.link
CUDA-ECShort-read alignment error correction using GPUs.   Yes (GPU enabled) CUDA-EC-
CUSHAWA CUDA compatible short read aligner to large genomes based on Burrows-Wheeler transform.YesYesNoYes (GPU enabled)GPLlink
CUSHAW2Long read alignment based on maximal exact match seeds.YesNoYesYesGPLlink
drFASTRead mapping alignment software that implements cache obliviousness to minimize main/cache memory transfers like mrFAST and mrsFAST, however designed for the SOLiD sequencing platform (color space reads). It also returns all possible map locations for improved structural variation discovery.YesYes (for structural variation)YesNoBSDlink
ELANDImplemented by Illumina. Includes ungapped alignment with a finite read length.      
ERNEExtended Randomized Numerical alignEr for accurate alignment of NGS reads. It can map bisulfite-treated reads.YesLow quality bases trimmingYesMultithreading and MPI-enabledGPL v3link
GNUMAPAccurately performs gapped alignment of sequence data obtained from next-generation sequencing machines (specifically that of Solexa/Illumina) back to a genome of any size. Includes adaptor trimming, SNP calling and Bisulfite sequence analysis. Yes (also supports Illumina *_int.txt and *_prb.txt files with all 4 quality scores for each base) Multithreading and MPI-enabled link
GEMHigh-quality alignment engine (exhaustive mapping, that is 100% of sensitivity, for any number of substitutions; 1 non-exhaustive indel). Several standalone applications (mapper, split mapper, mappability, and other) provided. YesYesYesGPL; GEM source is currently unavailablelink
GensearchNGSComplete framework with user-friendly GUI to analyse NGS data. It integrates a proprietary high quality alignment algorithm as well as plug-in capability to integrate various public aligner into a framework allowing to import short reads, align them, detect variants and generate reports. It is geared towards re-sequencing projects, namely in a diagnostic setting.YesNoYesYesCommercial;link
GMAP and GSNAPRobust, fast short-read alignment. GMAP: longer reads, with multiple indels and splices (see entry above under Genomics analysis); GSNAP: shorter reads, with a single indel or up to two splices per read. Useful for digital gene expression, SNP and indel genotyping. Developed by Thomas Wu at Genentech. Used by the National Center for Genome Resources (NCGR) in Alpheus.YesYesYesYesFree for academic and non-commercial use.link
Geneious AssemblerFast, accurate overlap assembler with the ability to handle any combination of sequencing technology, read length, any pairing orientations, with any spacer size for the pairing, with or without a reference genome.   YesCommerciallink
iSAACiSAAC has been designed to take full advantage of all the computational power available on a single server node. As a result iSAAC scales well over a broad range of hardware architectures, and alignment performance improves with hardware capabilitiesYesYesYesYesFree for academic and non-commercial use.link
LAST YesYesYes GPLlink
MAQUngapped alignment that takes into account quality scores for each base.    GPLlink
mrFAST and mrsFASTGapped (mrFAST) and ungapped (mrsFAST) alignment software that implements cache obliviousness to minimize main/cache memory transfers. They are designed for the Illumina sequencing platform and they can return all possible map locations for improved structural variation discovery.YesYes (for structural variation)YesNoBSDmrFAST mrsFAST
MOMMOM or maximum oligonucleotide mapping is a query matching tool that captures a maximal length match within the short read.   Yes link
MOSAIKFast gapped aligner and reference-guided assembler. Aligns reads using a banded Smith-Waterman algorithm seeded by results from a k-mer hashing scheme. Supports reads ranging in size from very short to very long.   Yes link
MPscanFast aligner based on a filtration strategy (no indexing, use q-grams and Backward Nondeterministic DAWG Matching)     link
Novoalign & NovoalignCSGapped alignment of single end and paired end Illumina GA I & II, ABI Colour space & ION Torrent reads.. High sensitivity and specificity, using base qualities at all steps in the alignment. Includes adapter trimming, base quality calibration, Bi-Seq alignment, and option to report multiple alignments per read.YesYesYesMulti-threading and MPI versions available with paid license.Single threaded version free for academic and non-commercial use.Novocraft
NextGENeNextGENe® software has been developed specifically for use by biologists performing analysis of next generation sequencing data from Roche Genome Sequencer FLX, Illumina GA/HiSeq, Life Technologies Applied BioSystems’ SOLiD™ System, PacBio and Ion Torrent platforms.YesYesYesYesCommercialSoftgenetics
OmixonThe Omixon Variant Toolkit includes highly sensitive and highly accurate tools for detecting SNPs and indels. It offers a solution to map NGS short reads with a moderate distance (up to 30% sequence divergence) from reference genomes. It poses no restrictions on the size of the reference, which, combined with its high sensitivity, makes the Variant Toolkit well-suited for targeted sequencing projects and diagnostics.YesYesYesYesCommercialwww.omixon.com
PALMapperPALMapper, efficiently computes both spliced and unspliced alignments at high accuracy. Relying on a machine learning strategy combined with a fast mapping based on a banded Smith-Waterman-like algorithm it aligns around 7 million reads per hour on a single CPU. It refines the originally proposed QPALMA approach.   YesGPLlink
PartekPartek® Flow software has been developed specifically for use by biologists and bioinformaticians. It supports un-gapped, gapped and splice-junction alignment from single and paired-end reads from Illumina, Life technologies Solid TM, Roche 454 and Ion Torrent raw data (with or without quality information). It integrates powerful quality control on FASTQ/Qual level and on aligned data. Additional functionality include trimming and filtering of raw reads, SNP and InDel detection, mRNA and microRNA quantification and fusion gene detection.YesYesYesMultiprocessor/Core, Client-Server installation possibleCommercial, FREE trial version[1]
PASSIndexes the genome, then extends seeds using pre-computed alignments of words. Works with base space as well as color space (SOLID) and can align genomic and spliced RNA-seq reads.YesYesYesYesFree for academic and non-commercial use.PASS_HOME
PerMIndexes the genome with periodic seeds to quickly find alignments with full sensitivity up to four mismatches. It can map Illumina and SOLiD reads. Unlike most mapping programs, speed increases for longer read lengths.   YesGPLlink
QPalmaIs able to take advantage of quality scores, intron lengths and computation splice site predictions to perform and performs an unbiased alignment. Can be trained to the specifics of a RNA-seq experiment and genome. Useful for splice site/intron discovery and for gene model building. (See PALMapper for a faster version).   Yes (client/server)GPLv2link
RazerSNo read length limit. Hamming or edit distance mapping with configurable error rates. Configurable and predictable sensitivity (runtime/sensitivity tradeoff). Supports paired-end read mapping.    LGPLlink
REAL, cREALREAL is an efficient, accurate, and sensitive tool for aligning short reads obtained from next-generation sequencing. The programme can handle an enormous amount of single-end reads generated by the next-generation Illumina/Solexa Genome Analyzer. cREAL is a simple extension of REAL for aligning short reads obtained from next-generation sequencing to a genome with circular structure. Yes YesGPLlink
RMAPCan map reads with or without error probability information (quality scores) and supports paired-end reads or bisulfite-treated read mapping. There are no limitations on read length or number of mismatches.YesYesYes GPL v3link
rNAA randomized Numerical Aligner for Accurate alignment of NGS readsYesLow quality bases trimmingYesMultithreading and MPI-enabledGPL v3link
RTG InvestigatorExtremely fast, tolerant to high indel and substitution counts. Includes full read alignment. Product includes comprehensive pipelines for variant detection and metagenomic analysis with any combination of Illumina, Complete Genomics and Roche 454 data.YesYes, for variant callingYesYesFree for individual investigator use.link
SegemehlCan handle insertions, deletions and mismatches. Uses enhanced suffix arrays.YesNoYesYesFree for non-commercial uselink
SeqMapUp to 5 mixed substitutions and insertions/deletions. Various tuning options and input/output formats.    Free for academic and non-commercial use.link
ShrecShort read error correction with a Suffix trie data structure.   Yes (Java) link
SHRiMPIndexes the reference genome as of version 2. Uses masks to generate possible keys. Can map ABI SOLiD color space reads.YesYesYesYes (OpenMP)BSD derivativelink
SLIDERSlider is an application for the Illumina Sequence Analyzer output that uses the "probability" files instead of the sequence files as an input for alignment to a reference sequence or a set of reference sequences.     link
SOAP, SOAP2 and SOAP3Robust with a small (1-3) number of gaps and mismatches. Speed improvement over BLAT, uses a 12 letter hash table. SOAP2 using bidirectional BWT to build the index of reference, and it is much faster than the first version. Now an GPU-accelerated version named as SOAP3/GPU is available, that could find all 4-mismatch alignments in tens of seconds per one million reads.Yes  Yes(multithread), SOAP3/GPU need GPU available.GPLlink
SOCSFor ABI SOLiD technologies. Significant increase in time to map reads with mismatches (or color errors). Uses an iterative version of the Rabin-Karp string search algorithm.   YesGPLlink
SSAHA and SSAHA2Fast for a small number of variants.    Free for academic and non-commercial use.link
StampyFor Illumina reads. High specificity, and sensitive for reads with indels, structural variants, or many SNPs. Slow, but speed increased dramatically by using BWA for first alignment pass).YesYesYesNoFree for academic and non-commercial uselink
SToRMExperimental  for singles reads only (mainly SOLiD, but with Illumina experimental support now), and with SAM native output. Highly sensitive for reads with many errors, indels (from 1 to 16), and SNPs. Uses spaced seeds. Authors recommend Shrimp2.NoYesYesYes (OpenMP) link
Taipande-novo Assembler for Illumina reads    Free for academic and non-commercial use.link
UGENEVisual interface both for Bowtie and BWA, as well as an embedded aligner    Opensource, GPLlink
VelociMapperFPGA-accelerated reference sequence alignment mapping tool from TimeLogic. Faster than Burrows-Wheeler transform-based algorithms like BWA and Bowtie. Supports up to 7 mismatches and/or indels with no performance penalty. Produces sensitive Smith-Waterman gapped alignments.YesYesYesYesCommercialTimeLogic
XpressAlignFPGA based sliding window short read aligner which exploits the embarrassingly parallel property of short read alignment. Performance scales linearly with number of transistors on a chip (i.e. performance guaranteed to double with each iteration of Moore's Law without modification to algorithm). Low power consumption is useful for datacentre equipment. Predictable runtime. Better price/performance than software sliding window aligners on current hardware, but not better than software BWT-based aligners currently. Can cope with large numbers (>2) of mismatches. Will find all hit positions for all seeds. Single-FPGA experimental version, needs work to develop it into a multi-FPGA production version.    Free for academic and non-commercial use.link
ZOOM100% sensitivity for a reads between 15 - 240bp with practical mismatches. Very fast. Support insertions and deletions. Works with Illumina & SOLiD instruments, not 454.   Yes (GUI) No (CLI).Commerciallink

See also

References

  1. ^ Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (October 1990). "Basic local alignment search tool". Journal of Molecular Biology 215 (3): 403–10. doi:10.1016/S0022-2836(05)80360-2. PMID 2231712.
  2. ^ Angermüller, C.; Biegert, A.; Söding, J. (Dec 2012). "Discriminative modelling of context-specific amino acid substitution probabilities.". Bioinformatics 28 (24): 3240–7. doi:10.1093/bioinformatics/bts622. PMID 23080114.
  3. ^ Durbin, Richard; Eddy, Sean R.; Krogh, Anders et al., eds. (1998). Biological sequence analysis: probalistic models of proteins and nucleic acids. Cambridge, UK: Cambridge University Press. ISBN 978-0-521-62971-3.[page needed]
  4. ^ Söding J (April 2005). "Protein homology detection by HMM-HMM comparison". Bioinformatics 21 (7): 951–60. doi:10.1093/bioinformatics/bti125. PMID 15531603.
  5. ^ Altschul SF, Madden TL, Schäffer AA et al. (September 1997). "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs". Nucleic Acids Research 25 (17): 3389–402. doi:10.1093/nar/25.17.3389. PMC 146917. PMID 9254694. //www.ncbi.nlm.nih.gov/pmc/articles/P MC146917/.
  6. ^ Li W, McWilliam H, Goujon M et al. (June 2012). "PSI-Search: iterative HOE-reduced profile SSEARCH searching". Bioinformatics 28 (12): 1650–1651. doi:10.1093/bioinformatics/bts240. PMC 3371869. PMID 22539666. //www.ncbi.nlm.nih.gov/pmc/articles/P MC3371869/.
  7. ^ Oehmen, C.; Nieplocha, J. (August 2006). "ScalaBLAST: A scalable implementation of BLAST for high-performance data-intensive bioinformatics analysis". IEEE Transactions on Parallel & Distributed Systems 17 (8): 740–749. doi:10.1109/TPDS.2006.112.
  8. ^ Hughey, R.; Karplus, K.; Krogh, A. (2003). SAM: sequence alignment and modeling software system. Technical report UCSC-CRL-99-11 (Report). University of California, Santa Cruz, CA. http://compbio.soe.ucsc.edu/papers/sa m_doc/sam_doc.html.
  9. ^ Gusfield, Dan (1997). Algorithms on strings, trees and sequences. Cambridge university press. ISBN 0-521-58519-8.

External links

(Prev) List of semiconductor scale ex ...List of SIP software (Next)