BLAST and FASTA Similarity Searching for Multiple Sequence Alignment

互联网2014-02-13

541

BLAST, FASTA, and other similarity searching programs seek to identify homologous proteins and DNA sequences based on excess sequence similarity. If two sequences share much more similarity than expected by chance, the simplest explanation for the excess similarity is common ancestry—homology. The most effective similarity searches compare protein sequences, rather than DNA sequences, for sequences that encode proteins, and use expectation values, rather than percent identity, to infer homology. The BLAST and FASTA packages of sequence comparison programs provide programs for comparing protein and DNA sequences to protein databases (the most sensitive searches). Protein and translated-DNA comparisons to protein databases routinely allow evolutionary look back times from 1 to 2 billion years; DNA:DNA searches are 5–10-fold less sensitive. BLAST and FASTA can be run on popular web sites, but can also be downloaded and installed on local computers. With local installation, target databases can be customized for the sequence data being characterized. With today’s very large protein databases, search sensitivity can also be improved by searching smaller comprehensive databases, for example, a complete protein set from an evolutionarily neighboring model organism. By default, BLAST and FASTA use scoring strategies target for distant evolutionary relationships; for comparisons involving short domains or queries, or searches that seek relatively close homologs (e.g. mouse–human), shallower scoring matrices will be more effective. Both BLAST and FASTA provide very accurate statistical estimates, which can be used to reliably identify protein sequences that diverged more than 2 billion years ago.

相关产品推荐

MiN_P_P1/MiN_P_P1蛋白Recombinant Rat Multiple inositol polyphosphate phosphatase 1 (MiN_P_P1)重组蛋白2,3-bisphosphoglycerate 3-phosphatase (EC:3.1.3.80) ;2,3-BPG phosphataseInositol (1,3,4,5)-tetrakisphosphate 3-phosphatase ;Ins(1,3,4,5)P(4) 3-phosphatase蛋白

￥1836

CHAMP1 Homo sapiens chromosome alignment maintaining phosphoprotein 1 (CHAMP1), transcript variant 2, mRNA.

询价

FCER2/FCER2蛋白Recombinant Human Low affinity immunoglobulin epsilon Fc receptor (FCER2)重组蛋白BLAST-2 (C-type lectin domain family 4 member J) (Fc-epsilon-RII) (Immunoglobulin E-binding factor) (Lymphocyte IgE receptor)蛋白

￥2376

SPT15/SPT15蛋白Recombinant Saccharomyces cerevisiae TATA-box-binding protein (SPT15)重组蛋白TATA sequence-binding protein ;TBPTATA-binding factorTATA-box factorTranscription factor DTranscription initiation factor TFIID TBP subunit蛋白

￥2616

LRP4/LRP4蛋白Recombinant Human Low-density lipoprotein receptor-related protein 4 (LRP4)重组蛋白Multiple epidermal growth factor-like domains 7蛋白

￥1344