Pairwise sequence alignment

MUTATIONS

Can happens by

substitution

insertion

deletion

Mutability

high mutability

Asn,

Ser

Asp

Glu

Mutability

Tyr

Phe

Leu

Cys

Trp

SCORING MATRICES

PAM

PAM1

probability of occurance
+
percent accepted mutations
=
mutation probability matrix

at least 85% identical

one PAM (unit of evolution) =
1% amino acids changed between two proteins

Percent Accepted Mutations (Dayhoff)

global alignment (closely related protein)

X% divergence

BLAST

Blocks Substitution Matrix (Henikoff)

local alignment (distantly related protein)

X% identical

Smith-Waterman local alignment

four component for BLAST search

CHOOSE sequences

SELECT BLAST program

blastp

blastn

blastx

tblastn

tblastx

CHOOSE database

nr

Month

Swisprot

RefSeq

Pdb

Pat

Subtopic

pnv_nr

CHOOSE optional parameters

different for every blast program

Organism

Turn filtering on/off

substitution matrix

expect value

Extreme Value Distribution (EVD)

E = Kmn e-λS

word size

output format

BLAT

BLAST-like alignment tool

500 times faster than BLAST

design to rapidly align longer nucleotide sequence

more than 95% similarity

SCORES

identity

mismatch

gap penalty

opening - inroduce gap

extension - elongate gap

scores

raw scores (calculate from substitution matrix)

bit scores (normalized scores)