Navigation Links
BLAST


For other uses, see BLAST (disambiguation).

In bioinformatics, Basic Local Alignment Search Tool, or BLAST, is an algorithm for comparing biological sequences, such as the amino-acid sequences of different proteins or the DNA sequences. Given a library or database of sequences, a BLAST search enables a researcher to look for sequences that resemble a given sequence of interest. For example, following the discovery of a previously unknown gene in the mouse, a scientist typically will perform a BLAST search of the human genome to see if human beings carry a similar gene; BLAST will identify sequences in the human genome that resemble the mouse gene based on similarity of sequence.

BLAST is one of the most widely used bioinformatics programs, probably because it addresses a fundamental problem, and its algorithm emphasizes speed over sensitivity. This emphasis on speed is vital to making the algorithm practical on the huge genome databases currently available, although subsequent algorithms can be even faster.

Examples of other questions that researchers use BLAST to answer are

  • Which bacterial species have a protein that is related in lineage to a certain protein whose amino-acid sequence I know?
  • Where does the DNA that I've just sequenced come from?
  • What other genes encode proteins that exhibit structures or motifs such as the one I've just determined?

BLAST is also often as part of other algorithms that require approximate sequence matching.

The BLAST algorithm and a computer program that implements it were developed by Stephen Altschul, Warren Gish , David Lipman at the U.S. National Center for Biotechnology Information (NCBI), Webb Miller at The Pennsylvania State University, and Gene Myers at the University of Arizona . It is available on the web at [1].

The original paper "Altschul, SF, W Gish, W Miller, EW Myers, and DJ Lipman. Basic local alignment search tool. J Mol Biol 215(3):403-10, 1990." was the most highly cited paper published in the 1990s.

Algorithm

To run, BLAST requires two sequences as input: a query sequence (also called the target sequence) and a sequence database. BLAST will find subsequences in the query that is similar to a subsequence in the database. In typical usage, the query sequence is much smaller than the database, e.g., the query may be 1 thousand nucleotides while the database is several billion nucleotides.

To define what it means for two subsequences to be "similar", BLAST uses the Smith-Waterman algorithm. Unfortunately, the Smith-Waterman algorithm is too slow to use on huge genome databases currently available. Therefore, the BLAST algorithm works by searching for small regions that are exactly the same in the two sequences and then attempting to extend the alignment to either side until the comparison score reaches a certain threshold. These heuristics used to speed the basic Smith-Waterman algorithm are the key technical innovation of BLAST programs, and the more practical CPU requirements explain why BLAST is vastly more used than the Smith-Waterman algorithm.

An extremely fast alternative to BLAST that compares nucleotide sequences to the genome is BLAT (Blast Like Alignment Tool). A more precise, and much slower, alternative to BLAST is the Smith Waterman.

Program

The BLAST program can either be downloaded and run as a command-line utility "blastall" or accessed for free over the web. The BLAST web server, hosted by the NCBI, allows anyone with a web browser to perform similarity searches against constantly updated databases of proteins and DNA that include most of the newly sequenced organisms.

BLAST is actually a family of programs (all included in the blastall executable). The following are some of the programs, ranked mostly in order of importance:

  • Nucleotide-nucleotide BLAST (blastn): This program, given a DNA query, returns the most similar DNA sequences from the DNA database that the user specifies.
  • Protein-protein BLAST (blastp): This program, given a protein query, returns the most similar protein sequences from the protein database that the user specifies.
  • Position-Specific Iterative BLAST (PSI-BLAST): One of the more recent BLAST programs, this program is used for finding distant relatives of a protein. First, a list of all closely related proteins is created. Then these proteins are combined into a "profile" that is a sort of average sequence. A query against the protein database is then run using this profile, and a larger group of proteins found. This larger group is used to construct another profile, and the process is repeated.
    By including related proteins in the search, PSI-BLAST is much more sensitive in picking up distant evolutionary relationships than the standard protein-protein BLAST.
  • Nucleotide-protein 6-frame translation (blastx): This program compares the six-frame conceptual translation products of a nucleotide query sequence (both strands) against a protein sequence database. This can be very slow.
  • Nucleotide-nucleotide 6-frame translation (tblastx): This program is the slowest of the BLAST family. It translates both query and target nucleotide sequences in all six possible frames and compares the resulting proteins. The purpose of tblastx is to find very distant relationships between nucleotide sequences.
  • Protein-nucleotide 6-frame translation (tblastn): This program translates the target database in all 6 frames and compares to a protein query sequence.
  • Large numbers of query sequences (megablast): When comparing large numbers of input sequences via the command-line BLAST, "megablast" is much faster than running BLAST multiple times.

External links


'"/>


See more about: BLAST

TAG: BLAST
Other biology definition
(Date:11/21/2008)...rproof" versions of popular varieties of rice, whi...have passed tests in farmers, fields with flying c... official release by national and state seed certi...armers suffer major crop losses because of floodin...s enough rice to feed 30 million people. , The f...
(Date:11/21/2008)...rown University] Although Americans are becoming ... everyday household products like bisphenol A in s...ot readily connect typical household products with...alth effects, according to research from the Decem...avior . Brown University sociologist Phil Brown is...
(Date:11/20/2008)...-like tracks on the ocean floor made by giant deep...ights into the evolutionary origin of animals, say...ty of Texas at Austin. , Matz and his colleague... their complex tracks on the ocean floor near the ...ganism has been shown to make such animal-like tra...
(Date:11/20/2008)...ov. 20, 2008 -- A team led by Thomas Schulthess of...l Laboratory received the prestigious 2008 Associa...ze Thursday after attaining the fastest performanc... , Schulthess is group leader of ORNL,s Computat...d a position as director of the Swiss National Sup...
Breaking Biology News(10 mins):From genes to farmers' fields 2From genes to farmers' fields 3Household exposure to toxic chemicals lurks unrecognized, researchers find 2Household exposure to toxic chemicals lurks unrecognized, researchers find 3Discovery of giant roaming deep sea protist provides new perspective on animal evolution 2Discovery of giant roaming deep sea protist provides new perspective on animal evolution 3ORNL supercomputer simulation wins prize for fastest-running science application 2ORNL supercomputer simulation wins prize for fastest-running science application 3Better Sleepers Are Successful Agers 21745 1Better Sleepers Are Successful Agers 21745 2228 People in 22 States Sickened in Ongoing Salmonella Tomato Outbreak 21743 1228 People in 22 States Sickened in Ongoing Salmonella Tomato Outbreak 21743 2228 People in 22 States Sickened in Ongoing Salmonella Tomato Outbreak 21743 3228 People in 22 States Sickened in Ongoing Salmonella Tomato Outbreak 21743 4228 People in 22 States Sickened in Ongoing Salmonella Tomato Outbreak 21743 5MEDRAD and PETNET Solutions Partner for Innovative FDG Delivery 21741 1MEDRAD and PETNET Solutions Partner for Innovative FDG Delivery 21741 2MEDRAD and PETNET Solutions Partner for Innovative FDG Delivery 21741 3Cancer Clinics of Excellence 28CCE 29 Announces Management and Board of Directors Slate at 1st Annual General Membership Meeting 21739 1Cancer Clinics of Excellence 28CCE 29 Announces Management and Board of Directors Slate at 1st Annual General Membership Meeting 21739 2
...m, popped them, endured bad song lyrics about them...add a more sophisticated application to the list--...to tumors, or to deliver drugs. , The process of...tion, and using gas bubbles is a new technique in ...the technique allows doctors to control exactly wh...
...ucted in Virunga National Park in the Democratic R...es of large mammal are now recovering from a decad...the Wildlife Conservation Society (WCS) and the In... (ICCN). Specifically, elephants and other species...rk,s last census, due in large part to the anti-po...
...a long-held belief regarding the cultural spread o...hat chimpanzees in the Ebo forest, Cameroon, use s...cess the nutrient-rich seeds. The findings are sig...eviously known only in a distant chimpanzee popula...be restricted by geographical boundaries that prev...
...issue therapy. Simply supply a bag of your blood a... cells from other tissues, ranging from brain and ...lls of the pancreas. , The idea is to revert a pa...n chemically nudge them to re-specialise into part...damaged tissue. A huge advantage over using donate...
Other Biology News:Bubbles go high-tech to fight tumors 2Elephants, large mammals recover from poaching in Africa's oldest national park 2Use of stone hammers sheds light on geographic patterns of chimpanzee tool use 2Teasing out tissue from blood 2