Basic local alignment search tool¶
Why this mattered¶
Before BLAST, searching rapidly growing DNA and protein databases for biologically meaningful similarities was computationally expensive enough to limit routine use. The 1990 paper changed that balance by showing that local sequence similarity could be found quickly through a seed-and-extend heuristic while still being tied to a statistical framework for assessing significance. This made database search a practical everyday operation rather than a specialized, slow computation.
The paradigm shift was not merely faster alignment; it was the conversion of sequence comparison into an exploratory engine for biology. After BLAST, a newly sequenced gene or protein could be compared against known sequences quickly enough to infer possible function, identify homologs, detect conserved domains, and guide experiments. As sequence databases expanded through the 1990s and into the genome era, this capacity became central to annotation: raw sequence data could be linked to evolutionary and functional context at scale.
BLAST also helped define the working style of modern bioinformatics. Its successors and extensions, including gapped BLAST and PSI-BLAST, improved sensitivity and made profile-based discovery of distant relationships more routine. The broader impact was that large-scale genomics, comparative genomics, metagenomics, and protein family analysis all came to rely on fast, statistically interpretable similarity search as a basic layer of inference.
Abstract¶
(no abstract available)
Related¶
- cite → Improved tools for biological sequence comparison. — BLAST extends FASTA-style heuristic sequence searching with a faster local-alignment word-hit strategy.
- cite → A general method applicable to the search for similarities in the amino acid sequence of two proteins — BLAST uses the same biological goal as Needleman-Wunsch, identifying sequence similarity, but replaces full dynamic programming with local heuristic search.
- enables → Accelerated Profile HMM Searches — BLAST popularized fast heuristic local sequence search, setting the computational benchmark that accelerated profile HMM searches aimed to surpass.
- enables → The Protein Data Bank — BLAST enables the Protein Data Bank by making rapid sequence similarity search a core way to identify and relate protein structures.
- enables → The Sequence of the Human Genome — BLAST enabled rapid sequence similarity search, supporting annotation and validation of assembled regions in the Human Genome Project.
- enables → Mapping and quantifying mammalian transcriptomes by RNA-Seq — BLAST enabled RNA-Seq transcriptome mapping by providing fast local sequence alignment for assigning short reads or assembled sequences to genes.
- cite ← Accelerated Profile HMM Searches — Accelerated Profile HMM Searches compares HMMER's profile-HMM sequence search speed and sensitivity against BLAST's local alignment heuristic.
- cite ← The Protein Data Bank — The Protein Data Bank cites BLAST as a standard method for local sequence similarity searches against protein structures.
- cite ← The Sequence of the Human Genome — The human genome paper uses BLAST sequence alignment to identify genes and compare genomic sequences.
- cite ← Mapping and quantifying mammalian transcriptomes by RNA-Seq — RNA-Seq uses BLAST-style local sequence alignment principles to map sequenced transcript reads to reference genes or genomes.
- enables ← A general method applicable to the search for similarities in the amino acid sequence of two proteins — Needleman-Wunsch framed sequence similarity as an alignment problem, while BLAST made local similarity search fast enough for large sequence databases.