Sequence Similarity and Database Searching

David S. Wishart

doi:10.1007/978-1-59259-335-4_27

ScienceGate Book Chapters

BOOK-CHAPTER

Sequence Similarity and Database Searching

David S. Wishart

Year: 2003 Humana Press eBooks Pages: 443-461 Publisher: Humana Press

DOI: 10.1007/978-1-59259-335-4_27

Get Full-Text PDF Get Analytical Report

Abstract

Database searching is perhaps the fastest, cheapest, and most powerful experiment a biologist can perform. No other 10-s test allows a biologist to reveal so much about the function, structure, location or origin of a gene, protein, organelle, or organism. A database search does not consume any reagents or require any specific wet-bench laboratory skills; just about anyone can do it, but the key is to do it correctly. The power of database searching comes from not only the size of today’s sequence databases (now containing more than 700,000 annotated gene and protein sequences), but from the ingenuity of certain key algorithms that have been developed to facilitate this very special kind of searching. Given the importance of database searching it is crucial that today’s life scientists try to become as familiar as possible with the details of the process. Indeed, the intent of this chapter to provide the reader with some insight and historical background to the methods and algorithms that form the foundation of a few of the most common database searching techniques. There are many strengths, misconceptions and weaknesses to these simple but incredibly useful computer experiments.KeywordsQuery SequenceBasic Local Alignment Search ToolHash TableAlignment ScoreBasic Local Alignment Search Tool SearchThese keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Keywords:

Ingenuity Computer science Similarity (geometry) Key (lock) Function (biology) Process (computing) Information retrieval Nearest neighbor search Sequence (biology) Database Data mining Artificial intelligence Biology

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.12

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Genomics and Phylogenetic Studies

Life Sciences → Biochemistry, Genetics and Molecular Biology → Molecular Biology

Genetics, Bioinformatics, and Biomedical Research

Life Sciences → Biochemistry, Genetics and Molecular Biology → Molecular Biology

RNA and protein synthesis mechanisms

Life Sciences → Biochemistry, Genetics and Molecular Biology → Molecular Biology

Sequence Similarity and Database Searching

Abstract

Metrics

Topics

Related Documents

Sequence Similarity and Database Searching

Database Similarity Searching

Multiprocessor Sequence Similarity Searching

Similarity Searching for Database Applications

Sequence Alignment and Database Searching