JOURNAL ARTICLE

A Grammar Based Approach for Mining Bioinformatics Databases

Abstract

In this paper we introduce a new formal approach for mining biological data sets. The proposed grammar-based approach provides a flexible and powerful tool for advanced sequence comparison and data mining. It draws on the power of regular grammars to support advanced queries for comparing sequences and for searching biological databases for motifs or interior-sequence attributes. The formal grammar and the corresponding data mining engine are capable of extracting records from biological databases, filtering a subset of those records for mining, and then sorting those records according to a similarity scheme designed by the user. The model is driven by the objective (ontology) of the user, and scoring is dynamic and supplied at runtime.
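The extract, filter, and sort steps described above can be illustrated with a minimal sketch. It is not the authors' grammar or engine: it assumes Python's `re` module as a stand-in for a regular grammar (regular expressions and regular grammars describe the same class of languages), and the record fields, the `mine` and `motif_count` functions, and the sample sequences are all hypothetical.

```python
import re

# Hypothetical records; a real biological database would supply these.
RECORDS = [
    {"id": "seq1", "organism": "E. coli", "sequence": "ATGGCGTATAAAGGCTATAAATGA"},
    {"id": "seq2", "organism": "E. coli", "sequence": "ATGCCCGGGTTTAAACCC"},
    {"id": "seq3", "organism": "H. sapiens", "sequence": "GGTATAAAGCGCG"},
]


def mine(records, keep, motif, score):
    """Filter records, keep those containing the motif, and rank them.

    keep  -- predicate selecting the subset of records to mine
    motif -- regular expression standing in for a regular-grammar query
    score -- user-supplied scoring function, chosen at runtime
    """
    pattern = re.compile(motif)
    selected = [r for r in records if keep(r)]
    matched = [r for r in selected if pattern.search(r["sequence"])]
    # Highest score first; the scoring scheme is entirely up to the caller.
    return sorted(matched, key=lambda r: score(r, pattern), reverse=True)


def motif_count(record, pattern):
    """Example scoring scheme: number of non-overlapping motif hits."""
    return len(pattern.findall(record["sequence"]))


# Rank E. coli records by how often a TATA-like motif occurs.
ranked = mine(
    RECORDS,
    keep=lambda r: r["organism"] == "E. coli",
    motif=r"TATA[AT]A",
    score=motif_count,
)
all_ranked = mine(RECORDS, keep=lambda r: True, motif=r"TATA[AT]A", score=motif_count)
print([r["id"] for r in ranked])
print([r["id"] for r in all_ranked])
```

Because the scoring function is passed in as an argument, a different similarity scheme can be swapped in at call time without changing the query or the filter, which mirrors the runtime scoring the abstract describes.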

Keywords:
Computer science; Sorting; Data mining; Formal grammar; Rule-based machine translation; Biological database; Ontology; Scheme (mathematics); Similarity (geometry); Information retrieval; Sequence (biology); Database; Artificial intelligence; Bioinformatics; Programming language

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 6
Citation Normalized Percentile: 0.14

Topics

Algorithms and Data Compression
Physical Sciences →  Computer Science →  Artificial Intelligence
Genomics and Phylogenetic Studies
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Bioinformatics and Genomic Networks
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
