Data Mining Based on Colon Cancer Gene Expression Profiles

Junkui Chen; Junzhong Gu

doi:10.1109/iccis.2011.120

ScienceGate Book Chapters

JOURNAL ARTICLE

Data Mining Based on Colon Cancer Gene Expression Profiles

Junkui Chen Junzhong Gu

Year: 2011 Vol: 21 Pages: 264-267

DOI: 10.1109/iccis.2011.120

Get Full-Text PDF Get Analytical Report

Abstract

This research is based on biological information theory. In order to study the selection of colon cancer samples in normal samples and the classification of information gene, the use of pattern recognition and data mining methods were applied to analyze gene expression data for colon cancer. Firstly, signal to noise ratio (SNR) and the Bhattacharyya distance (BHA) were used to remove the irrelevant genes and noise, on the basis of deletion by mistake. After that, 100 information genes could be obtained respectively. Secondly, we calculate the union set of the 200 information genes called union C, and 102 information genes left. Thirdly, the minimum redundancy maximum relevance (MRMR) method was used to search for the information gene set in the union C. Finally, support vector machine (SVM) was used as the classifier to identify normal samples from colon cancer samples and 12 information genes were extracted based on the average classification rate. Several random sampling results showed that 12 information gene extracted in the study can classify normal samples and colon cancer samples at a high correct rate of 93.70%.

Keywords:

Bhattacharyya distance Support vector machine Gene selection Classifier (UML) Pattern recognition (psychology) Gene Computer science Artificial intelligence Data mining Data redundancy Mathematics Gene expression Computational biology Biology Genetics

Metrics

Cited By

0.14

FWCI (Field Weighted Citation Impact)

Refs

0.52

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Gene expression and cancer classification

Life Sciences → Biochemistry, Genetics and Molecular Biology → Molecular Biology

Bioinformatics and Genomic Networks

Life Sciences → Biochemistry, Genetics and Molecular Biology → Molecular Biology

Machine Learning in Bioinformatics

Life Sciences → Biochemistry, Genetics and Molecular Biology → Molecular Biology

Data Mining Based on Colon Cancer Gene Expression Profiles

Abstract

Metrics

Citation History

Topics

Related Documents

Breast and Colon Cancer Classification from Gene Expression Profiles Using Data Mining Techniques

Breast and Colon Cancer Classification from Gene Expression Profiles Using Data Mining Techniques

Gene expression profiles of folate deficient colon cancer cells

DATA MINING FOR GENE EXPRESSION PROFILES FROM DNA MICROARRAY

Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles