Document Type
Article
Journal Title
BMC bioinformatics
Publication Date
10-7-2011
Volume
12
Abstract
BACKGROUND: Although numerous methods of using microarray data analysis for cancer classification have been proposed, most utilize many genes to achieve accurate classification. This can hamper interpretability of the models and ease of translation to other assay platforms. We explored the use of single genes to construct classification models. We first identified the genes with the most powerful univariate class discrimination ability and then constructed simple classification rules for class prediction using the single genes.
RESULTS: We applied our model development algorithm to eleven cancer gene expression datasets and compared classification accuracy to that for standard methods including Diagonal Linear Discriminant Analysis, k-Nearest Neighbor, Support Vector Machine and Random Forest. The single gene classifiers provided classification accuracy comparable to or better than those obtained by existing methods in most cases. We analyzed the factors that determined when simple single gene classification is effective and when more complex modeling is warranted.
CONCLUSIONS: For most of the datasets examined, the single-gene classification methods appear to work as well as more standard methods, suggesting that simple models could perform well in microarray-based cancer prediction.
MeSH Headings
Algorithms, Cluster Analysis, Gene Expression Profiling, Humans, Neoplasms, Oligonucleotide Array Sequence Analysis, Reproducibility of Results, Support Vector Machines
DOI Link
ISSN
1471-2105
Rights
This work is licensed under a Creative Commons Attribution 2.0 License.
Recommended Citation
Wang, Xiaosheng and Simon, Richard, "Microarray-based cancer prediction using single genes." (2011). Journal Articles: Genetics, Cell Biology & Anatomy. 14.
https://digitalcommons.unmc.edu/com_gcba_articles/14