© The Institution of Engineering and Technology
Feature selection is important and necessary for disease classification and prediction using high-dimensional gene expression data. A hybrid method integrating sparse representation with a two-sample statistical t-test to construct features from high-throughput microarray data is presented. The approach takes account of gene interaction and reduces the variable dimension by sparse linear combination, as well as considers the discriminative power of genes using component regression. Under the recurrent independence rule for classification, the experiment results on real data demonstrate the improvements of this hybrid technique over conventional methods.
References
-
-
1)
-
4. Cheng, Q., Cheng, J.: ‘Sparsity optimization method for multivariate feature screening for gene expression analysis’, J. Comput. Biol., 2011, 16, (9), pp. 1241–1252 (doi: 10.1089/cmb.2008.0034).
-
2)
-
3. Lee, J., Lim, H., Kim, D.W.: ‘Approximating mutual information for multi-label feature selection’, Electron. Lett., 2012, 48, (15), p. 929 (doi: 10.1049/el.2012.1600).
-
3)
-
2. Peter, H., Xue, J.H.: ‘On selecting interacting features from high-dimensional data’, Comput. Stat. Data Anal., 2012, .
-
4)
-
10. Dudoit, S., Fridly, J., Speed, T.P.: ‘Comparison of discrimination methods for the classification of tumors using gene expression data’, J. Am. Stat. Assoc., 2002, 97, (457), pp. 77–87 (doi: 10.1198/016214502753479248).
-
5)
-
6. Tibshirani, R., Hastie, T., Narasimhan, B., Chu, G.: ‘Diagnosis of multiple cancer types by shrunken centroids of gene expression’, Proc. Natl. Acad. Sci., 2002, 99, pp. 6567–6572 (doi: 10.1073/pnas.082099299).
-
6)
-
9. Gordon, G.J., et al: ‘Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesothelioma’, Cancer Res., 2002, 62, (17), pp. 4963–4967.
-
7)
-
D. Singh ,
P.G. Febbo ,
K. Ross
.
Gene expression correlates of clinical prostate cancer behavior.
Cancer Cell
,
203 -
209
-
8)
-
1. Fan, J., Fan, Y.: ‘High-dimensional classification using features annealed independence rules’, Ann. Stat., 2008, 36, pp. 2232–2260 (doi: 10.1214/07-AOS504).
-
9)
-
5. He, B.S., Yuan, X.M.: ‘A contraction method with implementable proximal regularization for linearly constrained convex programming’, Optim. Online, 2010.
-
10)
-
T.R. Golub ,
D.K. Slonim ,
P. Tamayo ,
C. Huard ,
M. Gaasenbeek ,
J.P. Mesirov ,
H. Coller ,
M.L. Loh ,
J.R. Downing ,
M.A. Caligiuri ,
C.D. Bloomfield ,
E.S. Lander
.
Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.
Science
,
531 -
537
http://iet.metastore.ingenta.com/content/journals/10.1049/el.2013.3296
Related content
content/journals/10.1049/el.2013.3296
pub_keyword,iet_inspecKeyword,pub_concept
6
6