Abstract
Gene expression microarray technology provides the global information on transcriptional activities of essentially all genes simultaneously, and it thus promotes the new application of traditional feature selection methods in the fields of molecular biology and life sciences. The basic strategy for the traditional feature selection methods is to seek for a single gene subset that leads to the best prediction of biological types, for example tumor versus normal tissues. Because of complexities and genetic heterogeneities of biological phenotypes (e.g. complex diseases), robust computational approaches are desirable to achieve high generalization performance with multiple classifiers and perturbations of the data structures. The purpose of this study is to develop an ensemble decision approach to analysis of multiple heterogeneous phenotypes. The results from an application to a lymphoma data of five subtypes indicate that the proposed analysis strategy is feasible and powerful to perform biological subtype.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Gu, C.C., Rao, D.C., Stormo, G., Hicks, C., Province, M.A.: Role of gene expression microarray analysis in finding complex disease genes. Genet Epidemiol 23, 37–56 (2002)
Bian, Z., Zhang, X.: Pattern Recognition, pp. 198, 87-90, 113-116;120-121. TsingHua Press, Beijing (2000)
Kohavi, R., John, G.: Wrappers for feature subset selection. Artificial Intelligence 97, 273–324 (1997)
Furlanello, C., Serafini, M., Merler, S., Jurman, G.: Entropy-based gene ranking without selection bias for the predictive classification of microarray data. BMC Bioinformatics 4, 54 (2003)
John, G.H., Kohavi, R., Pfleger, K.: Irrelevant features and the subset selection problem. In: Machine Learning: Proceedings of the 11th International Conference, pp. 121–129 (1994)
Blum, A.L., Langley, P.: Selection of relevant features and examples in machine learning. Artificial Intelligence 97, 245–271 (1997)
Puuronen, S., Tsymbal, A.: Local feature selection with dynamic integration of classifiers. Fundamenta Informaticae 47, 91–117 (2001)
Hansen, J.V.: Combining predictors: comparison of five meta machine learning methods. Information Science 119, 91–105 (1999)
Opitz, D.W., Maclin, R.F.: An empirical evaluation of bagging and boosting for artificial neural networks. International conference on neural networks 3, 1401–1405 (1997)
Li, X., Rao, S., Wang, Y., Gong, B.: Gene mining: a novel and powerful ensemble decision approach to hunting for disease genes using microarray expression profiling. Nucl. Acids Res. 32, 2685–2694 (2004)
Zheng, Z., Webb, G., Ting, K.: Integrating boosting and stochastic attribute selection committees for further improving the performance of decision tree learning. In: 10th International Conference on Tools With Artificial Intelligence TAI 1998, Society IC, Los Alamitos, USA, pp. 216–223 (1998)
Wang, H.Y., Li, X., Guo, Z.: Research on pattern classification methods using gene expression data. Biomedical Engineering Journal (2005) (in press)
Kurra, G., Niu, W., Bhatnagar, R.: Mining microarray expression data for classifier gene-cores. In: Proceedings of the Workshop on Data Mining in Bioinformatics, pp. 8–14 (2001)
Alizadeh, A.A., Eisen, M.B., Davis, R.E., Ma, C., Lossos, I.S., Rosenwald, A., Boldrick, J.C., Sabet, H., Tran, T., Yu, X., Powell, J.I., Yang, L., Marti, G.E., Moore, T., Hudson Jr., J., Lu, L., Lewis, D. B., Tibshirani, R., Sherlock, G., Chan, W.C., Greiner, T.C., Weisenburger, D.D., Armitage, J.O., Warnke, R., Levy, R., Wilson, W., Grever, M.R., Byrd, J.C., Botstein, D., Brown, P.O., Staudt, L.M.: Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature 403, 503–511 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, H., Zhang, Q., Wang, Y., Li, X., Rao, S., Ding, Z. (2005). A Novel Feature Ensemble Technology to Improve Prediction Performance of Multiple Heterogeneous Phenotypes Based on Microarray Data. In: Wang, L., Jin, Y. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2005. Lecture Notes in Computer Science(), vol 3614. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11540007_109
Download citation
DOI: https://doi.org/10.1007/11540007_109
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28331-7
Online ISBN: 978-3-540-31828-6
eBook Packages: Computer ScienceComputer Science (R0)Springer Nature Proceedings Computer Science
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
