Skip to main content

A Novel Feature Ensemble Technology to Improve Prediction Performance of Multiple Heterogeneous Phenotypes Based on Microarray Data

  • Conference paper
Fuzzy Systems and Knowledge Discovery (FSKD 2005)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3614))

Included in the following conference series:

  • 1412 Accesses

Abstract

Gene expression microarray technology provides the global information on transcriptional activities of essentially all genes simultaneously, and it thus promotes the new application of traditional feature selection methods in the fields of molecular biology and life sciences. The basic strategy for the traditional feature selection methods is to seek for a single gene subset that leads to the best prediction of biological types, for example tumor versus normal tissues. Because of complexities and genetic heterogeneities of biological phenotypes (e.g. complex diseases), robust computational approaches are desirable to achieve high generalization performance with multiple classifiers and perturbations of the data structures. The purpose of this study is to develop an ensemble decision approach to analysis of multiple heterogeneous phenotypes. The results from an application to a lymphoma data of five subtypes indicate that the proposed analysis strategy is feasible and powerful to perform biological subtype.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Gu, C.C., Rao, D.C., Stormo, G., Hicks, C., Province, M.A.: Role of gene expression microarray analysis in finding complex disease genes. Genet Epidemiol 23, 37–56 (2002)

    Article  Google Scholar 

  2. Bian, Z., Zhang, X.: Pattern Recognition, pp. 198, 87-90, 113-116;120-121. TsingHua Press, Beijing (2000)

    Google Scholar 

  3. Kohavi, R., John, G.: Wrappers for feature subset selection. Artificial Intelligence 97, 273–324 (1997)

    Article  MATH  Google Scholar 

  4. Furlanello, C., Serafini, M., Merler, S., Jurman, G.: Entropy-based gene ranking without selection bias for the predictive classification of microarray data. BMC Bioinformatics 4, 54 (2003)

    Article  Google Scholar 

  5. John, G.H., Kohavi, R., Pfleger, K.: Irrelevant features and the subset selection problem. In: Machine Learning: Proceedings of the 11th International Conference, pp. 121–129 (1994)

    Google Scholar 

  6. Blum, A.L., Langley, P.: Selection of relevant features and examples in machine learning. Artificial Intelligence 97, 245–271 (1997)

    Article  MATH  MathSciNet  Google Scholar 

  7. Puuronen, S., Tsymbal, A.: Local feature selection with dynamic integration of classifiers. Fundamenta Informaticae 47, 91–117 (2001)

    MATH  MathSciNet  Google Scholar 

  8. Hansen, J.V.: Combining predictors: comparison of five meta machine learning methods. Information Science 119, 91–105 (1999)

    Article  Google Scholar 

  9. Opitz, D.W., Maclin, R.F.: An empirical evaluation of bagging and boosting for artificial neural networks. International conference on neural networks 3, 1401–1405 (1997)

    Google Scholar 

  10. Li, X., Rao, S., Wang, Y., Gong, B.: Gene mining: a novel and powerful ensemble decision approach to hunting for disease genes using microarray expression profiling. Nucl. Acids Res. 32, 2685–2694 (2004)

    Article  Google Scholar 

  11. Zheng, Z., Webb, G., Ting, K.: Integrating boosting and stochastic attribute selection committees for further improving the performance of decision tree learning. In: 10th International Conference on Tools With Artificial Intelligence TAI 1998, Society IC, Los Alamitos, USA, pp. 216–223 (1998)

    Google Scholar 

  12. Wang, H.Y., Li, X., Guo, Z.: Research on pattern classification methods using gene expression data. Biomedical Engineering Journal (2005) (in press)

    Google Scholar 

  13. Kurra, G., Niu, W., Bhatnagar, R.: Mining microarray expression data for classifier gene-cores. In: Proceedings of the Workshop on Data Mining in Bioinformatics, pp. 8–14 (2001)

    Google Scholar 

  14. Alizadeh, A.A., Eisen, M.B., Davis, R.E., Ma, C., Lossos, I.S., Rosenwald, A., Boldrick, J.C., Sabet, H., Tran, T., Yu, X., Powell, J.I., Yang, L., Marti, G.E., Moore, T., Hudson Jr., J., Lu, L., Lewis, D. B., Tibshirani, R., Sherlock, G., Chan, W.C., Greiner, T.C., Weisenburger, D.D., Armitage, J.O., Warnke, R., Levy, R., Wilson, W., Grever, M.R., Byrd, J.C., Botstein, D., Brown, P.O., Staudt, L.M.: Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature 403, 503–511 (2000)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wang, H., Zhang, Q., Wang, Y., Li, X., Rao, S., Ding, Z. (2005). A Novel Feature Ensemble Technology to Improve Prediction Performance of Multiple Heterogeneous Phenotypes Based on Microarray Data. In: Wang, L., Jin, Y. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2005. Lecture Notes in Computer Science(), vol 3614. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11540007_109

Download citation

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Publish with us

Policies and ethics