A Novel Feature Ensemble Technology to Improve Prediction Performance of Multiple Heterogeneous Phenotypes Based on Microarray Data

Wang, Haiyun; Zhang, Qingpu; Wang, Yadong; Li, Xia; Rao, Shaoqi; Ding, Zuquan

doi:10.1007/11540007_109

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3614)

Included in the following conference series:

International Conference on Fuzzy Systems and Knowledge Discovery

Abstract

Gene expression microarray technology provides global information on the transcriptional activities of essentially all genes simultaneously, and it has thus opened new applications for traditional feature selection methods in molecular biology and the life sciences. The basic strategy of traditional feature selection methods is to seek a single gene subset that leads to the best prediction of biological types, for example tumor versus normal tissue. Because biological phenotypes (e.g. complex diseases) are complex and genetically heterogeneous, robust computational approaches are needed that achieve high generalization performance by using multiple classifiers and perturbations of the data structure. The purpose of this study is to develop an ensemble decision approach to the analysis of multiple heterogeneous phenotypes. Results from an application to a lymphoma data set with five subtypes indicate that the proposed analysis strategy is feasible and powerful for classifying biological subtypes.
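The abstract does not spell out the procedure, so the following is only a minimal sketch of the general idea it names (an ensemble decision over several gene subsets and perturbed versions of the data), not the authors' method. Everything in it is an assumption made for illustration: the function names, the random choice of gene subsets, the class-stratified bootstrap, the nearest-centroid base classifier, and the plain majority vote.

```python
import random
from collections import Counter

def centroid_fit(X, y, features):
    """Per-class mean expression over the chosen gene (feature) subset."""
    sums, counts = {}, Counter(y)
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(features))
        for k, f in enumerate(features):
            acc[k] += row[f]
    return {c: [v / counts[c] for v in acc] for c, acc in sums.items()}

def centroid_predict(centroids, row, features):
    """Return the class whose centroid is nearest (squared Euclidean distance)."""
    def dist(c):
        return sum((row[f] - m) ** 2 for f, m in zip(features, centroids[c]))
    return min(sorted(centroids), key=dist)

def train_ensemble(X, y, n_members=25, subset_size=2, seed=0):
    """Train one base classifier per (random gene subset, resampled data) pair.

    Illustrative assumptions: subsets are drawn uniformly at random and the
    data are perturbed by a bootstrap drawn separately within each class.
    """
    rng = random.Random(seed)
    by_class = {}
    for i, label in enumerate(y):
        by_class.setdefault(label, []).append(i)
    members = []
    for _ in range(n_members):
        features = rng.sample(range(len(X[0])), subset_size)
        # Stratified bootstrap: resample with replacement inside every class,
        # so each class is always represented.
        idx = [rng.choice(ids) for ids in by_class.values() for _ in ids]
        Xb = [X[i] for i in idx]
        yb = [y[i] for i in idx]
        members.append((features, centroid_fit(Xb, yb, features)))
    return members

def predict_ensemble(members, row):
    """Ensemble decision: majority vote over all base classifiers."""
    votes = Counter(centroid_predict(c, row, f) for f, c in members)
    return votes.most_common(1)[0][0]
```

Each ensemble member sees a different gene subset and a different resampling of the samples, so the final vote does not depend on any single gene subset. That is the contrast with the single-subset strategy described in the abstract.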

Author information

Authors and Affiliations

Institute of Life Science and Medical Engineering, TongJi University, ShangHai, 200092, China
Haiyun Wang, Xia Li & Zuquan Ding
Department of Bioinformatics, Harbin Medical University, Harbin, 150086, China
Haiyun Wang & Xia Li
Harbin Institute of Technology, Harbin, 150001, China
Qingpu Zhang, Yadong Wang & Xia Li
Departments of Cardiovascular Medicine and Molecular Cardiology, Cleveland Clinic Foundation, Cleveland, Ohio, 44195, USA
Shaoqi Rao

Editor information

Editors and Affiliations

School of Electrical and Electronic Engineering, Nanyang Technological University, Block S1, Nanyang Avenue, 639798, Singapore
Lipo Wang
Honda Research Institute Europe GmbH, Offenbach/Main, Germany
Yaochu Jin

About this paper

Cite this paper

Wang, H., Zhang, Q., Wang, Y., Li, X., Rao, S., Ding, Z. (2005). A Novel Feature Ensemble Technology to Improve Prediction Performance of Multiple Heterogeneous Phenotypes Based on Microarray Data. In: Wang, L., Jin, Y. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2005. Lecture Notes in Computer Science, vol 3614. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11540007_109

DOI: https://doi.org/10.1007/11540007_109
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28331-7
Online ISBN: 978-3-540-31828-6
eBook Packages: Computer Science; Computer Science (R0); Springer Nature Proceedings Computer Science