Paper Infomation
Cancer Gene Extraction Based on Stepwise Regression
Full Text(PDF, 611KB)
Author: Yunfei Guo, Jie Ni, Fan Wu, Meixiang Jin, Yixing Bai
Abstract: With the expansion of the gene expression profile database, in the case of as little as possible to lose information or to retain the most critical information, gene extraction has become a main direction for the scholars. This paper excludes 1561 irrelevant genes through the definition of weighted distance firstly, and then removes 252 redundant genes by Pearson's correlation coefficient. Finally by comparing the two methods, stepwise regression after clustering and only stepwise analysis, we obtain the best combination of 8 genes.
Keywords: stepwise regression, cluster analysis, gene extraction
References:
[1] WANG SHOU-JUE, ZHOU LING-FEl. Gene Selection for Gene Expression Data Analysis, CONTROL & AUTOMATION, Vol 24,2008.
[2] LI Ying-xin, RUAN Xiao-gang. Cancer Subtype Recognition and Feature Selection with Gene Expression Profiles, ACTA ELECTRONICA SINICA,Vol 33,2005.
[3] Golub, T.R., Slonim, D.K., Tamayo, P., et al. Molecular classification of cancer:class discovery and class prediction by gene expression monitoring[J]. Science, 1999, 286: 531-537.
[4] Ramaswamy, S., Tamayo, P., et al. Multiclass cancer diagnosis using tumour gene expression signatures[J]. PNAS , 2001, 98 : 15149 – 15154
[5] WANG SHOU-JUE. Direction-Basis-Function neural networks, IJCNN’ 99, 1999: 1251-1254
[6] KAN Haijun, TANG Jun* ,SU Liangliang, A method for informative gene selection using neighborhood uncertainty and scoring criteria, Journal of Anhui University ( Natural Science Edition) Vol.38 No.1 2014
[7] Wang Jingqi, Xu Linli, Semi—supervised feature selection and clustering for multi—view data[J]. Journal of Data Acquisition and Processing,2015,30(1):106—116.
[8] Xu Jiucheng, Xu Tianhe, Sun Lin,et a1. Feature selection for cancer classification based on neighborhood rough set and par— ticle swarm optimization[J]. Journal of Chinese Computer Systems.2014,35(11):2529—2532.
[9] Xu Jiuchen,Li Tao,Sun Linl,Li Yuhuil, Feature Gene Selection Based on SNR and Neighborhood Rough Set, Journal of Data Acquisition and Processing V01.30.No.5,Sep.2015,PP.973—981
[10] Ra maswamy S.Golub T R.DNA Mi croar rays in Clinical Oncolog y[ J].Journal of Clinical Oncology,2 002; 20( 7): 1932-1 941
[11] Guyon I, Weston J, Barnhill S, et al. Gene selection for cancer classification using support vector machines. Machine Learning , 46(13): 389-422, 2000