Chin. Phys. Lett.  2003, Vol. 20 Issue (5): 774-777    DOI:
Original Articles |
Data Preprocessing in Cluster Analysis of Gene Expression
YANG Chun-Mei1;WAN Bai-Kun1;GAO Xiao-Feng2
1College of Precision Instrument and Opto-Electronics Engineering, Tianjin University, Tianjin 300072 2Motorola (China) Electronics Ltd., Tianjin 300457
Cite this article:   
YANG Chun-Mei, WAN Bai-Kun, GAO Xiao-Feng 2003 Chin. Phys. Lett. 20 774-777
Download: PDF(465KB)  
Export: BibTeX | EndNote | Reference Manager | ProCite | RefWorks
Abstract Considering that the DNA microarray technology has generated explosive gene expression data and that it is urgent to analyse and to visualize such massive datasets with efficient methods, we investigate the data preprocessing methods used in cluster analysis, normalization or logarithm of the matrix, by using hierarchical clustering, principle component analysis (PCA) and self-organizing maps (SOMs). The results illustrate that when using the Euclidean distance as measuring metrices, logarithm of relative expression level is the best preprocessing method, while data preprocessed by normalization cannot attain the expected results because the data structure is ruined. If there are only a few principle components, the PCA is an effective method to extract the frame structure, while SOMs are more suitable for a specific structure.


Keywords: 87.80.Tq      07.05.Kf      89.70.+c      87.14.Gg     
Published: 01 May 2003
PACS:  87.80.Tq  
  07.05.Kf (Data analysis: algorithms and implementation; data management)  
  89.70.+c  
  87.14.Gg  
TRENDMD:   
URL:  
https://cpl.iphy.ac.cn/       OR      https://cpl.iphy.ac.cn/Y2003/V20/I5/0774
Service
E-mail this article
E-mail Alert
RSS
Articles by authors
YANG Chun-Mei
WAN Bai-Kun
GAO Xiao-Feng
Related articles from Frontiers Journals
[1] WU Zheng-Yi, FENG Jin-Fu, WU Xiao-Shan. Thermal Vibration and Twist Induced Semiconducting Behaviour in Short DNA Wires[J]. Chin. Phys. Lett., 2009, 26(2): 774-777
[2] ZHAO Yi-Bo, HAN Zheng-Fu, CHEN Jin-Jian, GU You-Zhen, GUO Guang-Can. Secret Key Distillation for Continuous Variable Quantum Key Distribution against Gaussian Classical Eve[J]. Chin. Phys. Lett., 2008, 25(9): 774-777
[3] LU Hang-Jun, , GONG Xiao-Jing, WANG Chun-Lei, FANG Hai-Ping, WAN Rong-Zheng,. Effect of Vibration on Water Transport through Carbon Nanotubes[J]. Chin. Phys. Lett., 2008, 25(3): 774-777
[4] ZHU Meng-Hua, LIU Liang-Gang, XU Ao-Ao, Ma Tao. Automatic Estimation of Peak Regions in Gamma-Ray Spectra Measured by NaI Detector[J]. Chin. Phys. Lett., 2008, 25(11): 774-777
[5] LI Jing-Yuan, YANG Zai-Xing, FANG Hai-Ping, ZHOU Ru-Hong, TANGXiao-Wei. Effect of the Carbon-Nanotube Length on Water Permeability[J]. Chin. Phys. Lett., 2007, 24(9): 774-777
[6] S. ZDRAVKOVIC, M. V. SATARIC. Impact of Viscosity on DNA Dynamics[J]. Chin. Phys. Lett., 2007, 24(5): 774-777
[7] QIN Tao, ZHAO Mei-Sheng, ZHANG Yong-De,. Classical Capacity for a Continuous Variable Teleportation Channel[J]. Chin. Phys. Lett., 2007, 24(2): 774-777
[8] YANG Shuai, ZHAO Mei-Sheng, LIU Nai-Le, CHEN Zeng-Bing. Universal Quantum Cloning Machines for Two Identical Mixed Qubits[J]. Chin. Phys. Lett., 2007, 24(11): 774-777
[9] ZHAO Wei-Jia, WENG Yu-Quan, FU Jing-Li,. Lie Symmetries and Conserved Quantities for Super-Long Elastic Slender Rod[J]. Chin. Phys. Lett., 2007, 24(10): 774-777
[10] ZHOU Tao, LIU Jian-Guo, WANG Bing-Hong. Notes on the Algorithm for Calculating Betweenness[J]. Chin. Phys. Lett., 2006, 23(8): 774-777
[11] LI Zhuo, ZHU Xue-Min, ZHANG Li-Hua, HUANG Xu-Guang, REN Yu-Feng, CHEN Geng-Hua, YANG Qian-Sheng, FENG Ji. An Economical Magnetocardiogram System Based on High-Tc SQUIDs[J]. Chin. Phys. Lett., 2006, 23(8): 774-777
[12] Venkatesh Rajagopalan, Asok Ray. Wavelet Space Partitioning for Symbolic Time Series Analysis[J]. Chin. Phys. Lett., 2006, 23(7): 774-777
[13] ZHAO Hui. Separability Criteria for Quantum Mixed States in Terms of Trace Norm[J]. Chin. Phys. Lett., 2006, 23(7): 774-777
[14] DENG Fu-Guo, , ZHOU Ping, LI Xi-Han, LI Chun-Yan, ZHOU Hong-Yu,. Efficient Multiparty Quantum Secret Sharing with Greenberger--Horne--Zeilinger States[J]. Chin. Phys. Lett., 2006, 23(5): 774-777
[15] WANG Xiao-Feng, LEI Xiao-Ling, FANG Hai-Ping. What Governs the Unzipping Process of Double-Stranded DNA[J]. Chin. Phys. Lett., 2006, 23(5): 774-777
Viewed
Full text


Abstract