In this work a multi-step approach for clustering assessment, visualization and data validation is introduced. Three main approaches for data clustering are used and compared: K-means, Self Organizing Maps and Probabilistic Principal Surfaces. A model explorer approach with different similarity measures is used to obtain the best parameters of the methods. The approach is used to identify genes periodically expressed in tumors related to the human cell cycle. Finally, clusters are validated by using GO Term information. ©2007 IEEE.
|Titolo:||Clustering, assessment and validation: An application to gene expression data|
|Data di pubblicazione:||2007|
|Appare nelle tipologie:||4.1 Contributo in Atti di convegno|