 S. M. Rüger and S. E. Gauch, “Feature Reduction for Document Clustering and Classification,” Technical report, Computing Department, Imperial College, London, UK, 2000.
 D. Sullivan, “Document Warehousing and Text Mining,” Wiley Computer Publishing, p. 326, 2001.
 J. Moore, E. H. Han, D. Boley, M. Gini, R. Gross, K. Hastings, G. Karypis, V. Kumar, and B. Mobasher, “Web Page Categorization and Feature Selection Using Association Rule and Principal Component Clustering,” In 7th Workshop on Information Technologies and Systems, 1997.
 L. D. Baker and A. McCallum, “Distributional Clustering of Words for Text Classification,” In Proceedings of 21st Annual International ACM SIGIR, pp. 96-103, 1998.
 N. Slonim and N. Tishby, “The Power of Word Clusters for Text Classification,” In 23rd European Colloquium on Information Retrieval Research, 2001.
 R. Bekkerman, R. El-Yaniv, N. Tishby and Y. Winter, “Distributional Word Clusters vs. Words for Text Categorization,” Journal of Machine Learning Research, pp. 1-48, 2002.
 F. Pereira, N. Tishby and L. Lee, “Distributional Clustering of English Words,” In Meeting of the Association for Computational Linguistics, pp. 183-190, 1993.
 I. Dhillon, S. Mallela and R. Kumar, “A Divisive Information-Theoretic Feature Clustering Algorithm for Text Classification,” Journal of Machine Learning Research, pp. 1265-1287, 2003.
 Y. Yang and J. O. Pedersen, “A Comparative Study on Feature Selection in Text Categorization,” In Proceedings of 14th International Conference on Machine Learning, pp. 412-420, 1997.
 I. Dhillon, Y. Guan, and J. Fan, “Efficient Clustering of Very Large Document Collections,” In Data Mining for Scientific and Engineering Applications, Kluwer Academic Publishers, pp. 357-381, 2001.
 I. Dhillon, J. Kogan, and C. Nicholas, “Feature Selection and Document Clustering,” In A Comprehensive Survey of Text Mining, pp. 73-100, 2003.
 J. Kogan, M. Teboulle, and C. Nicholas, “Data Driven Similarity Measures for k-Means Like Clustering Algorithms,” Information Retrieval, pp. 331-349, 2005.
 I. Dhillon and D. Modha, “Concept Decompositions for Large Sparse Text Data using Clustering,” Machine Learning, pp. 143-175, 2001.
 R. O. Duda and P. E. Hart, “Pattern Classification and Scene Analysis,” Wiley & Sons, New York, 1973.
 E. Piazza, “Comparison of different classification algorithms of NOAA AVHRR images,” Proceedings SPIE, July 2000.
 G. Salton and M. McGill, “Introduction to Modern Information Retrieval,” McGraw-Hill, New York, 1983.
 P. Willett, “Recent Trends in Hierarchical Document Clustering: A Critical Review,” Information Processing and Management, Vol. 24, No. 5, pp. 577-597, 1988.
 V. Faber, “Clustering and the Continuous k-Means Algorithm,” Los Alamos Science, No. 22, pp. 138-144, 1994.
 G. H. Ball and D. J. Hall, “ISODATA, a novel method of data analysis and classification,” Technical Report, Stanford University, Stanford, 1965.
 M. S. Chen, J. Han, and P. S. Yu, “Data Mining: An Overview from a Database Perspective,” IEEE Transactions on Knowledge and Data Engineering, Vol. 8, No. 6, pp. 866-883, December 1996.