English ISI article No. 150852
Article title
A novel clustering algorithm based on data transformation approaches
Article code: 150852
Publication year: 2017
English article length: 12 pages (PDF)
Source

Publisher: Elsevier - ScienceDirect

Journal: Expert Systems with Applications, Volume 76, 15 June 2017, Pages 59-70


Abstract

Clustering provides a knowledge acquisition method for intelligent systems. This paper proposes a novel data-clustering algorithm that combines a new initialization technique, the K-means algorithm, and a new gradual data transformation approach. By increasing the coherence of the clusters, it provides more accurate clustering results than the K-means algorithm and its variants. The proposed data transformation approach also solves the problem of generating empty clusters, which frequently occurs in other clustering algorithms. An efficient method based on the principal component transformation and a modified silhouette algorithm is also proposed to determine the number of clusters. Several data sets are used to evaluate how well the proposed method handles the empty cluster generation problem, and to compare its accuracy and computational performance with other K-means based initialization techniques and clustering methods. The developed method for estimating the number of clusters is likewise evaluated and compared with other estimation algorithms. The significance of the proposed method lies in addressing the limitations of K-means based clustering and in improving the accuracy of clustering, an important method in data mining and expert systems. Applied to knowledge acquisition in time series data such as wind, solar, electric load, and stock market data, the proposed method provides a pre-processing tool for selecting the most appropriate data to feed into neural networks or other estimators used to forecast such time series. In addition, the knowledge discovered by the proposed K-means clustering can be used to develop rule-based expert systems, which is one of the main impacts of the proposed method.
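
To make the empty cluster problem mentioned in the abstract concrete, the following is a minimal sketch of plain (Lloyd's) K-means in pure Python. It is not the paper's algorithm: the abstract does not describe the initialization technique or the gradual data transformation, so neither is reproduced here. The function names, the toy data, and the deliberately poor initial centroids are all assumptions chosen for illustration.

```python
def assign(points, centroids):
    """Label each point with the index of its nearest centroid (squared Euclidean distance)."""
    labels = []
    for p in points:
        dists = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centroids]
        labels.append(dists.index(min(dists)))
    return labels


def update(points, labels, centroids):
    """Recompute each centroid as the mean of its members and collect empty cluster indices."""
    new_centroids, empty = [], []
    for k, old in enumerate(centroids):
        members = [p for p, label in zip(points, labels) if label == k]
        if not members:
            # Plain K-means has no defined mean for an empty cluster;
            # this sketch keeps the old centroid and reports the index.
            empty.append(k)
            new_centroids.append(old)
        else:
            dim = len(old)
            new_centroids.append(
                tuple(sum(m[d] for m in members) / len(members) for d in range(dim))
            )
    return new_centroids, empty


def kmeans(points, centroids, max_iter=100):
    """Run Lloyd's iterations until the centroids stop changing or max_iter is reached."""
    labels, empty = [], []
    for _ in range(max_iter):
        labels = assign(points, centroids)
        new_centroids, empty = update(points, labels, centroids)
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, labels, empty


# Toy data (hypothetical): two tight groups and one badly placed initial centroid.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
initial = [(0, 0), (10, 10), (100, 100)]

centroids, labels, empty = kmeans(points, initial)
print("labels:", labels)
print("empty clusters:", empty)  # cluster 2 never receives a point
```

With these inputs the third centroid starts far from every point, so no point is ever assigned to it and the requested three clusters collapse to two. Common workarounds in standard K-means include reseeding the empty centroid or choosing better initial centroids; the paper instead attributes its fix to the proposed data transformation approach.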