دانلود مقاله ISI انگلیسی شماره 79670
ترجمه فارسی عنوان مقاله

یک الگوریتم چند منظوره مبتنی بر برنامه نویسی ژنتیک برای طبقه بندی داده های ترکیبی

عنوان انگلیسی
A niching genetic programming-based multi-objective algorithm for hybrid data classification
کد مقاله سال انتشار تعداد صفحات مقاله انگلیسی
79670 2014 16 صفحه PDF
منبع

Publisher : Elsevier - Science Direct (الزویر - ساینس دایرکت)

Journal : Neurocomputing, Volume 133, 10 June 2014, Pages 342–357

ترجمه کلمات کلیدی
قوانین طبقه بندی کاوش داده های فضایی، برنامه نویسی ژنتیک، الگوریتم چند هدفه
کلمات کلیدی انگلیسی
Classification rules; Spatial data mining; Genetic programming; Multi-objective algorithm

چکیده انگلیسی

This paper introduces a multi-objective algorithm based on genetic programming to extract classification rules in databases composed of hybrid data, i.e., regular (e.g. numerical, logical, and textual) and non-regular (e.g. geographical) attributes. This algorithm employs a niche technique combined with a population archive in order to identify the rules that are more suitable for classifying items amongst classes of a given data set. The algorithm is implemented in such a way that the user can choose the function set that is more adequate for a given application. This feature makes the proposed approach virtually applicable to any kind of data set classification problem. Besides, the classification problem is modeled as a multi-objective one, in which the maximization of the accuracy and the minimization of the classifier complexity are considered as the objective functions. A set of different classification problems, with considerably different data sets and domains, has been considered: wines, patients with hepatitis, incipient faults in power transformers and level of development of cities. In this last data set, some of the attributes are geographical, and they are expressed as points, lines or polygons. The effectiveness of the algorithm has been compared with three other methods, widely employed for classification: Decision Tree (C4.5), Support Vector Machine (SVM) and Radial Basis Function (RBF). Statistical comparisons have been conducted employing one-way ANOVA and Tukey’s tests, in order to provide reliable comparison of the methods. The results show that the proposed algorithm achieved better classification effectiveness in all tested instances, what suggests that it is suitable for a considerable range of classification applications.