Download English ISI article No. 48586
Article title (translated from Persian)

Combination of feature selection approaches with SVM in credit scoring

English title
Combination of feature selection approaches with SVM in credit scoring
Article code : 48586
Publication year : 2010
Length of English article : 8 pages (PDF)
Source

Publisher : Elsevier - ScienceDirect

Journal : Expert Systems with Applications, Volume 37, Issue 7, July 2010, Pages 4902–4909

Keywords (translated from Persian)
Support vector machine - Linear discriminant analysis - Decision tree - Rough sets theory
English keywords
Support vector machine; Linear discriminant analysis; Decision tree; Rough sets theory; F-score

English abstract

Credit scoring is regarded as a critical topic, and the departments responsible for it collect large amounts of data to avoid wrong decisions. An effective classification model gives managers an objective basis for decisions instead of relying on intuition and experience. This study proposes four feature selection approaches, each combined with the SVM (support vector machine) classifier, that retain sufficient information for classification. Different credit scoring models are constructed by selecting attributes with the four approaches. Two UCI (University of California, Irvine) data sets are used to evaluate the accuracy of the resulting hybrid SVM models. The SVM classifier is combined with conventional statistical LDA (linear discriminant analysis), decision tree, rough sets and F-score approaches as a feature pre-processing step that optimizes the feature space by removing both irrelevant and redundant features. The paper describes the procedure of the proposed approaches and then evaluates their performance. The results of the approaches, each in combination with the SVM classifier, are compared, and a nonparametric Wilcoxon signed rank test is applied to determine whether there is any significant difference between the models. The results suggest that the hybrid credit scoring approach is mostly robust and effective in finding optimal feature subsets and is a promising method for the field of data mining.
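The F-score pre-processing step named in the abstract can be illustrated with a minimal sketch. The abstract does not give the formula, so this assumes the commonly used two-class F-score (squared distances of the class means from the overall mean, divided by the sum of the within-class sample variances). The function names `f_score` and `rank_features` and the toy data are hypothetical, introduced only for illustration; they are not taken from the paper.

```python
from statistics import mean, variance


def f_score(column, labels):
    """F-score of one feature for binary labels (1 = positive, 0 = negative).

    Assumes each class has at least two samples, since the sample
    variance uses an (n - 1) denominator.
    """
    pos = [x for x, y in zip(column, labels) if y == 1]
    neg = [x for x, y in zip(column, labels) if y == 0]
    overall = mean(column)
    # Between-class term: squared distance of each class mean from the overall mean.
    between = (mean(pos) - overall) ** 2 + (mean(neg) - overall) ** 2
    # Within-class term: sum of the two sample variances.
    within = variance(pos) + variance(neg)
    if within == 0:
        # No spread inside either class: perfectly separating or constant feature.
        return float("inf") if between > 0 else 0.0
    return between / within


def rank_features(rows, labels):
    """Return (feature indices sorted by descending F-score, list of scores)."""
    n_features = len(rows[0])
    scores = [f_score([row[j] for row in rows], labels) for j in range(n_features)]
    order = sorted(range(n_features), key=lambda j: scores[j], reverse=True)
    return order, scores


# Toy data, made up for illustration: feature 0 separates the classes, feature 1 does not.
rows = [[1.0, 5.0], [2.0, 3.0], [1.5, 4.0], [8.0, 4.5], [9.0, 3.5], [8.5, 4.0]]
labels = [0, 0, 0, 1, 1, 1]
order, scores = rank_features(rows, labels)
print(order, scores)
```

In a hybrid setup of the kind the abstract describes, only the top-ranked features would then be passed to the SVM classifier. The number of features to keep, or the score threshold, is a tuning choice that this sketch leaves open.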