دانلود مقاله ISI انگلیسی شماره 48590
ترجمه فارسی عنوان مقاله

بررسی و بهبود شبکه های عصبی پرسپترون چندلایه ای برای امتیازدهی اعتباری

عنوان انگلیسی
Investigation and improvement of multi-layer perceptron neural networks for credit scoring
کد مقاله سال انتشار تعداد صفحات مقاله انگلیسی
48590 2015 9 صفحه PDF
منبع

Publisher : Elsevier - Science Direct (الزویر - ساینس دایرکت)

Journal : Expert Systems with Applications, Volume 42, Issue 7, 1 May 2015, Pages 3508–3516

ترجمه کلمات کلیدی
اعتبارسنجی - شبکه های عصبی - انتشار خطا
کلمات کلیدی انگلیسی
Credit scoring; Neural networks; Back propagation
پیش نمایش مقاله
پیش نمایش مقاله  بررسی و بهبود شبکه های عصبی پرسپترون چندلایه ای برای امتیازدهی اعتباری

چکیده انگلیسی

Multi-Layer Perceptron (MLP) neural networks are widely used in automatic credit scoring systems with high accuracy and efficiency. This paper presents a higher accuracy credit scoring model based on MLP neural networks that have been trained with the back propagation algorithm. Our work focuses on enhancing credit scoring models in three aspects: (i) to optimise the data distribution in datasets using a new method called Average Random Choosing; (ii) to compare effects of training–validation–test instance numbers; and (iii) to find the most suitable number of hidden units. We trained 34 models 20 times with different initial weights and training instances. Each model has 6 to 39 hidden units with one hidden layer. Using the well-known German credit dataset we provide test results and a comparison between models, and we get a model with a classification accuracy of 87%, which is higher by 5% than the best result reported in the relevant literature of recent years. We have also proved that our optimisation of dataset structure can increase a model’s accuracy significantly in comparison with traditional methods. Finally, we summarise the tendency of scoring accuracy of models when the number of hidden units increases. The results of this work can be applied not only to credit scoring, but also to other MLP neural network applications, especially when the distribution of instances in a dataset is imbalanced.