دانلود مقاله ISI انگلیسی شماره 138435
ترجمه فارسی عنوان مقاله

انتخاب نمونه از طریق فاصله زاویه ای در فضای استدلال های یک شبکه عصبی مصنوعی

عنوان انگلیسی
Sample selection via angular distance in the space of the arguments of an artificial neural network
کد مقاله سال انتشار تعداد صفحات مقاله انگلیسی
138435 2018 19 صفحه PDF
منبع

Publisher : Elsevier - Science Direct (الزویر - ساینس دایرکت)

Journal : Computers & Geosciences, Volume 114, May 2018, Pages 98-106

پیش نمایش مقاله
پیش نمایش مقاله  انتخاب نمونه از طریق فاصله زاویه ای در فضای استدلال های یک شبکه عصبی مصنوعی

چکیده انگلیسی

In the construction of an artificial neural network (ANN) a proper data splitting of the available samples plays a major role in the training process. This selection of subsets for training, testing and validation affects the generalization ability of the neural network. Also the number of samples has an impact in the time required for the design of the ANN and the training. This paper introduces an efficient and simple method for reducing the set of samples used for training a neural network. The method reduces the required time to calculate the network coefficients, while keeping the diversity and avoiding overtraining the ANN due the presence of similar samples. The proposed method is based on the calculation of the angle between two vectors, each one representing one input of the neural network. When the angle formed among samples is smaller than a defined threshold only one input is accepted for the training. The accepted inputs are scattered throughout the sample space. Tidal records are used to demonstrate the proposed method. The results of a cross-validation show that with few inputs the quality of the outputs is not accurate and depends on the selection of the first sample, but as the number of inputs increases the accuracy is improved and differences among the scenarios with a different starting sample have and important reduction. A comparison with the K-means clustering algorithm shows that for this application the proposed method with a smaller number of samples is producing a more accurate network.