Download English ISI Article No. 132172
Article Title
Biased Dropout and Crossmap Dropout: Learning towards effective dropout regularization in convolutional neural network
Article Code: 132172
Publication Year: 2018
Length: 24 pages (PDF)
Source

Publisher: Elsevier - Science Direct

Journal: Neural Networks, available online 9 April 2018

Keywords
Dropout; Regularization; Convolutional neural network

Abstract

Training a deep neural network with a large number of parameters often leads to overfitting. Recently, Dropout has been introduced as a simple yet effective regularization approach to combat overfitting in such models. Although Dropout has shown remarkable results in many deep neural network settings, its actual effect on CNNs has not been thoroughly explored. Moreover, training a Dropout model significantly increases training time, as it takes longer to converge than a non-Dropout model with the same architecture. To deal with these issues, we propose Biased Dropout and Crossmap Dropout, two novel extensions of Dropout based on the behaviour of hidden units in a CNN model. Biased Dropout divides the hidden units in a given layer into two groups based on their activation magnitude and applies a different Dropout rate to each group. Hidden units with higher activation values, which contribute more to the network's final performance, are retained with a lower Dropout rate, while units with lower activation values are exposed to a higher Dropout rate to compensate. The second approach, Crossmap Dropout, is an extension of regular Dropout in the convolution layer. The feature maps of a convolution layer are strongly correlated with one another, particularly at identical pixel locations across maps. Crossmap Dropout maintains this important correlation while breaking the correlation between adjacent pixels across all feature maps: the same Dropout mask is applied to every feature map, so that all units at equivalent positions in each feature map are either dropped or kept together during training. Our experiments on MNIST, the MNIST Basic and random-background variants, CIFAR10, DNA splice junction, and IMAGENET show that our approaches provide better generalization than regular Dropout. Moreover, Biased Dropout converges faster during training, suggesting that assigning noise appropriately to hidden units can lead to effective regularization.
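
To make the two ideas in the abstract concrete, the following is a minimal PyTorch sketch of how they might be implemented. The per-example median split, the example drop rates (0.7 / 0.3 and 0.5), the inverted-dropout rescaling, and the helper names `biased_dropout` / `crossmap_dropout` are illustrative assumptions, not details taken from the paper.

```python
import torch

def biased_dropout(x, p_weak=0.7, p_strong=0.3, training=True):
    """Biased Dropout sketch: split units into two groups by activation
    magnitude; drop the strong group at a lower rate (p_strong) and the
    weak group at a higher rate (p_weak). Median split is an assumption."""
    if not training:
        return x
    flat = x.flatten(1)                                    # (batch, units)
    median = flat.abs().median(dim=1, keepdim=True).values
    strong = flat.abs() >= median                          # high-activation group
    drop_p = torch.where(strong,
                         torch.full_like(flat, p_strong),
                         torch.full_like(flat, p_weak))
    keep = torch.bernoulli(1.0 - drop_p)
    # Inverted-dropout scaling keeps the expected activation unchanged.
    return (flat * keep / (1.0 - drop_p)).view_as(x)

def crossmap_dropout(x, p=0.5, training=True):
    """Crossmap Dropout sketch for a conv activation of shape (N, C, H, W):
    one spatial mask of shape (N, 1, H, W) is broadcast over the channel
    dimension, so the unit at a given pixel location is dropped or kept in
    every feature map simultaneously."""
    if not training:
        return x
    n, _, h, w = x.shape
    keep = torch.bernoulli(torch.full((n, 1, h, w), 1.0 - p, device=x.device))
    return x * keep / (1.0 - p)

# Usage on a dummy convolution output (hypothetical shapes).
if __name__ == "__main__":
    feats = torch.randn(8, 16, 14, 14)        # (batch, channels, height, width)
    print(crossmap_dropout(feats).shape)      # torch.Size([8, 16, 14, 14])
    print(biased_dropout(feats).shape)        # torch.Size([8, 16, 14, 14])
```

In a training loop, such functions would sit where a standard `nn.Dropout` call would otherwise go. Note the contrast with PyTorch's built-in `nn.Dropout2d`, which drops entire feature maps; the crossmap scheme described in the abstract instead shares one mask across all feature maps at each pixel location.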