یادگیری در دستگاه: بازپرداخت تصادفی و کانال یادگیری عمیق

عنوان انگلیسی

Learning in the machine: Random backpropagation and the deep learning channel

کد مقاله	سال انتشار	تعداد صفحات مقاله انگلیسی
135740	2018	70 صفحه PDF

منبع

Publisher : Elsevier - Science Direct (الزویر - ساینس دایرکت)

Journal : Artificial Intelligence, Volume 260, July 2018, Pages 1-35

ترجمه کلمات کلیدی

یادگیری عمیق، شبکه های عصبی، بازپرداخت، یادگیری محلی،

کلمات کلیدی انگلیسی

Deep learning; Neural networks; Backpropagation; Local learning;

دانلود رایگان 2 صفحه اول مقاله لاتین (PDF)

پیش نمایش مقاله

چکیده انگلیسی

Random backpropagation (RBP) is a variant of the backpropagation algorithm for training neural networks, where the transpose of the forward matrices are replaced by fixed random matrices in the calculation of the weight updates. It is remarkable both because of its effectiveness, in spite of using random matrices to communicate error information, and because it completely removes the taxing requirement of maintaining symmetric weights in a physical neural system. To better understand random backpropagation, we first connect it to the notions of local learning and learning channels. Through this connection, we derive several alternatives to RBP, including skipped RBP (SRBP), adaptive RBP (ARBP), sparse RBP, and their combinations (e.g. ASRBP) and analyze their computational complexity. We then study their behavior through simulations using the MNIST and CIFAR-10 benchmark datasets. These simulations show that most of these variants work robustly, almost as well as backpropagation, and that multiplication by the derivatives of the activation functions is important. As a follow-up, we study also the low-end of the number of bits required to communicate error information over the learning channel. We then provide partial intuitive explanations for some of the remarkable properties of RBP and its variations. Finally, we prove several mathematical results for RBP and its variants including: (1) the convergence to optimal fixed points for linear chains of arbitrary length; (2) convergence to fixed points for linear autoencoders with decorrelated data; (3) long-term existence of solutions for linear systems with a single hidden layer, and their convergence in special cases; and (4) convergence to fixed points of non-linear chains, when the derivative of the activation functions is included.

مقالات مرتبط

سلب حق مالکیت و درآمد دولت محلی از مالیات بر ملک: مورد مناطق آموزش و پرورش گرجستان

مالیات بر املاک و اثرات آن بر رفتار استراتژیک اجاره دادن و فروش برای انحصار گران کالاهای بادوام

مالیات بر املاک در جاده های خصوصی

رقابت مالیاتی ایالتی و محلی در مدل فضایی با مالیات فروش و مالیات بر املاک مسکونی

خانه های کوچک، مدارس دولتی و سرمایه گذاری های مالیات بر اموال