Download English ISI article No. 79757
Article title (translated from Persian)

A boundedness result for direct heuristic dynamic programming

English title
A boundedness result for the direct heuristic dynamic programming
Article code : 79757
Publication year : 2012
Length of English article : 7 pages (PDF)
Source

Publisher : Elsevier - Science Direct

Journal : Neural Networks, Volume 32, August 2012, Pages 229–235

Keywords (translated from Persian)
Approximate dynamic programming (ADP); Direct heuristic dynamic programming (direct HDP); Lyapunov stability; Uniform ultimate boundedness (UUB)
English keywords
Approximate dynamic programming (ADP); Direct heuristic dynamic programming (direct HDP); Lyapunov stability; Uniformly ultimately boundedness (UUB)
Article preview

English abstract

Approximate/adaptive dynamic programming (ADP) has been studied extensively in recent years for its potential to scale to problems with large state and control spaces, including those involving continuous states and continuous controls. The applicability of ADP algorithms, especially the adaptive critic designs, has been demonstrated in several case studies. Direct heuristic dynamic programming (direct HDP) is one of the ADP algorithms inspired by the adaptive critic designs. It has been shown to be applicable to industrial-scale, realistic, and complex control problems. In this paper, we provide a uniform ultimate boundedness (UUB) result for the direct HDP learning controller under mild and intuitive conditions. Using a Lyapunov approach, we show that the estimation errors of the learning parameters, that is, the weights in the action and critic networks, remain uniformly ultimately bounded. This result provides, for the first time, a useful controller convergence guarantee for the direct HDP design.