A new prospective for Learning Automata: A machine learning approach

Authors: Wen Jiang^a (wenjiang@sjtu.edu.cn); Bin Li^b (stonebupt@gmail.com); Shenghong Li^a (shli@sjtu.edu.cn); Yuanyan Tang^c (yytang@umac.mo); Chun Lung Philip Chen^c (philipchen@umac.mo)
Keywords: Learning Automata; ϵ-Optimal; Bayesian estimator; Maximum Likelihood Estimator
Journal: Neurocomputing
Publication year: 2016
Publication date: 5 May 2016
Volume: 188
Issue: Complete
Pages: 319-325
Full-text size: 277 K

Abstract

In the field of Learning Automata (LA), designing faster learning algorithms has always been a key issue. Among the solutions reported in the literature, the stochastic estimator reward-inaction learning automaton (SE_RI), which belongs to the Maximum Likelihood Estimator (MLE) based LAs, has been recognized as the fastest ϵ-optimal LA. In this paper, we first point out the limitations of traditional MLE-based LAs and then introduce a Bayesian estimator based approach, which is shown to be equivalent to Laplace smoothing of the traditional method, to overcome these limitations. The key idea is that the Bayesian estimator, which estimates the probability of selecting each action in the LA, aims to reconstruct the Bernoulli distribution from sequential data and is formalized using the exponential conjugate family, so that the LA has a relatively simple form that is easy to implement. In addition, we indicate that this Bayesian estimator could be applied to update almost all existing MLE-based LAs. Based on the proposed Bayesian estimator, a new LA, the Generalized Bayesian Stochastic Estimator (GBSE) LA, is presented and proved to be ϵ-optimal. Finally, extensive experimental results on benchmarks demonstrate that the proposed learning scheme is more efficient than the current best LA, SE_RI.
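
The stated equivalence between the Bayesian estimator and Laplace smoothing can be illustrated with a minimal sketch. It assumes a uniform Beta(1, 1) prior over a Bernoulli parameter, which is the textbook setting in which the posterior mean equals the add-one (Laplace) estimate; the abstract does not give the exact prior or update rule used by GBSE, so the function names and default parameters below are hypothetical and this is not the authors' algorithm.

```python
from fractions import Fraction


def mle_estimate(rewards, trials):
    """Maximum likelihood estimate of a Bernoulli parameter: rewards / trials."""
    if trials == 0:
        # With no observations the ratio is undefined; this is one
        # well-known weakness of the plain MLE.
        raise ValueError("MLE is undefined before any trial")
    return Fraction(rewards, trials)


def bayes_estimate(rewards, trials, alpha=1, beta=1):
    """Posterior mean under a Beta(alpha, beta) prior (assumed, not from the paper).

    With alpha = beta = 1 this is exactly Laplace (add-one) smoothing:
    (rewards + 1) / (trials + 2).
    """
    return Fraction(rewards + alpha, trials + alpha + beta)


if __name__ == "__main__":
    # An action tried three times and rewarded every time.
    print(mle_estimate(3, 3))    # MLE jumps to certainty
    print(bayes_estimate(3, 3))  # smoothed estimate stays below 1
    print(bayes_estimate(0, 0))  # defined even with no data
```

Under these assumptions the smoothed estimate is defined before an action has ever been tried and never reaches exactly 0 or 1 after a finite number of trials, whereas the plain ratio does both of the things the smoothing avoids.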
