Robust Bionic Learning System Design Based on FBFN and Its Application to Motion Balance Control
CAI Jianxian1,2, RUAN Xiaogang1
1. College of Electronic Information and Control Engineering, Beijing University of Technology, Beijing 100124, China; 2. Institute of Disaster Prevention, Langfang 06S201, China
蔡建羡, 阮晓钢. 基于FBFN的鲁棒仿生学习系统设计及在运动平衡控制中的应用[J]. 机器人, 2010, 32(6): 732-740..
CAI Jianxian, RUAN Xiaogang. Robust Bionic Learning System Design Based on FBFN and Its Application to Motion Balance Control. ROBOT, 2010, 32(6): 732-740..
Abstract:Aiming at the motion balance control problem of a two-wheeled upright robot and combining OCPA(operant conditioning probabilistic automaton) bionic learning system,a robust bionic learning control scheme based on fuzzy basis function is designed.It doesn't require prior knowledge of dynamic system or off-line learning phase.The architecture of the robust bionic learning controller contains a bionic learning unit,a gain controller unit and a robust adaptive control unit The bionic learning unit is realized based on fuzzy basis function network(FBFN),and it is not only employed to generate operant action for approximating nonlinear parts,but also to evaluate the operant action.Based on the error measurement signal provided by performance measurement mechanism,the orientation information is generated to tune operant action generation network.The function of the gain controller unit is to guarantee the stability and performance of system,and the function of the robust adaptive unit is to eliminate the approximation error of the FBFN and external disturbances.Besides, the proposed scheme can significantly shorten the learning time and guarantee the stability of system by on-line tuning all parameters of FBFN based on Lyapunov stability theory.The stability of the robust bionic learning controller is proved in theory,and its feasibility and effectiveness can be demonstrated from the results of simulation experiment.
[1] Unsal C,Kachroo P,Bay J S.Multiple stochastic learning automata for vehicle path control in an automated highway system[J].IEEE Transactions on Systems,Man,and Cybernetics:Part A,1999,29(1):120-128.
[2] Chang H S,Fu M C,Hu J Q,et al.Recursive learning automata approach to Markov decision processes[J].IEEE Transactions on Automatic Control,2007,52(7):1349-1355.
[3] Sutton R S,Barto A G.Reinforcement learning[M].Cambridge,USA:MIT,1998.
[4] Kondo T,Ito K.A reinforcement learning with evolutionary state recruitment strategy for autonomous mobile robots control[J].Robotics and Autonomous Systems,2004,46(2):111-124.
[5] 蔡建羡,阮晓钢,郜园园.随机模糊控制策略及其在机器人控制中的应用[J].电机与控制学报,2009,13(5):754-761.Cai Jianxian,Ruan Xiaogang,Gao Yuanyuan.Stochastic fuzzy control strategy and application to robot control[J].Electric Machines and Control,2009,13(5):754-761.
[6] Ruan X G,Cai J X.Stochastic fuzzy controller based on OCPFA and applied on two-wheeled self-balanced robot[M]//Advances in Intelligent and Soft Computing,vol.62.Berlin,Germany:Springer-Verlag,2009:141-151.
[7] Skinner B F.The behavior of organisms[M].New York,USA:Appleton-Century-Crofts,1938.
[8] Kuijaca O,Lewis F L.Adaptive critic design using non-linear network structures[J].International Journal of Adaptive Control and Signal Processing,2003,17(6):431-445.
[9] Lin C K.Adaptive critic autopilot design of bank-to-turn missiles using fuzzy basis function networks[J].IEEE Transactions on Systems,Man,and Cybernetics:Part B,2005,35(2):197-207.
[10] Lin C K.Robust adaptive critic control of nonlinear systems using fuzzy basis function networks:An LMI approach[J].Information Sciences,2007,177(22):4934-4946.