- [1]MO-MIX: Multi-Objective Multi-Agent Cooperative Decision-Making With Deep Reinforcement Learning[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 45 (10) : 12098 - 12112.
- [2]Huai-Ning Wu, Derong Liu, Yin Yang, Biao Luo.Event-Triggered Optimal Control With Performance Guarantees Using Adaptive Dynamic Programming[J].IEEE Transactions on Neural Networks and Learning: In Press.
- [3]Tingwen Huang, Huai-Ning Wu, Yin Yang, Biao Luo.Balancing Value Iteration and Policy Iteration for Discrete-Time Control[J].IEEE Trans. on Syst., Man, Cybern.: Syst.: In Press.
- [4]Jiangjiang Liu, Tingwen Huang, Derong Liu, Biao Luo.Output Tracking Control Based on Adaptive Dynamic Programming With Multistep Policy Evaluation[J].IEEE Trans. on Syst., Man, Cybern.: Syst.: In Press.
- [5]Derong Liu, Biao Luo*, Shan Xue.Event-triggered adaptive dynamic programming for zero-sum game of partially-unknown continuous-time nonlinear systems[J].IEEE Trans. on Syst., Man, Cybern.: Syst.
- [6]Yueheng Li, Derong Liu, Biao Luo*, Zhanyu Yang.Adaptive synchronization of delayed memristive neural networks with unknown parameters[J].IEEE Trans. on Syst., Man, Cybern.: Syst.: In Press.
- [7]Derong Liu, Yin Yang, Biao Luo.Adaptive Q-Learning for Data-Based Optimal Output Regulation With Experience Replay[J].IEEE Transactions on Cybernetics, 2018, 48 (12) : 3337-3348.
- [8]Tingwen Huang, Huai-Ning Wu, Biao Luo.Optimal Output Regulation for Model-Free Quanser Helicopter With Multi-Step Q-Learning[J].IEEE Transactions on Industrial Electronics, 2018, 65 (6) : 4953-4961.
- [9]Huai-Ning Wu, Derong Liu, Biao Luo.Adaptive Constrained Optimal Control Design for Data-Driven Nonlinear Discrete-Time Systems with Critic-only Structure[J].IEEE Transactions on Neural Networks and Learning, 2018, 29 (6) : 2099-2111.
- [10]Frank L. Lewis, Ding Wang, Huai-Ning Wu, Derong Liu, Biao Luo.Policy gradient adaptive dynamic programming for data-based optimal control[J].IEEE Transactions on Cybernetics, 2017, 47 (10) : 3341–3354.
- [11]Ding Wang, Tingwen Huang, Derong Liu, Biao Luo.Model-free optimal tracking control via critic-only Q-learning[J].IEEE Transactions on Neural Networks and Learning, 2016, 27 (10) : 2134–2144.
- [12]Xiong Yang, Huai-Ning Wu, Tingwen Huang, Biao Luo.Data-driven H∞ control for nonlinear distributed parameter systems.IEEE Transactions on Neural Networks and Learning, 2015, 26 (11) : 2949-2961.
- [13]Hongwen Ma, Biao Luo, Hongliang Li, Derong Liu, Ding Wang.An approximate optimal control approach for robust stabilization of a class of discrete-time nonlinear systems with uncertainties[J].IEEE Trans. on Syst., Man, Cybern.: Syst., 2016, 46 (5) : 713–717.
- [14]Han-Xiong Li, Huai-Ning Wu, Biao Luo.Adaptive optimal control of highly dissipative nonlinear spatially distributed processes with neuro-dynamic programming[J].IEEE Transactions on Neural Networks and Learning, 2015, 26 (4) : 684-696.
- [15]Tingwen Huang, Huai-Ning Wu, Biao Luo.Off-policy reinforcement learning for H∞ control design[J].IEEE Transactions on Cybernetics, 2015, 45 (1) : 65-76.
- [16]Derong Liu, Tingwen Huang, Huai-Ning Wu, Biao Luo.Data-based approximate policy iteration for nonlinear continuous-time optimal control design[J].Automatica, 2014, 50 (12) : 3281-3290.
- [17]Han-Xiong Li, Huai-Ning Wu, Biao Luo.Data-based optimal neuro-control design with reinforcement learning for dissipative spatially distributed processes[J].Industrial & Engineering Chemistry Research, 2014, 53 (29) : 8106-8119.
- [18]Biao Luo, Huai-Ning Wu.Neural network based online simultaneous policy update algorithm for solving the HJI equation in nonlinear H∞ control[J].IEEE Transactions on Neural Networks and Learning, 2012, 23 (12) : 1884-1895.
- [19]Huai-Ning Wu, Biao Luo.Approximate optimal control design for nonlinear one-dimensional parabolic PDE systems using empirical eigenfunctions and neural network[J].IEEE Transactions on Systems, Man, and Cybernetics, 2012, 42 (6) : 1538-1549.
- [20]Biao Luo, Huai-Ning Wu.Heuristic dynamic programming algorithm for optimal control design of linear continuous-time hyperbolic PDE systems[J].Industrial & Engineering Chemistry Research, 2012, 51 (27) : 9310-9319.
|