Multi-Sensor Cooperative Tracking Using Distributed Nash Q-Learning

Article Preview

Abstract:

Traditional target tracking algorithm has a disadvantage of excessive dependence on the environment model. Thus a multi-sensor cooperative tracking method using distributed Nash Q-learning was proposed. Distributed Nash Q-learning with model-free was firstly described. Then sensor action and reward function were defined, which both are very crucial to the learning. Sensor action was only subjected to angle control, and reward function was given by calculating the trace of one time-step prediction error covariance. Nash tragedy can not be directly calculated, therefore, a probability statistics method using Bayesian inference was used to update the Q function. Simulation of passive tracking merely with angle measurements shows that this algorithm can enhance the adaptation to environment change and the tracking accuracy.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 591-593)

Pages:

1475-1478

Citation:

Online since:

November 2012

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Y. He, G.H. Wang, D.J. Lu and Y.N. Peng: Multisensor Information Fusion with Application (Publishing House of Electronics Industry, China 2000), p.242

Google Scholar

[2] J. Ousingsawat and M.E. Campbell: AIAA Guidance, Navigation and Control Conference and Exhibit (Providence, Rhode Island, 2004), p.2188

Google Scholar

[3] K. Kirkpatrick and J. Valasek: AIAA Guidance, Navigation and Control Conference and Exhibit (Honolulu, Hawaii, 2008), p.5

Google Scholar

[4] D.D. Dier and S.S. Joshi: AIAA Guidance, Navigation and Control Conference and Exhibit (Hilton Head, South Carolina, 2007), p.7

Google Scholar

[5] C.J.C.H. Watkins and P. Dayan: Machine learning, Vol. 8 (1992), p.279

Google Scholar

[6] Y.W. Lin, V.B. Le, Y. Xue, Z.Y. Feng and Y.J. Zhang: High Technology Letters, Vol. 14 (2008), No.3, p.238

Google Scholar

[7] J.L. Hu and M.P. Wellman: Journal of Machine Learning Research, Vol.4 (2003), p.1039

Google Scholar

[8] J. Joseph and Jr. LaViola: Proc. the 2003 American Control Conference (Denver, CO, USA, 2003), Vol. 3, p.2435

Google Scholar