Reinforcement Learning for Routing Strategy Considering the State of Network Link

Zhao Hui Hu

doi:10.4028/www.scientific.net/AMM.220-223.2772

Paper Titles

MAP Based Super-Resolution Image Reconstruction Method
p.2754

Forecasting Model Based on Improved Grey-Markov
p.2758

The Design and Implementation of a Website Internal Search System Based on Sphinx
p.2763

The Prediction of Energy Consumption in Henan Based on Genetic Neural Network
p.2768

Reinforcement Learning for Routing Strategy Considering the State of Network Link
p.2772

Preliminary Study on CAD of Removable Partial Denture Framework
p.2777

Research on Formal Modeling Based on CPN for Movement Authority of High-Speed Railway CTCS-3
p.2783

Study on the Development and Application of osgART Based on MFC
p.2788

A Multi-Scale Data Fusion-Based Method for Modular Decomposition
p.2794

HomeApplied Mechanics and MaterialsApplied Mechanics and Materials Vols. 220-223Reinforcement Learning for Routing Strategy...

Reinforcement Learning for Routing Strategy Considering the State of Network Link

Abstract:

This paper presents reinforcement learning (RL) algorithm for routing strategy considering the state of network link, which can be deemed as a dynamic programming problem with stochastic needs. Through modeling those four elements and experiments, we draw the conclusion that upon the state of network link, RL is an efficient algorithm for routing strategy; the data can be efficient forwarded to the destination.

You might also be interested in these eBooks

View Preview

Info:

Periodical:

Applied Mechanics and Materials (Volumes 220-223)

Pages:

2772-2776

DOI:

https://doi.org/10.4028/www.scientific.net/AMM.220-223.2772

Citation:

Cite this paper

Online since:

November 2012

Authors:

Zhao Hui Hu

Keywords:

Epsilon Greedy, Network Link Status, Reinforcement Learning, Routing Strategy

Export:

RIS, BibTeX

Price:

Permissions CCC:

Request Permissions

Permissions PLS:

Request Permissions

Сopyright:

Citation:

References

[1] C. Lochert, H Hartenstein, J Tian, etc, "A routing strategy for vehicular ad hoc networks in city environments" Proceedings of the 2003 IEEE International Conference on intelligent Vehicles Symposium, 2003, pp, 156-161.

DOI: 10.1109/ivs.2003.1212901

Google Scholar

[2] G. Malkin, "Rip version 2: carrying additional information", http://etherpad.tools.ietf.org/html/rfc1723

Google Scholar

[3] K. Kompella, Ed. and Y. Rekhter, Ed. ,"OSPF Extensions in Support of Generalized Multi-Protocol Label witching (GMPLS)", http://trac.tools.ietf.org/html/rfc4203

DOI: 10.17487/rfc4203

Google Scholar

[4] R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. Massachusetts London, England: The MIT Press Cambridge, 1998.

Google Scholar

[5] Wen Shang and Dong Sun, "Distributed neural network-based policy gradient reinforcement learning for multi-robot formations", Proceedings of the 2008 IEEE International Conference on Information and Automation, pp.113-118.

DOI: 10.1109/icinfa.2008.4607978

Google Scholar

[6] L. Peshkin and V.Savova , "Reinforcement learning for adaptive routing", International Joint Conference on Neural Networks (IJCNN), 2002, pp, 1825-1830

DOI: 10.1109/ijcnn.2002.1007796

Google Scholar

[7] JA. Boyan and ML. Littman, "Packet routing in dynamically changing networks: a reinforcement learning approach", In Advances in Neural Information Processing Systems, 1994, pp, 671-678.

Google Scholar

[8] Z. H. Hu, D. B. Zhao, "Reinforcement learning for multi-agent patrol policy," IEEE International Conference on Cognitive Informatics, 2010, pp.530-535

DOI: 10.1109/coginf.2010.5599681

Google Scholar

[9] C. Watkins, "Q-learning," Machine Learning, 1992, vol. 8, no.3, pp.279-292.

Google Scholar

[10] A. G. Barto, T. G. Dietterich, "Reinforcement learning and its relationship to supervised learning," in J. Si, A. Barto, W. Powell, and D. Wunsch. Handbook of Learning and Approximate Dynamic Programming, IEEE Press, John Wiley & sons, Inc., 2004, pp.47-63.

DOI: 10.1109/9780470544785

Google Scholar

[11] D. B. Zhao, Z. Zhang, Y. J. Dai. "Self-teaching adaptive dynamic programming for Go-Moku," Neurocomputing, vol.78, 2012, pp.23-29.

DOI: 10.1016/j.neucom.2011.05.032

Google Scholar