SHP-VI Method for Solving the DEC-POMDP Problem

Abstract:

The DEC-POMDP (Decentralized Partially Observable Markov Decision Process) model is an important model for multi-agent collaborative decision-making, but the enormous state space and policy space of DEC-POMDP problems make them very difficult to solve. As the agents interact with the environment on the way from the initial state to the goal state, the maximum reward of the system is often obtained through only a small number of high-reward states. This paper searches for a shortest Hamiltonian path from the initial belief state to the goal state, performs a forward search along the corresponding action sequence to obtain a belief-state trajectory, and then iterates the value function backward along that trajectory, yielding the optimal policy corresponding to the belief trajectory with the largest reward. The resulting shortest-Hamiltonian-path-based value iteration (SHP-VI) searches for the optimal belief path and can therefore solve DEC-POMDP problems with larger state spaces.
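The core procedure the abstract describes, forward-simulating a belief trajectory along an action sequence and then backing the value function up in reverse along that trajectory, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the toy random model (T, O, R), the function names (belief_update, forward_trajectory, backward_value_iteration), and the fixed action_sequence standing in for a plan derived from a shortest Hamiltonian path are all hypothetical, and the single-agent POMDP used here is only a stand-in for the decentralized case.

```python
"""Illustrative sketch of the SHP-VI idea: forward belief trajectory,
then backward value iteration along it. All model numbers are made up."""

import numpy as np

# Hypothetical 3-state, 2-action, 2-observation model (random for illustration).
n_states, n_actions, n_obs = 3, 2, 2
rng = np.random.default_rng(0)
T = rng.dirichlet(np.ones(n_states), size=(n_actions, n_states))  # T[a, s, s']
O = rng.dirichlet(np.ones(n_obs), size=(n_actions, n_states))     # O[a, s', o]
R = rng.random((n_actions, n_states))                              # R[a, s]
gamma = 0.95

def belief_update(b, a, o):
    """Bayesian update: b'(s') proportional to O[a, s', o] * sum_s T[a, s, s'] * b(s)."""
    b_next = O[a, :, o] * (b @ T[a])
    return b_next / b_next.sum()

def forward_trajectory(b0, action_sequence):
    """Forward search: follow the given action sequence (here a placeholder for
    one obtained from a shortest-Hamiltonian-path heuristic), taking the most
    likely observation at each step, and collect the visited beliefs."""
    beliefs, b = [b0], b0
    for a in action_sequence:
        obs_probs = (b @ T[a]) @ O[a]      # P(o | b, a)
        o = int(np.argmax(obs_probs))
        b = belief_update(b, a, o)
        beliefs.append(b)
    return beliefs

def backward_value_iteration(beliefs, n_sweeps=20):
    """Back up values in reverse order along the trajectory, so value
    information propagates from goal-side beliefs back to the initial belief."""
    V = {i: 0.0 for i in range(len(beliefs))}
    for _ in range(n_sweeps):
        for i in reversed(range(len(beliefs) - 1)):
            b = beliefs[i]
            q = [b @ R[a] + gamma * V[i + 1] for a in range(n_actions)]
            V[i] = max(q)
    return V

if __name__ == "__main__":
    b0 = np.ones(n_states) / n_states      # uniform initial belief
    action_sequence = [0, 1, 0, 1]         # placeholder for the SHP-derived plan
    beliefs = forward_trajectory(b0, action_sequence)
    V = backward_value_iteration(beliefs)
    print("value of initial belief:", V[0])
```

For brevity the backward sweep keeps a scalar value per trajectory point; a point-based solver in the spirit of the paper would instead maintain alpha-vectors at each belief, and the Hamiltonian-path search itself (e.g. via a TSP-style heuristic) is not shown.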

Info:

Periodical:

Advanced Materials Research (Volumes 926-930)

Pages:

3245-3249

Citation:

Online since:

May 2014

Copyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved
