Modified UCT Algorithm with Risk Dominance Methods in Imperfect Information Game

Jia Jia Zhang; Xuan Wang; Lin Yao; Jing Peng Li; Xue Dong Shen

doi:10.4028/www.scientific.net/AMM.610.367

Paper Titles

An Improved TLBO Algorithm for Balancing Stochastic Two-Sided Assembly Line
p.345

Time Bayesian Net Fault Prognostics
p.350

An Improved Adaptive Threshold Skin Color Model
p.358

Probabilistic Neural Network Southern Jujube Pest Stress Index Leaf Pigment Estimation Model Based Hyperspectral
p.362

Modified UCT Algorithm with Risk Dominance Methods in Imperfect Information Game
p.367

Fuzzy Roughness Degree Measurement Model Based on Level Effect
p.377

Implementation of Optical Flow Base on Pyramid on the Zedboard
p.381

Speech Perception Hash Authentication Algorithm Based on Immittance Spectral Pairs
p.385

Anti-Occlusion Algorithm for Object Tracking Based on Multi-Feature Fusion
p.393

HomeApplied Mechanics and MaterialsApplied Mechanics and Materials Vol. 610Modified UCT Algorithm with Risk Dominance Methods...

Modified UCT Algorithm with Risk Dominance Methods in Imperfect Information Game

Abstract:

UCT (Upper confidential bounds on Trees) has been applied quite well as a selection approach in MCTS(Monte Carlo Tree Search) in imperfect information games like poker. By using risk dominance as complementary part of decision method besides payoff dominance, opponent strategies is better characterized as their risk factors, like bluff actions in Texas Hold’em Poker . In this paper, estimation method about the influence of risk factors on computing game equilibrium is provided. A novel algorithm, UCT-risk is proposed as modification about UCT algorithm basing on risk estimation methods. To verify the performance of new algorithm, Texas Hold’em, a popular test-bed for AI research is chosen as the experiment platform. The Agent adopted UCT-risk algorithm performs as well or better as the best previous approaches in experiments. And also it is applied in a poker agent named HITSZ_CS_13 in the 2013 AAAI Computer Poker Competition, which confirms the effectiveness of the UCT-risk provided in this paper.

You might also be interested in these eBooks

View Preview

Info:

Periodical:

Applied Mechanics and Materials (Volume 610)

Pages:

367-376

DOI:

https://doi.org/10.4028/www.scientific.net/AMM.610.367

Citation:

Cite this paper

Online since:

August 2014

Authors:

Jia Jia Zhang*, Xuan Wang, Lin Yao, Jing Peng Li, Xue Dong Shen

Keywords:

Imperfect Information Game, Risk Dominance, Uct

Export:

RIS, BibTeX

Price:

Permissions CCC:

Request Permissions

Permissions PLS:

Request Permissions

Сopyright:

Citation:

* - Corresponding Author

References

[1] Howard James Bampton, Solving imperfect information games using the Monte Carlo heuristic, Knoxville, Master thesis: University of Tennessee ( 1994).