A Reinforcement Learning Scheme for a Partially-Observable Multi-Agent Game.

We formulate an automatic strategy acquisition problem for the multi-agent card game "Hearts" as a reinforcement learning problem. The problem can approximately be dealt with in the framework of a partially observable Markov decision process (POMDP) for a single-agent system. Hearts is an...

Bibliographic Details
Published in:	Machine learning. 59, 1-2 (2005).
Main Author:	Ishii, Shin
Format:	Article
Language:	English
Subjects:	Reinforcement learning. POMDP. Multi-agent system. Card game. Model-based.