论文标题
自动配置和用于议价的策略组合的使用
Automated Configuration and Usage of Strategy Portfolios for Bargaining
论文作者
论文摘要
讨价还价可用于解决多代理系统中的混合运动游戏。尽管在自动化谈判代理商中实施了大量的谈判策略,但大多数代理商都是基于单个固定策略的,而广泛地承认,对于所有谈判环境,都没有最佳表现的策略。 在本文中,我们专注于对手反复遇到的议价设置,但议价问题发生了变化。我们介绍了一种新颖的方法,该方法会自动创建和部署互补谈判策略的投资组合,并通过每次设定的策略选择在从未见过的讨价还价设置中优化收益。我们的方法取决于以下贡献。我们介绍了一个功能表示,该特征表示对手和议价问题的特征。我们根据对手在谈判过程中的行为对行为进行了对其行为的行为,这表明了其谈判策略,以便在将来的遭遇中更有效。 我们将基于功能的方法的结合概括为新的谈判设置,就像在实际上一样,随着时间的流逝,它选择了未来遭遇的有效反策略。我们的方法在类似ANAC的比赛中进行了测试,我们表明我们有能力赢得与亚军相比,收益增长5.6%的锦标赛。
Bargaining can be used to resolve mixed-motive games in multi-agent systems. Although there is an abundance of negotiation strategies implemented in automated negotiating agents, most agents are based on single fixed strategies, while it is widely acknowledged that there is no single best-performing strategy for all negotiation settings. In this paper, we focus on bargaining settings where opponents are repeatedly encountered, but the bargaining problems change. We introduce a novel method that automatically creates and deploys a portfolio of complementary negotiation strategies using a training set and optimise pay-off in never-before-seen bargaining settings through per-setting strategy selection. Our method relies on the following contributions. We introduce a feature representation that captures characteristics for both the opponent and the bargaining problem. We model the behaviour of an opponent during a negotiation based on its actions, which is indicative of its negotiation strategy, in order to be more effective in future encounters. Our combination of feature-based methods generalises to new negotiation settings, as in practice, over time, it selects effective counter strategies in future encounters. Our approach is tested in an ANAC-like tournament, and we show that we are capable of winning such a tournament with a 5.6% increase in pay-off compared to the runner-up agent.