Research 研究内容

Natural-language Werewolf agents, dialogue systems, and long-context reasoning. 自然言語人狼エージェント、対話システム、長期文脈推論に関する研究。

Focusing on the AIWolf natural language division and agents that can reason, persuade, and deceive through conversation. AI 人狼コンテスト自然言語部門を中心に、会話による推論・説得・欺瞞が可能なエージェントを研究しています。

Affiliation 所属 Kano Laboratory, Shizuoka University 静岡大学情報学部狩野研究室

Focus 研究分野 NLP, AIWolf, Dialogue Systems 自然言語処理、AIWolf、対話システム

Development of AIWolf Agent AIWolf エージェントの開発

I am working on developing autonomous agents for the natural language division of the AIWolf contest, where agents play the game of Werewolf using only text-based conversation for reasoning, persuasion, and deception.

自然言語部門の AI 人狼コンテストにおいて、チャットのみを用いて推論・説得・欺瞞を行い、自律的に人狼ゲームをプレイするエージェントの開発を行っています。

About the AIWolf Project AIWolf プロジェクトについて

The natural language division of the Werewolf Intelligence Competition is a contest in which agent players compete autonomously in games of Werewolf, using only natural language conversation to perform reasoning, persuasion, and deception. Its goal is to evaluate the limitations and potential of dialogue systems and large language models.

The competition features a 5-player village track and a 13-player village track. In complex situations that include multiple roles and secret conversations among the werewolves, agents developed by each team connect to the game server and play autonomously.

Evaluation is conducted from multiple angles, not only by win rate but also through human subjective assessment, automated subjective evaluation using LLM-as-a-judge, and detailed log analysis.

AI 人狼知能コンテストの自然言語部門は、エージェント同士が人狼ゲームを自動でプレイし、その中で行われる会話のみを用いて推論・説得・欺瞞を行う競技です。対話システムや大規模言語モデルの限界と可能性を評価することを目的としています。

大会では 5人村と 13人村のトラックがあり、多数の役職や人狼同士の秘密会話などを含む複雑な状況の中で、各チームが開発したエージェントがゲームサーバに接続して自律的にプレイします。

評価は単なる勝率だけでなく、人間による主観評価、LLM-as-a-judge による自動主観評価、詳細なログ解析など多面的に行われます。

Issues 課題

Fails to respond appropriately to questions or others’ statements, leading to poor turn-taking.
Makes remarks that don’t match the situation or repeats content, showing weak grasp of game state.
Struggles to handle player relationships and cooperative actions in complex multi-role settings.
Tends to follow specific opinions, making diverse and strategic counterarguments difficult.
Vulnerable to repeated prompting, sometimes leaking secret information or breaking consistency.
Small number of participants makes win rates and subjective evaluations unstable.
LLM-based responses are slow, limiting real-time play with human players.

質問や他者発話への応答が不適切になりやすく、会話のターン交代や文脈利用が不十分。
状況と合わない発言や同内容の繰り返しが多く、ゲーム状態の把握や自然さに課題がある。
役職やプレイヤー数が増える複雑な状況では、プレイヤー間関係や協調行動の扱いが難しい。
過去ログを参照する一方で、特定の意見に追従しやすく、多様な戦略的反論や視点の提示が難しい。
繰り返しのプロンプトに弱く、秘密情報の漏洩や、嘘と真実の矛盾維持が難しい。
参加チーム数が少なく、同一エージェント同士の対戦が多いため、勝率や主観評価が安定しにくい。
LLM 応答に時間がかかるため、人間とのリアルタイム混成対戦には高速化が必要。

Approach アプローチ

Hierarchical BDI architecture to align local actions with long-term strategies.
Personality modeling via cognitive biases and 24 MBTI-derived parameters.
Commitment tracking for contradiction detection and game state understanding.

階層型 BDI アーキテクチャにより、長期戦略と整合する各ステージの行動選択を行うエージェントの構築。
認知バイアスを取り入れ、MBTI 由来の 24 パラメータによる個性を再現するエージェント設計。
コミットメントトラッキングを用いて矛盾検出とゲーム状態理解を行うエージェントの構築。