The Yōkai Learning Environment: Tracking Beliefs Over Space and Time

Constantin Ruhdorfer, Matteo Bortoletto, Andreas Bulling

IJCAI Workshop on Generative AI & Theory of Mind In Communicating Agents, pp. 1–24, 2025.

Oral Presentation

Abstract

Developing collaborative AI hinges on Theory of Mind (ToM) - the ability to reason about the beliefs of others to build and maintain common ground. Existing ToM benchmarks, however, are restricted to passive observer settings or lack an assessment of how agents establish and maintain common ground over time. To address these gaps, we introduce the Yokai Learning Environment (YLE) - a multi-agent reinforcement learning (RL) environment based on the cooperative card game Yokai. In the YLE, agents take turns peeking at hidden cards and moving them to form clusters based on colour. Success requires tracking evolving beliefs, remembering past observations, and maintaining common ground with teammates. Our evaluation yields two key findings: First, current RL agents struggle to solve the YLE, even when given access to perfect memory. Second, while belief modelling improves performance, agents are still unable to effectively generalise to unseen partners or form accurate beliefs over longer games, exposing a reliance on brittle conventions rather than robust belief tracking. We use the YLE to investigate research questions in belief modelling, memory, partner generalisation, and scaling to higher-order ToM.

Links

doi: 10.48550/arXiv.2508.12480

Paper: ruhdorfer25_ijcaiw.pdf

BibTeX

@inproceedings{ruhdorfer25_ijcaiw, author = {Ruhdorfer, Constantin and Bortoletto, Matteo and Bulling, Andreas}, title = {The Yōkai Learning Environment: Tracking Beliefs Over Space and Time}, booktitle = {IJCAI Workshop on Generative AI & Theory of Mind In Communicating Agents}, year = {2025}, pages = {1--24}, doi = {10.48550/arXiv.2508.12480} }