论文标题
人类交流的共同心理理论
Mutual Theory of Mind for Human-AI Communication
论文作者
论文摘要
新的发展使AI系统能够根据人类的明确或隐性行为和言语提示来感知,认识和回应社会线索。这些AI系统配备了相当于人类的心理理论(TOM)功能,目前是约会平台的媒人,帮助学生作为助教的学生学习,并提高作为工作伙伴的生产力。它们标志着人类互动(HAI)的新时代(HAI)与传统人工互动(HCI)分歧,其中计算机通常被视为工具而不是社会参与者。在这个新兴的HAI时代,设计和理解人类的看法和经验成为AI系统满足人类需求并减轻社会环境中的风险的紧迫和关键问题。在本文中,我们提出了相互的思维理论(MTOM)框架,灵感来自于汤姆在人类通信中的能力,通过强调人类交流的迭代和相互塑造本质来指导这一新一代的HAI研究。我们讨论了MTOM框架的动机及其在三个阶段迭代地塑造人类交流的三个关键组成部分。然后,我们描述了受MTOM框架启发的两项经验研究,以证明MTOM在指导人类交流的设计和理解方面的力量。最后,我们通过MTOM镜头讨论了人类互动中未来的研究机会。
New developments are enabling AI systems to perceive, recognize, and respond with social cues based on inferences made from humans' explicit or implicit behavioral and verbal cues. These AI systems, equipped with an equivalent of human's Theory of Mind (ToM) capability, are currently serving as matchmakers on dating platforms, assisting student learning as teaching assistants, and enhancing productivity as work partners. They mark a new era in human-AI interaction (HAI) that diverges from traditional human-computer interaction (HCI), where computers are commonly seen as tools instead of social actors. Designing and understanding the human perceptions and experiences in this emerging HAI era becomes an urgent and critical issue for AI systems to fulfill human needs and mitigate risks across social contexts. In this paper, we posit the Mutual Theory of Mind (MToM) framework, inspired by our capability of ToM in human-human communications, to guide this new generation of HAI research by highlighting the iterative and mutual shaping nature of human-AI communication. We discuss the motivation of the MToM framework and its three key components that iteratively shape the human-AI communication in three stages. We then describe two empirical studies inspired by the MToM framework to demonstrate the power of MToM in guiding the design and understanding of human-AI communication. Finally, we discuss future research opportunities in human-AI interaction through the lens of MToM.