论文标题
对象文件和架构:将动态系统中的声明性和程序知识分解
Object Files and Schemata: Factorizing Declarative and Procedural Knowledge in Dynamical Systems
论文作者
论文摘要
建模像视频游戏这样的结构化,动态的环境需要跟踪对象及其声明知识),并预测对象的行为方式(程序知识)。具有单层隐藏状态的黑盒模型通常无法始终如一,统一地应用程序知识,即它们缺乏系统性。例如,在视频游戏中,正确预测一个敌人的轨迹无法确保正确预测他人的轨迹。我们通过架构解决此问题,该架构将声明性和程序知识分配,并在每种知识形式中施加模块化。该体系结构由称为对象文件的活动模块组成,这些模块可维护单个对象的状态并调用称为Schemata的被动外部知识源,称为状态更新。要使用视频游戏作为插图,同一类型的两个敌人将共享模式,但将具有单独的对象文件来编码其独特的状态(例如,健康,位置)。我们建议使用注意来确定要更新哪些对象文件,schemata的选择以及对象文件之间的信息传播。最终的体系结构是与正常复发网络相同的输入输出接口(例如LSTM,GRU)符合相同的输入输出界面的置换式替换,但在具有相同类型的多个对象标记的环境上实现了更好的概括,包括具有挑战性的直觉物理学基准。
Modeling a structured, dynamic environment like a video game requires keeping track of the objects and their states declarative knowledge) as well as predicting how objects behave (procedural knowledge). Black-box models with a monolithic hidden state often fail to apply procedural knowledge consistently and uniformly, i.e., they lack systematicity. For example, in a video game, correct prediction of one enemy's trajectory does not ensure correct prediction of another's. We address this issue via an architecture that factorizes declarative and procedural knowledge and that imposes modularity within each form of knowledge. The architecture consists of active modules called object files that maintain the state of a single object and invoke passive external knowledge sources called schemata that prescribe state updates. To use a video game as an illustration, two enemies of the same type will share schemata but will have separate object files to encode their distinct state (e.g., health, position). We propose to use attention to determine which object files to update, the selection of schemata, and the propagation of information between object files. The resulting architecture is a drop-in replacement conforming to the same input-output interface as normal recurrent networks (e.g., LSTM, GRU) yet achieves substantially better generalization on environments that have multiple object tokens of the same type, including a challenging intuitive physics benchmark.