论文标题
职位论文:离线计划的在线建模
Position Paper: Online Modeling for Offline Planning
论文作者
论文摘要
计划问题的定义和表示是AI计划研究的核心。关键部分是代表动作模型。数十年的进步改善声明性行动模型表示,导致了许多理论进步,并且有能力,有效的,独立于领域的计划者。但是,尽管该领域成熟,但AI规划技术仍然很少在研究社区之外使用,这表明当前的表示未能捕获现实世界中的要求,例如利用复杂的数学功能和从数据中汲取的模型。我们认为这是因为假定建模过程已在计划过程之前进行并完成,即离线计划的离线建模。这种方法固有的一些挑战,包括:声明性建模语言的表现力有限;早期致力于建模选择和计算,这是使用每个动作模型的最合适分辨率的排除的 - 只有在计划期间才能知道;并且难以可靠地使用非决定性,学识渊博的模型。 因此,我们建议更改AI规划过程,以便在离线计划中进行在线建模,即使用访问计划过程的一部分计算甚至生成的动作模型。这概括了现有方法(离线建模)。拟议的定义承认了新的计划过程,我们建议一种具体的实施,以证明这种方法。我们勾勒出作为第一次尝试通过使用行动成本估算器进行计划的初步尝试获得的初始结果。我们通过讨论公开挑战来总结。
The definition and representation of planning problems is at the heart of AI planning research. A key part is the representation of action models. Decades of advances improving declarative action model representations resulted in numerous theoretical advances, and capable, working, domain-independent planners. However, despite the maturity of the field, AI planning technology is still rarely used outside the research community, suggesting that current representations fail to capture real-world requirements, such as utilizing complex mathematical functions and models learned from data. We argue that this is because the modeling process is assumed to have taken place and completed prior to the planning process, i.e., offline modeling for offline planning. There are several challenges inherent to this approach, including: limited expressiveness of declarative modeling languages; early commitment to modeling choices and computation, that preclude using the most appropriate resolution for each action model -- which can only be known during planning; and difficulty in reliably using non-declarative, learned, models. We therefore suggest to change the AI planning process, such that is carries out online modeling in offline planning, i.e., the use of action models that are computed or even generated as part of the planning process, as they are accessed. This generalizes the existing approach (offline modeling). The proposed definition admits novel planning processes, and we suggest one concrete implementation, demonstrating the approach. We sketch initial results that were obtained as part of a first attempt to follow this approach by planning with action cost estimators. We conclude by discussing open challenges.