论文标题
云服务事件中的神经知识提取
Neural Knowledge Extraction From Cloud Service Incidents
论文作者
论文摘要
在过去的十年中,两个范式转移重塑了软件行业 - 从盒装产品到服务以及广泛采用云计算。这对软件开发生命周期和DevOps流程产生了巨大影响。特别是,事件管理对于开发和运营大规模服务至关重要。创建事件是为了确保及时沟通服务问题以及其解决方案。事件管理上的先前工作一直集中在事件进行分盘和解除局面的挑战上。在这项工作中,我们解决了从服务事件中提取结构化知识的基本问题。我们已经建立了柔软的人,这是从服务事件中提取无监督的知识的框架。我们将知识提取问题框架作为提取事实信息的指定实体识别任务。柔软的人利用诸如钥匙,价值对和表的结构模式来引导训练数据。此外,我们构建了一种新型的基于多任务学习的BilstM-CRF模型,该模型不仅利用语义上下文,还利用数据类型来提取指定性的提取。我们已经在主要的云服务提供商Microsoft部署了Softner,并在超过2个月的云事件中对其进行了评估。我们表明,基于机器学习的方法的高精度为0.96。我们的基于多任务学习的深度学习模型还胜过了艺术模型的状态。最后,使用Softner提取的知识,我们能够为重要的下游任务等重要的模型构建更准确的模型。
In the last decade, two paradigm shifts have reshaped the software industry - the move from boxed products to services and the widespread adoption of cloud computing. This has had a huge impact on the software development life cycle and the DevOps processes. Particularly, incident management has become critical for developing and operating large-scale services. Incidents are created to ensure timely communication of service issues and, also, their resolution. Prior work on incident management has been heavily focused on the challenges with incident triaging and de-duplication. In this work, we address the fundamental problem of structured knowledge extraction from service incidents. We have built SoftNER, a framework for unsupervised knowledge extraction from service incidents. We frame the knowledge extraction problem as a Named-entity Recognition task for extracting factual information. SoftNER leverages structural patterns like key,value pairs and tables for bootstrapping the training data. Further, we build a novel multi-task learning based BiLSTM-CRF model which leverages not just the semantic context but also the data-types for named-entity extraction. We have deployed SoftNER at Microsoft, a major cloud service provider and have evaluated it on more than 2 months of cloud incidents. We show that the unsupervised machine learning based approach has a high precision of 0.96. Our multi-task learning based deep learning model also outperforms the state of the art NER models. Lastly, using the knowledge extracted by SoftNER we are able to build significantly more accurate models for important downstream tasks like incident triaging.