Deeva：一种基于深度学习和物联网的计算机视觉系统，可解决能源行业生产地点的安全性

论文标题

Deeva：一种基于深度学习和物联网的计算机视觉系统，可解决能源行业生产地点的安全性

DEEVA: A Deep Learning and IoT Based Computer Vision System to Address Safety and Security of Production Sites in Energy Industry

论文作者

Awalgaonkar, Nimish M., Zheng, Haining, Gurciullo, Christopher S.

论文摘要

在满足不同生产/施工站点的安全/安全性需求时，准确检测工人，车辆，设备很重要，并构成了基于计算机视觉的监视系统（CVSS）的组成部分。传统的CVSS系统着重于使用不同的计算机视觉和模式识别算法的使用过于依赖于功能和小数据集的手动提取，从而限制了它们的用法，因为其准确性低，需要专家知识和高计算成本。本文的主要目的是为网站的决策者提供实用但全面的深度学习和基于IoT的解决方案，以解决各种与计算机视觉相关的问题，例如场景分类，场景中的对象检测，语义细分，场景字幕等。我们的总体目标是解决该站点的中心问题以及在自动化时尚中所发生的事情，以确保自动化的需求，以使人们的需求量最小化。我们开发了深入的ExxonMobil Eye进行视频分析（DEEVA）软件包，以处理场景分类，对象检测，语义分割和场景的字幕。结果表明，使用视网膜对象探测器进行转移学习能够检测工人的存在，不同类型的车辆/建筑设备，相关的对象高度准确性（高于90％）。在深度学习的帮助下，自动提取功能和物联网技术以自动捕获，转移和处理大量的实时图像，该框架是迈向开发智能监视系统的重要一步，旨在解决安全/安全性监测，生产力评估和未来决策领域中无数开放式问题。

When it comes to addressing the safety/security related needs at different production/construction sites, accurate detection of the presence of workers, vehicles, equipment important and formed an integral part of computer vision-based surveillance systems (CVSS). Traditional CVSS systems focus on the use of different computer vision and pattern recognition algorithms overly reliant on manual extraction of features and small datasets, limiting their usage because of low accuracy, need for expert knowledge and high computational costs. The main objective of this paper is to provide decision makers at sites with a practical yet comprehensive deep learning and IoT based solution to tackle various computer vision related problems such as scene classification, object detection in scenes, semantic segmentation, scene captioning etc. Our overarching goal is to address the central question of What is happening at this site and where is it happening in an automated fashion minimizing the need for human resources dedicated to surveillance. We developed Deep ExxonMobil Eye for Video Analysis (DEEVA) package to handle scene classification, object detection, semantic segmentation and captioning of scenes in a hierarchical approach. The results reveal that transfer learning with the RetinaNet object detector is able to detect the presence of workers, different types of vehicles/construction equipment, safety related objects at a high level of accuracy (above 90%). With the help of deep learning to automatically extract features and IoT technology to automatic capture, transfer and process vast amount of realtime images, this framework is an important step towards the development of intelligent surveillance systems aimed at addressing myriads of open ended problems in the realm of security/safety monitoring, productivity assessments and future decision making.

下载PDF全文

下载文献需遵守相关版权规定

论文标题