融合的深神经网络转移学习在封闭的面部分类和人重新识别中

论文标题

融合的深神经网络转移学习在封闭的面部分类和人重新识别中

Fused Deep Neural Network based Transfer Learning in Occluded Face Classification and Person re-Identification

论文作者

Mohana, Mohamed, B, Prasanalakshmi, Alelyani, Salem, Alsaqer, Mohammed Saleh

论文摘要

最近的大流行时期，即使遮挡的面部图像也很重要，但使用掩膜数量增加。本文旨在认识到面部图像中四种类型之一的阻塞。测试了各种转移学习方法，结果表明，具有门控复发单元（GRU）的Mobilenet V2的性能比任何其他转移学习方法都更好，并且在图像的分类中具有99％的精度，如有或没有遮挡，然后是闭塞的类型。同时，完成了从设备捕获的图像中识别感兴趣的区域。该提取的感兴趣区域用于面部识别。这种面部识别过程是使用带有CAFFE实现的Resnet模型完成的。为了减少执行时间，在确认面部遮挡类型后，搜索了该人，以在注册数据库中确认其面部图像。从两个同时过程中获得的人的面部标签以其匹配分数验证。如果匹配分数高于90，则该人的公认标签将登录到具有其名称，掩码类型，日期和识别时间的文件中。 Mobilenetv2是一个轻巧的框架，也可以在嵌入式或IoT设备中使用，以使用CCTV素材在可疑的调查区域进行实时检测和识别。当将MobilenetV2与GRU结合使用时，获得了可靠的精度。本文提供的数据属于两类，要么从Google图像中收集以进行遮挡分类，面部识别和面部地标，要么在现场工作中收集。这项研究的动机是识别和记录可以在社会电子政务中提供监视活动的人的细节。

Recent period of pandemic has brought person identification even with occluded face image a great importance with increased number of mask usage. This paper aims to recognize the occlusion of one of four types in face images. Various transfer learning methods were tested, and the results show that MobileNet V2 with Gated Recurrent Unit(GRU) performs better than any other Transfer Learning methods, with a perfect accuracy of 99% in classification of images as with or without occlusion and if with occlusion, then the type of occlusion. In parallel, identifying the Region of interest from the device captured image is done. This extracted Region of interest is utilised in face identification. Such a face identification process is done using the ResNet model with its Caffe implementation. To reduce the execution time, after the face occlusion type was recognized the person was searched to confirm their face image in the registered database. The face label of the person obtained from both simultaneous processes was verified for their matching score. If the matching score was above 90, the recognized label of the person was logged into a file with their name, type of mask, date, and time of recognition. MobileNetV2 is a lightweight framework which can also be used in embedded or IoT devices to perform real time detection and identification in suspicious areas of investigations using CCTV footages. When MobileNetV2 was combined with GRU, a reliable accuracy was obtained. The data provided in the paper belong to two categories, being either collected from Google Images for occlusion classification, face recognition, and facial landmarks, or collected in fieldwork. The motive behind this research is to identify and log person details which could serve surveillance activities in society-based e-governance.

下载PDF全文

下载文献需遵守相关版权规定

论文标题