Benedetta Liberatori

Yolo-based face mask detection on low-end devices using pruning and quantization

Jubilee International Convention on Information, Communication and Electronic Technology (MIPRO), 2022

Benedetta Liberatori
Ciro Antonio Mami
Giovanni Santacatterina
Marco Zullich
Felice Andrea Pellegrino

Abstract

Deploying Deep Learning (DL) based object detection (OD) models in low-end devices, such as single board computers, may lead to poor performance in terms of frames-per-second (FPS). Pruning and quantization are well-known compression techniques that can potentially lead to a reduction of the computational burden of a DL model, with a possible decrease of performance in terms of detection accuracy. Motivated by the widespread introduction of face mask mandates by many institutions during the Covid-19 pandemic, we aim at training and compressing an OD model based on YOLOv4 to recognize the presence of face masks, to be deployed on a Raspberry Pi 4. We investigate the capability of different kinds of pruning and quantization techniques of increasing the FPS with respect to the uncompressed model, while retaining the detection accuracy. We quantitatively assess the pruned and quantized models in terms of Mean Average Precision (mAP) and FPS, and show that with proper pruning and quantization, the FPS can be doubled with a moderate loss in mAP. The results provide guidelines for compression of other OD models based on YOLO.

BibTeX

			
@inproceedings{9803406,
author={Liberatori, Benedetta and Mami, Ciro Antonio and Santacatterina, Giovanni and Zullich, Marco and Pellegrino, Felice Andrea},
booktitle={2022 45th Jubilee International Convention on Information, Communication and Electronic Technology (MIPRO)}, 
title={YOLO-Based Face Mask Detection on Low-End Devices Using Pruning and Quantization}, 
year={2022}}