Real-time unsupervised video object detection on the edge

dc.contributor.authorRuiz Barroso, Paula
dc.contributor.authorCastro Payán, Francisco Manuel
dc.contributor.authorGuil-Mata, Nicolás
dc.date.accessioned2025-05-09T09:29:29Z
dc.date.available2025-05-09T09:29:29Z
dc.date.issued2025-02-06
dc.departamentoArquitectura de Computadoreses_ES
dc.description.abstractObject detection in video is an essential computer vision task. Consequently, many efforts have been devoted to developing precise and fast deep-learning models for this task. These models are commonly deployed on discrete and powerful GPU devices to meet both frame rate performance and detection accuracy requirements. Furthermore, model training is usually performed in a strongly supervised way so that samples must be previously labelled by humans using a slow and costly process. In this paper, we develop a real-time implementation for unsupervised object detection in video employing a low-power device. We improve typical approaches for object detection using information supplied by optical flow to detect moving objects. Besides, we use an unsupervised clustering algorithm to group similar detections that avoid manual object labelling. Finally, we propose a methodology to optimize the deployment of our resulting framework on an embedded heterogeneous platform. Thus, we illustrate how all the computational resources of a Jetson AGX Xavier (CPU, GPU, and DLAs) can be used to fulfil frame rate, accuracy, and energy consumption requirements. Three different data representations (FP32, FP16 and INT8) are studied for the pipeline networks in order to evaluate the impact of all of them in our pipeline. Obtained results show that our proposed optimizations can improve up to 23.6x energy consumption and 32.2x execution time with respect to the non-optimized pipeline without penalizing the original mAP (59.44). This computational complexity reduction is achieved through knowledge distillation, using FP16 data precision, and deploying concurrent tasks in different computing units.es_ES
dc.description.sponsorshipFunding for open access charge: Universidad de Málaga / CBUAes_ES
dc.identifier.citationRuiz-Barroso, P., Castro, F. M., & Guil, N. (2025). Real-time unsupervised video object detection on the edge. Future Generation Computer Systems, 167, 107737.es_ES
dc.identifier.doi10.1016/j.future.2025.107737
dc.identifier.issn0167-739X
dc.identifier.urihttps://hdl.handle.net/10630/38549
dc.language.isoenges_ES
dc.publisherElsevieres_ES
dc.rightsAtribución-NoComercial 4.0 Internacional*
dc.rights.accessRightsopen accesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/*
dc.subjectTelecomunicacioneses_ES
dc.subjectAprendizaje automático (Inteligencia artificial)es_ES
dc.subjectDetecciónes_ES
dc.subject.otherEmbedded systemes_ES
dc.subject.otherUnsupervised object detectiones_ES
dc.subject.otherModel quantizationes_ES
dc.subject.otherKnowledge distillationes_ES
dc.titleReal-time unsupervised video object detection on the edgees_ES
dc.typejournal articlees_ES
dc.type.hasVersionVoRes_ES
dspace.entity.typePublication
relation.isAuthorOfPublicationbed8ca48-652e-4212-8c3c-05bfdc85a378
relation.isAuthorOfPublication.latestForDiscoverybed8ca48-652e-4212-8c3c-05bfdc85a378

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1-s2.0-S0167739X25000329-main.pdf
Size:
2.06 MB
Format:
Adobe Portable Document Format
Description:

Collections