Automated labeling of training data for improved object detection in traffic videos by fine-tuned deep convolutional neural networks

García Aguilar, Iván; García-González, Jorge; Luque-Baena, Rafael Marcos; López-Rubio, Ezequiel

doi:https://doi.org/10.1016/j.patrec.2023.01.015

Files

1-s2.0-S0167865523000223-main.pdf (1.62 MB)

Identifiers

URI: https://hdl.handle.net/10630/26302

DOI: https://doi.org/10.1016/j.patrec.2023.01.015

Publication date

2023

Authors

García Aguilar, Iván

García-González, Jorge

Luque-Baena, Rafael Marcos

López-Rubio, Ezequiel

Publisher

Elsevier

Center

E.T.S.I. Informática

Department/Institute

Lenguajes y Ciencias de la Computación

Keywords

Neural networks (Computer science)

Abstract

The exponential increase in the use of technology in road management systems has made real-time visual information available at thousands of locations on road networks. A preliminary step in preventing or detecting accidents is identifying the vehicles on the road. The application of convolutional neural networks to object detection has significantly improved this field, enhancing classical computer vision techniques. However, deficiencies remain because the available pre-trained models provide a low detection rate, especially for small objects. The main drawback is that retraining a model requires manual labeling of the vehicles that appear in the images from each IP camera located on the road network. This task is not feasible when thousands of cameras are distributed across the extensive road network of each nation or state. Our proposal presents a new automatic procedure for detecting small-scale objects in traffic sequences. In the first stage, vehicle patterns detected from a set of frames are generated automatically through an offline process, using super-resolution techniques and pre-trained object detection networks. Subsequently, the object detection model is retrained with the previously obtained data, adapting it to the analyzed scene. Finally, online and in real time, the retrained model is applied to the rest of the traffic sequence or to the video stream generated by the camera. This framework has been successfully tested on the NGSIM and GRAM datasets.
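The offline labeling stage described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that a pre-trained detector has already been run on a super-resolved copy of a frame, and it only shows one plausible way to turn those detections into training labels for the original frame. The names `Detection`, `rescale_box`, `build_pseudo_labels`, and the thresholds `min_score` and `iou_thr` are hypothetical and chosen for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Box format (an assumption of this sketch): (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


@dataclass
class Detection:
    box: Box
    score: float
    label: str


def rescale_box(box: Box, scale: float) -> Box:
    """Map a box from super-resolved coordinates back to the original frame."""
    return tuple(c / scale for c in box)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def build_pseudo_labels(
    detections: List[Detection],
    scale: float,
    min_score: float = 0.5,  # hypothetical confidence threshold
    iou_thr: float = 0.5,    # hypothetical overlap threshold for duplicates
) -> List[Detection]:
    """Keep confident detections, rescale them, and drop overlapping duplicates."""
    confident = sorted(
        (d for d in detections if d.score >= min_score),
        key=lambda d: d.score,
        reverse=True,
    )
    kept: List[Detection] = []
    for det in confident:
        box = rescale_box(det.box, scale)
        # Greedy suppression: skip a box that overlaps an already kept, higher-scoring one.
        if all(iou(box, k.box) < iou_thr for k in kept):
            kept.append(Detection(box, det.score, det.label))
    return kept


# Example: detections on a frame upscaled by a factor of 2 (made-up values).
raw = [
    Detection((20, 20, 60, 60), 0.9, "car"),
    Detection((22, 22, 62, 62), 0.8, "car"),      # near-duplicate of the first
    Detection((200, 200, 240, 240), 0.3, "car"),  # below the confidence threshold
]
labels = build_pseudo_labels(raw, scale=2.0)
```

In this sketch, the resulting `labels` would serve as automatically generated annotations for fine-tuning the detector on the scene. The actual super-resolution model, detector, and filtering rules used in the paper are not specified in this record.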

Bibliographic citation

García-Aguilar, I., García-González, J., Luque-Baena, R. M., & López-Rubio, E. (2023). Automated labeling of training data for improved object detection in traffic videos by fine-tuned deep convolutional neural networks. Pattern Recognition Letters, 167, 45–52. https://doi.org/10.1016/j.patrec.2023.01.015

Collections

Articles

Creative Commons license

Except where otherwise noted, this item's license is described as Attribution 4.0 International
