Energy-based tuning of convolutional neural networks on multi-GPUs
Files
Description: Main Article
Publisher
Wiley
Abstract
Deep Learning (DL) applications are gaining momentum in the realm of Artificial Intelligence, particularly after GPUs have demonstrated remarkable skills for accelerating their challenging computational requirements. Within this context, Convolutional Neural Network (CNN) models constitute a representative example of success on a wide set of complex applications, particularly on datasets where the target can be represented through a hierarchy of local features of increasing semantic complexity. In most real scenarios, the roadmap to improve results relies on CNN settings involving brute-force computation, and researchers have lately proven Nvidia GPUs to be one of the best hardware counterparts for acceleration. Our work complements those findings with an energy study on critical parameters for the deployment of CNNs on flagship image and video applications, i.e., object recognition and people identification by gait, respectively. We evaluate energy consumption on four different networks based on the two most popular ones (ResNet/AlexNet), i.e., ResNet (167 layers), a 2D CNN (15 layers), a CaffeNet (25 layers), and a ResNetIm (94 layers), using batch sizes of 64, 128, and 256, and then correlate those with speed-up and accuracy to determine optimal settings. Experimental results on a multi-GPU server endowed with twin Maxwell and twin Pascal Titan X GPUs demonstrate that energy correlates with performance and that Pascal may offer up to 40% gains versus Maxwell. Larger batch sizes extend performance gains and energy savings, but we have to keep an eye on accuracy, which sometimes shows a preference for small batches. We expect this work to provide preliminary guidance for a wide set of CNN and DL applications in modern HPC times, where the GFLOPS/W ratio constitutes the primary goal.
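
The energy study summarized above amounts to timing CNN workloads at different batch sizes while sampling GPU power draw and correlating energy with throughput. A minimal sketch of that kind of measurement is given below; it assumes PyTorch, torchvision, and pynvml on a CUDA-capable GPU, and uses a stock ResNet-50 purely as an illustrative stand-in, not the authors' actual networks or measurement pipeline.

    # Sketch: estimate GPU energy per image for CNN inference at several
    # batch sizes, using NVML power readings. Illustrative only.
    import time
    import torch
    import torchvision.models as models
    import pynvml

    pynvml.nvmlInit()
    handle = pynvml.nvmlDeviceGetHandleByIndex(0)   # first GPU

    device = torch.device("cuda")
    model = models.resnet50().to(device).eval()     # stand-in CNN

    def measure(batch_size, iters=50):
        x = torch.randn(batch_size, 3, 224, 224, device=device)
        # Warm-up so cuDNN autotuning does not pollute the measurement.
        with torch.no_grad():
            for _ in range(5):
                model(x)
        torch.cuda.synchronize()

        power_samples = []                  # instantaneous power in watts
        start = time.time()
        with torch.no_grad():
            for _ in range(iters):
                model(x)
                torch.cuda.synchronize()
                # nvmlDeviceGetPowerUsage returns milliwatts.
                power_samples.append(
                    pynvml.nvmlDeviceGetPowerUsage(handle) / 1000.0)
        elapsed = time.time() - start

        avg_power = sum(power_samples) / len(power_samples)   # W
        energy = avg_power * elapsed                           # J
        images = batch_size * iters
        return images / elapsed, energy / images               # img/s, J/img

    for bs in (64, 128, 256):   # batch sizes studied in the article
        throughput, joules_per_image = measure(bs)
        print(f"batch {bs:>3}: {throughput:8.1f} img/s, "
              f"{joules_per_image:6.3f} J/image")

Comparing joules per image across batch sizes (and across GPUs) is the kind of metric the study correlates with speed-up and accuracy when the GFLOPS/W ratio is the target.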
Bibliographic citation
Castro FM, Guil N, Marín-Jiménez MJ, Pérez-Serrano J, Ujaldón M. Energy-based tuning of convolutional neural networks on multi-GPUs. Concurrency Computat Pract Exper. 2019;31:e4786. https://doi.org/10.1002/cpe.4786
Creative Commons license
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 International










