Sobiad Citation Index

Optimizing FPGA-based CNN accelerator for energy efficiency with an extended Roofline model

2018

Journal:

Turkish Journal of Electrical Engineering and Computer Science

Abstract:

In recent years, the convolutional neural network (CNN) has found wide acceptance in solving practical computer vision and image recognition problems. Owing to its flexibility, faster development time, and energy efficiency, the field-programmable gate array (FPGA) has also become an attractive platform for exploiting the inherent parallelism in the feedforward process of the CNN. However, to meet the high accuracy demands of today's practical recognition applications, which typically have massive datasets, CNNs have to be larger and deeper. Enlarging the CNN aggravates the off-chip memory bottleneck in the FPGA platform, since there is not enough space to store large datasets on-chip. In this work, we propose a memory system architecture that best matches the off-chip memory traffic with the optimum throughput of the computation engine while it operates at the maximum allowable frequency. With the help of an extended version of the Roofline model proposed in this work, we can estimate the memory bandwidth utilization of the system at different operating frequencies, since the proposed model considers operating frequency in addition to bandwidth utilization and throughput. To find the solution with the best energy efficiency, we make a trade-off between energy efficiency and computational throughput. This solution reduces energy consumption by 18% at the cost of a less than 2% reduction in throughput. We also propose a race-to-halt strategy to further improve the energy efficiency of the designed CNN accelerator. Experimental results show that our CNN accelerator achieves a peak performance of 52.11 GFLOPS and an energy efficiency of 10.02 GFLOPS/W on a ZYNQ ZC706 FPGA board running at 250 MHz, which outperforms most previous approaches.
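The abstract does not give the equations of the extended Roofline model, so the following is only a minimal sketch of the general idea, not the authors' formulation. It uses the classic Roofline bound (attainable performance is the smaller of the compute roof and bandwidth times operational intensity) and assumes, purely for illustration, that the compute roof scales linearly with clock frequency. The function names, the 4.0 GB/s bandwidth, and the 20 FLOP/byte intensity are hypothetical; only 52.11 GFLOPS, 10.02 GFLOPS/W, and 250 MHz come from the abstract.

```python
def compute_roof_gflops(peak_gflops_at_fmax: float, f_mhz: float, f_max_mhz: float) -> float:
    """Compute roof at clock f_mhz, ASSUMING linear scaling with frequency."""
    if not 0 < f_mhz <= f_max_mhz:
        raise ValueError("f_mhz must be in (0, f_max_mhz]")
    return peak_gflops_at_fmax * f_mhz / f_max_mhz


def attainable_gflops(compute_roof: float, bandwidth_gbs: float,
                      intensity_flop_per_byte: float) -> float:
    """Classic Roofline bound: min(compute roof, bandwidth * operational intensity)."""
    return min(compute_roof, bandwidth_gbs * intensity_flop_per_byte)


def bandwidth_utilization(attained_gflops: float, bandwidth_gbs: float,
                          intensity_flop_per_byte: float) -> float:
    """Fraction of off-chip bandwidth needed to sustain attained_gflops."""
    required_gbs = attained_gflops / intensity_flop_per_byte
    return required_gbs / bandwidth_gbs


def implied_power_watts(gflops: float, gflops_per_watt: float) -> float:
    """Power implied by a throughput figure and an efficiency figure."""
    return gflops / gflops_per_watt


if __name__ == "__main__":
    PEAK_GFLOPS, F_MAX_MHZ = 52.11, 250.0   # reported in the abstract
    BANDWIDTH_GBS, INTENSITY = 4.0, 20.0    # hypothetical values for illustration
    for f_mhz in (100.0, 150.0, 200.0, 250.0):
        roof = compute_roof_gflops(PEAK_GFLOPS, f_mhz, F_MAX_MHZ)
        perf = attainable_gflops(roof, BANDWIDTH_GBS, INTENSITY)
        util = bandwidth_utilization(perf, BANDWIDTH_GBS, INTENSITY)
        print(f"{f_mhz:5.0f} MHz: {perf:6.2f} GFLOPS, bandwidth utilization {util:.0%}")
    print(f"implied power: {implied_power_watts(52.11, 10.02):.2f} W")
```

Dividing the two reported figures gives roughly 5.2 W, under the assumption that the peak performance and the energy efficiency were measured at the same operating point, which the abstract does not state explicitly.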

Cited By

Note: This publication has not been cited.

Similar Articles

1. Estimation of distribution-based multiobjective design space exploration for energy and throughput-optimized MPSoCs

2020

Turkish Journal of Electrical Engineering and Computer Science

2. A Power-Aware Real-Time System for Multi-Video Treatment on FPGA with Dynamic Partial Reconfiguration and Voltage Scaling

2022

Engineering, Technology & Applied Science Research

3. Behavior of metamaterial-based microwave components for sensing and heating of nanoliter-scale volumes

2016

Turkish Journal of Electrical Engineering and Computer Science

4. FPGA implementation of a HEVC deblocking filter for fast processing of super high resolution applications

2016

Turkish Journal of Electrical Engineering and Computer Science

5. Multi-Objective Mayfly Optimization in Phase Optimization of OFDM

2023

IIUM Engineering Journal

6. Transmission power control using state estimation-based received signal strength prediction for energy efficiency in wireless sensor networks

2017

Turkish Journal of Electrical Engineering and Computer Science

Field: Engineering

Journal Type: International

Metrics

Articles: 2,879

Citations: 1,406

2023 Impact: 0.016
