JOURNAL ARTICLE

Power and Performance Tradeoff of a Floating-Point Intensive Kernel on OpenCL FPGA Platform

Abstract

As power is recognized as one of first-order constraints in high-performance computing, understanding how power and performance are affected using a high-level synthesis tool for a floating-point intensive kernel is important. FPGAs offer a promising solution for high-performance and energyefficient computing applications. This paper presents the impact of the optimizations of a floating-point intensive kernel from a geographical information system upon the performance and power using an FPGA. Using an OpenCL-based high-level synthesis tool for FPGAs, we evaluate the resource usage, performance and power consumption of the kernel implementations on an Arria10-based FPGA platform. We compare the performance and energy efficiency of the kernel implementations on an Arria 10 GX1150 FPGA, an Intel's Xeon Phi Knights Landing CPU, and an NVIDIA's Tesla K80 GPU. Our experiment shows that the performance per watt of the kernel implementation on the FPGA is 1.79X better than the CPU and 1.56X better than the GPU. The execution time on the FPGA is approximately 2.9X slower.

Keywords:
Field-programmable gate array Computer science Kernel (algebra) Xeon Embedded system Parallel computing Floating point Coprocessor Operating system

Metrics

8
Cited By
1.49
FWCI (Field Weighted Citation Impact)
19
Refs
0.79
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Parallel Computing and Optimization Techniques
Physical Sciences →  Computer Science →  Hardware and Architecture
Embedded Systems Design Techniques
Physical Sciences →  Computer Science →  Hardware and Architecture
Advanced Data Storage Technologies
Physical Sciences →  Computer Science →  Computer Networks and Communications
© 2026 ScienceGate Book Chapters — All rights reserved.