Birk M., Balzer M., Ruiter N.V., Becker J.
in Computers and Electrical Engineering, 40 (2014) 1171-1185. DOI:10.1016/j.compeleceng.2013.11.033
Abstract
In heterogeneous computing, application developers have to identify the best-suited target platform from a variety of alternatives. In this work, we compare performance and architectural efficiency of Graphics Processing Units (GPUs) and Field Programmable Gate Arrays (FPGAs) for two algorithms taken from a novel medical imaging method named 3D ultrasound computer tomography. From the 40 nm and 28 nm generations, we use top-notch devices and those with similar power consumption values. For our two benchmark algorithms from the signal processing and imaging domain, the results show that if power consumption is not considered, the GPU and FPGA from the 40nm generation give both, a similar performance and efficiency per transistor. In the 28 nm process, in contrast, the FPGA is superior to its GPU counterpart by 86% and 39%, depending on the algorithm. If power is limited, FPGAs outperform GPUs in each investigated case by at least a factor of four. © 2013 Elsevier Ltd. All rights reserved.
Lopez-Ramirez M.et al.: FPGA-based methodology for depth-of-field extension in a single image in Digital Signal Processing: A Review Journal, 70 (2017) 14-23.
Lizarraga-Morales R.et al.: Novel FPGA-based methodology for early broken rotor bar detection and classification through homogeneity estimation in IEEE Transactions on Instrumentation and Measurement, 66 (2017) 1760-1769 7865931.
Schenck W.et al.: Comparing parallel hardware architectures for visually guided robot navigation in Concurrency Computation, 29 (2017) e3833.
Birk M.et al.: High-Speed Medical Imaging in 3D Ultrasound Computer Tomography in IEEE Transactions on Parallel and Distributed Systems, 27 (2016) 455-467 7045577.