Publications of the IPE expert group for embedded parallel systems
Stevanovic U., Caselle M., Chilingaryan S., Herth A., Kopmann A., Vogelgesang M., Balzer M., Weber M.
in Conference on Design and Architectures for Signal and Image Processing, DASIP (2012) 383-384, 6385417.
The first prototype of a high-speed camera with embedded image processing has been developed. Beside high frame rate and high through-put, the camera introduces a novel self triggering architecture to increase the frame rate and to reduce the amount of received data. The camera is intended for synchrotron ultra-fast X-ray radiography and tomography, but it’s concept is also suitable for other fields. © 2012 ECSI.
Birk M., Balzer M., Ruiter N., Becker J.
in 2012 International Conference on Reconfigurable Computing and FPGAs, ReConFig 2012 (2012), 6416735. DOI:10.1109/ReConFig.2012.6416735
With the rise of heterogeneous computing architectures, application developers are confronted with a multitude of hardware platforms and the challenge of identifying the most suitable processing platform for their application. Strong competitors for the acceleration of 3D Ultrasound Computer Tomography, a medical imaging method for early breast cancer diagnosis, are GPU and FPGA devices. In this work, we evaluate processing performance and efficiency metrics for current FPGA and GPU devices. We compare top-notch devices from the 40 nm generation as well as FPGA and GPU devices, which draw the same amount of power. For our two benchmark algorithms, the results show that if power consumption is not considered the GPU and the FPGA give both, a similar processing performance and processing efficiency per transistor. However, if the power budget is limited to a similar value, the FPGA performs between six and eight times better than the GPU. © 2012 IEEE.
Balzer M., Birk M., Dapp R., Gemmeke H., Kretzek E., Menshikov S., Zapf M., Ruiter N.V.
in 2012 18th IEEE-NPSS Real Time Conference, RT 2012 (2012), 6418198. DOI:10.1109/RTC.2012.6418198
3D ultrasound computer tomography (USCT) is a new and promising method for early breast cancer diagnosis. An ultrasound computer tomograph was developed by the Karlsruhe Institute of Technology (KIT) and provides a resolution of 0.2 mm. The main components are the semi-ellipsoidal aperture with 628 ultrasound (US) emitters and 1413 US receivers and the 480 input channel data acquisition system. An additional external computing system is used for the time consuming image reconstruction. To reduce the reconstruction time different implementations on massive parallel computing architectures like multicore processor, GPUs and FPGAs were tested. One realization applies reconfiguration of the 60 FPGAs inside the DAQ system. The investigations show significant acceleration of the reconstruction time up to a factor of 15.8 for the latest FPGA generation and 17.6 for a state of the art GPU. © 2012 IEEE.
Haas D., Mexner W., Spangenberg T., Cecilia A., Vagovic P., Kopmann A., Balzer M., Vogelgesang M., Pasic H., Chilingaryan S.
in PCaPAC 2012 – 9th International Workshop on Personal Computers and Particle Accelerator Controls (2012) 103-105.
X-ray imaging permits to spatially resolve the 2D and 3D structure in materials and organisms, which is crucial for the understanding of their properties. Additional temporal resolution of structure evolution gives access to dynamics of processes and allows to understand functionality of devices and organisms with the goal to optimize technological processes. Such time-resolved dynamic analysis of micro-sized structures is now possible by aid of ultrafast tomography, as being developed at the TopoTomo beamline of the synchrotron light source ANKA. At TopoTomo, the whole experimental workflow has been significantly improved in order to decrease the total duration of a tomography experiment down to the range of minutes. To meet these requirements, detectors and the computing infrastructure have been optimized, comprising a Tango-based control system for ultra fast tomography with a data throughput of several 100 MB/s. Multi-GPU based computing allows for high speed data processing by using a special reconstruction scheme. Furthermore the data management infrastructure will allow for a life cycle management of data sets accumulating several TByte/day. The final concept will also be part of the IMAGE beamline, which is going to be installed in 2013. © 2012 by the respective authors.
Balzer M., Caselle M., Chilingaryian S., Herth A., Kopmann A., Stevanovic U., Vogelgesang M., Rolo T.D.S.
in SEI 2012 – 103. Tagung der Studiengruppe Elektronische Instrumentierung im Fruhjahr 2012 (2012) 121-132.
Birk M., Guth A., Zapf M., Balzer M., Ruiter N., Hubner M., Becker J.
in Conference on Design and Architectures for Signal and Image Processing, DASIP (2011) 67-74, 6136856. DOI:10.1109/DASIP.2011.6136856
As today’s standard screening methods frequently fail to diagnose breast cancer before metastases have developed, earlier breast cancer diagnosis is still a major challenge. Three-dimensional ultrasound computer tomography promises high-quality images of the breast, but is currently limited by a time-consuming synthetic aperture focusing technique based image reconstruction. In this work, we investigate the acceleration of the image reconstruction by a GPU, and by the FPGAs embedded in our custom data acquisition system. We compare the obtained performance results with a recent multi-core CPU and show that both platforms are able to accelerate processing. The GPU reaches the highest performance. Furthermore, we draw conclusions in terms of applicability of the accelerated reconstructions in future clinical application and highlight general principles for speed-up on GPUs and FPGAs. © 2011 IEEE.
Birk M., Koehler S., Balzer M., Huebner M., Ruiter N.V., Becker J.
in IEEE Transactions on Nuclear Science, 58 (2011) 1647-1651, 5942184. DOI:10.1109/TNS.2011.2159017
A three-dimensional ultrasound computer tomography (3-D USCT) system for early breast cancer diagnosis is currently being developed at Institute for Data Processing and Electronics (IPE) at Karlsruhe Institute of Technology, Karlsruhe, Germany. A field-programmable gate array (FPGA)-based data acquisition (DAQ) system is used to acquire the measurement data. Over 3.5 million data sets are recorded for each breast, resulting in 20 GB of raw data. The data is subsequently transferred to a PC for time-consuming image reconstruction, which is in the order of days. A promising approach in reducing reconstruction time is mapping further computation to the FPGAs in the DAQ system. This paper covers the investigation of FPGA-based processing for a set of signal processing algorithms. In the best suited approach, processing time per data set is approximately 50 μs, occupying less than 30% of device resources. © 2006 IEEE.
Birk M., Hagner C., Balzer M., Ruiter N.V., Hubner M., Becker J.
in International Journal of Reconfigurable Computing, 2011 (2011), 952937. DOI:10.1155/2011/952937
As today’s standard screening methods often fail to diagnose breast cancer before metastases have developed, an earlier breast cancer diagnosis is still a major challenge. To improve this situation, we are currently developing a fully three-dimensional ultrasound computer tomography (3D USCT) system, promising high-quality volume images of the breast. For obtaining these images, a time-consuming reconstruction has to be performed. As this is currently done on a PC, parallel processing in reconfigurable hardware could accelerate both signal and image processing. In this work, we investigated the suitability of an existing data acquisition (DAQ) system for further computation tasks. The reconfiguration features of the embedded FPGAs have been exploited to enhance the systems functionality. We have adapted the DAQ system to allow for bidirectional communication and to provide an overall process control. Our results show that the studied system can be applied for data processing. Copyright © 2011 Matthias Birk et al.
Phillips, D.G et al.
in IEEE Nuclear Science Symposium Conference Record
2010, Article number 5874002, Pages 1399-1403
This article will describe the procedures used to validate and characterize the combined hardware and software DAQ system of the KATRIN experiment. The Mk4 DAQ Electronics is the latest version in a series of field programmable gate array (FPGA)-based electronics developed at the Karlsruhe Institute of Technology’s Institute of Data Processing and Electronics (IPE). This system will serve as the primary detector readout in the KATRIN experiment. The KATRIN data acquisition software is a MacOS X application called ORCA (Object-oriented Real-time Control and Acquisition), which includes a powerful scripting language called ORCAScript. This article will also describe how ORCAScript is used in the validation and characterization tests of the Mk4 DAQ electronics system. © 2010 IEEE.