Overcoming the Challenges of Porting OpenCV to TI's Embedded ARM + DSP Platforms

International Journal of Electrical Engineering Education ◽

10.7227/ijeee.49.3.6 ◽

2012 ◽

Vol 49 (3) ◽

pp. 260-274 ◽

Cited By ~ 5

Author(s):

Joseph Coombs ◽

Rahul Prabhu ◽

Greg Peake

Keyword(s):

Digital Signal Processor ◽

Digital Signal ◽

Academic Community ◽

System On Chip ◽

Floating Point ◽

Software Packages ◽

Functional Development ◽

On Chip ◽

Algorithm Implementation ◽

Memory Constraints

The growing performance and decreasing price of embedded processors are opening many doors for developers in both industry and academia. However, the complexities of these systems can create serious development bottlenecks. Sophisticated software packages such as OpenCV can assist in both the functional development and educational aspects of these otherwise complex applications; such tools lend themselves very well to use by the academic community, in particular by providing examples of algorithm implementation. However, the task of migrating this software to embedded platforms poses its own challenges. This paper will review how to mitigate some of these issues, including C++ implementation, memory constraints, floating-point support, and opportunities to maximise performance using vendor-optimised libraries and integrated accelerators or co-processors. Finally, we will introduce a new effort by Texas Instruments to optimise vision systems by running OpenCV on the C6000™ digital signal processor architecture. Benchmarks will show the advantage of using the DSP by comparing the performance of a DSP+ARM® system-on-chip (SoC) processor against an ARM-only device.


ADI's revolutionary BF60x vision focused digital signal processor system on chip: 25 billion operations/sec @ 80 mW and zero bandwidth

2012 IEEE Hot Chips 24 Symposium (HCS) ◽

10.1109/hotchips.2012.7476488 ◽

2012 ◽

Cited By ~ 1

Author(s):

Robert Bushey

Keyword(s):

Digital Signal Processor ◽

Digital Signal ◽

System On Chip ◽

On Chip ◽

Signal Processor


Argus CNN Accelerator Based on Kernel Clustering and Resource-Aware Pruning

Elektronika ir Elektrotechnika ◽

10.5755/j02.eie.28922 ◽

2021 ◽

Vol 27 (3) ◽

pp. 57-70

Author(s):

Damjan M. Rakanovic ◽

Vuk Vranjkovic ◽

Rastislav J. R. Struharik

Keyword(s):

Digital Signal Processor ◽

State Of The Art ◽

Digital Signal ◽

Pruning Algorithm ◽

Kernel Clustering ◽

Field Programmable ◽

Comparable Performance ◽

On Chip ◽

Resource Characteristics ◽

Resource Aware

This paper proposes a two-step Convolutional Neural Network (CNN) pruning algorithm and a resource-efficient Field-Programmable Gate Array (FPGA) CNN accelerator named “Argus”. The proposed CNN pruning algorithm first combines similar kernels into clusters, which are then pruned using the same regular pruning pattern. The pruning algorithm is carefully tailored for FPGAs, considering their resource characteristics. Regular sparsity results in high Multiply-Accumulate (MAC) efficiency, reducing the amount of logic required to balance workloads among different MAC units. As a result, the Argus accelerator requires about 170 Look-Up Tables (LUTs) per Digital Signal Processor (DSP) block. This number is close to the average LUT/DSP ratio for various FPGA families, enabling balanced resource utilization when implementing Argus. Benchmarks conducted using the Xilinx Zynq UltraScale+ Multi-Processor System-on-Chip (MPSoC) indicate that Argus achieves up to 25 times higher frames per second than NullHop, 2 and 2.5 times higher than NEURAghe and Snowflake, respectively, and 2 times higher than NVDLA. Argus shows comparable performance to MIT’s Eyeriss v2 and Caffeine, requiring up to 3 times less memory bandwidth and utilizing 4 times fewer DSP blocks, respectively. Besides the absolute performance, Argus has at least 1.3 and 2 times better GOP/s/DSP and GOP/s/Block-RAM (BRAM) ratios, while being competitive in terms of GOP/s/LUT, compared to some of the state-of-the-art solutions.


A four-channel digital signal processor in 1.2-μm CMOS with on-chip D/A and A/D conversion serving four speech channels in a new-generation subscriber line circuit

IEEE Journal of Solid-State Circuits ◽

10.1109/4.92024 ◽

1991 ◽

Vol 26 (7) ◽

pp. 1038-1046 ◽

Cited By ~ 3

Author(s):

D. Haspeslagh ◽

J. Sevenhans ◽

A. Delarbre ◽

L. Kiss ◽

E. Moerman

Keyword(s):

Digital Signal Processor ◽

Digital Signal ◽

Subscriber Line ◽

On Chip ◽

New Generation ◽

Signal Processor


System on Chip Design for Multi-Principle of Relay Protection in the FPGA

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.668-669.857 ◽

2014 ◽

Vol 668-669 ◽

pp. 857-861

Author(s):

Peng Fei Hu ◽

Yu Xiang Yuan ◽

Zhi Juan Qu ◽

Xue Ping Jiang

Keyword(s):

Signal Processing ◽

Digital Signal Processing ◽

Relay Protection ◽

Digital Signal ◽

System On Chip ◽

Process Scheduling ◽

Chip Design ◽

Protection Devices ◽

On Chip ◽

Set Up

To improve the reliability and integration of relay protection devices in power systems, a system-on-chip design for multi-principle relay protection on an FPGA is proposed. The data acquisition, digital signal processing, hardware protection algorithm, FPGA and MCU process scheduling, and communication between the MCU and peripheral devices are designed; the hardware compilation model is set up with Quartus II on the FPGA; and simulation and experimental verification are performed. The results show that the proposed system can improve the speed of hardware protection and reduce the volume of the device, and that its architecture can be reconfigured.


Radiation hardness evaluation of a class V 32-bit floating-point digital signal processor

IEEE Radiation Effects Data Workshop, 2005. ◽

10.1109/redw.2005.1532669 ◽

2005 ◽

Cited By ~ 4

Author(s):

R. Joshi ◽

R. Daniels ◽

M. Shoga ◽

M. Gauthier

Keyword(s):

Digital Signal Processor ◽

Digital Signal ◽

Radiation Hardness ◽

Floating Point ◽

Class V ◽

Signal Processor


Low-frequency oscillator for floating-point digital signal processor chips

Electronics Letters ◽

10.1049/el:19921007 ◽

1992 ◽

Vol 28 (17) ◽

pp. 1582

Author(s):

J.I. Acha ◽

J. Calvo

Keyword(s):

Digital Signal Processor ◽

Low Frequency ◽

Digital Signal ◽

Floating Point ◽

Frequency Oscillator ◽

Signal Processor


Dose Rate and Total Dose Radiation Testing of the Texas Instruments TMS320C30 32-Bit Floating Point Digital Signal Processor.

10.21236/ada239767 ◽

1991 ◽

Author(s):

P. F. Siy ◽

J. T. Carter ◽

L. R. D'Addario ◽

D. A. Loeber

Keyword(s):

Total Dose ◽

Dose Rate ◽

Digital Signal Processor ◽

Digital Signal ◽

Floating Point ◽

Radiation Testing ◽

Dose Radiation ◽

Signal Processor


A hybrid floating-point/logarithmic number system digital signal processor

10.1109/icassp.1989.266619 ◽

2003 ◽

Cited By ~ 4

Author(s):

T. Stouraitis

Keyword(s):

Digital Signal Processor ◽

Digital Signal ◽

Number System ◽

Floating Point ◽

Logarithmic Number System ◽

Logarithmic Number ◽

Signal Processor


The DSP32C: AT&T's second generation floating point digital signal processor

IEEE Micro ◽

10.1109/40.16779 ◽

1988 ◽

Vol 8 (6) ◽

pp. 30-48 ◽

Cited By ~ 16

Author(s):

M.L. Fuccio ◽

R.N. Gadenz ◽

C.J. Garen ◽

J.M. Huser ◽

B. Ng ◽

...

Keyword(s):

Digital Signal Processor ◽

Second Generation ◽

Digital Signal ◽

Floating Point ◽

Signal Processor


OpenRISC-based System-on-Chip for digital signal processing

2014 XIX Symposium on Image, Signal Processing and Artificial Vision ◽

10.1109/stsiva.2014.7010123 ◽

2014 ◽

Cited By ~ 2

Author(s):

Alexander Lopez-Parrado ◽

Juan-Camilo Valderrama-Cuervo

Keyword(s):

Signal Processing ◽

Digital Signal Processing ◽

Digital Signal ◽

System On Chip ◽

On Chip
