scholarly journals A Two-Level Task Scheduler on Multiple DSP System for OpenCL

2014 ◽  
Vol 6 ◽  
pp. 754835
Author(s):  
Li Tian ◽  
Cai Meng ◽  
Fugen Zhou

This paper addresses the problem that multiple DSP system does not support OpenCL programming. With the compiler, runtime, and the kernel scheduler proposed, an OpenCL application becomes portable not only between multiple CPU and GPU, but also between embedded multiple DSP systems. Firstly, the LLVM compiler was imported for source-to-source translation in which the translated source was supported by CCS. Secondly, two-level schedulers were proposed to support efficient OpenCL kernel execution. The DSP/BIOS is used to schedule system level tasks such as interrupts and drivers; however, the synchronization mechanism resulted in heavy overhead during task switching. So we designed an efficient second level scheduler especially for OpenCL kernel work-item scheduling. The context switch process utilizes the 8 functional units and cross path links which was superior to DSP/BIOS in the aspect of task switching. Finally, dynamic loading and software managed CACHE were redesigned for OpenCL running on multiple DSP system. We evaluated the performance using some common OpenCL kernels from NVIDIA, AMD, NAS, and Parboil benchmarks. Experimental results show that the DSP OpenCL can efficiently exploit the computing resource of multiple cores.

2020 ◽  
Vol 96 (3s) ◽  
pp. 89-96
Author(s):  
А.А. Беляев ◽  
Я.Я. Петричкович ◽  
Т.В. Солохина ◽  
И.А. Беляев

Рассмотрены особенности архитектуры и основные характеристики аппаратного видеокодека по стандарту H.264, входящего в состав микросхемы 1892ВМ14Я (MCom-02). Описан механизм синхронизации потоков данных на основе набора флагов событий. Приведены экспериментальные результаты измерения характеристик производительности разработанного видеокодека на реальных видеосюжетах при различных форматах передаваемого изображения. The paper considers main architectural features and characteristics of H.264 hardware video codec IP-core as a part of MCom- 02 system-on-chip (SoC). Bedides, it presents data flow synchronization mechanism based on event flags set, as well as experimental results of performance measurements for the designed video codec IP-core obtained for different video sequences and different image formats.


2020 ◽  
Vol 835 ◽  
pp. 229-242
Author(s):  
Oboso P. Bernard ◽  
Nagih M. Shaalan ◽  
Mohab Hossam ◽  
Mohsen A. Hassan

Accurate determination of piezoelectric properties such as piezoelectric charge coefficients (d33) is an essential step in the design process of sensors and actuators using piezoelectric effect. In this study, a cost-effective and accurate method based on dynamic loading technique was proposed to determine the piezoelectric charge coefficient d33. Finite element analysis (FEA) model was developed in order to estimate d33 and validate the obtained values with experimental results. The experiment was conducted on a piezoelectric disc with a known d33 value. The effect of measuring boundary conditions, substrate material properties and specimen geometry on measured d33 value were conducted. The experimental results reveal that the determined d33 coefficient by this technique is accurate as it falls within the manufactures tolerance specifications of PZT-5A piezoelectric film d33. Further, obtained simulation results on fibre reinforced and particle reinforced piezoelectric composite were found to be similar to those that have been obtained using more advanced techniques. FE-results showed that the measured d33 coefficients depend on measuring boundary condition, piezoelectric film thickness, and substrate material properties. This method was proved to be suitable for determination of d33 coefficient effectively for piezoelectric samples of any arbitrary geometry without compromising on the accuracy of measured d33.


2021 ◽  
Vol 11 (12) ◽  
pp. 5523
Author(s):  
Qian Ye ◽  
Minyan Lu

The main purpose of our provenance research for DSP (distributed stream processing) systems is to analyze abnormal results. Provenance for these systems is not nontrivial because of the ephemerality of stream data and instant data processing mode in modern DSP systems. Challenges include but are not limited to an optimization solution for avoiding excessive runtime overhead, reducing provenance-related data storage, and providing it in an easy-to-use fashion. Without any prior knowledge about which kinds of data may finally lead to the abnormal, we have to track all transformations in detail, which potentially causes hard system burden. This paper proposes s2p (Stream Process Provenance), which mainly consists of online provenance and offline provenance, to provide fine- and coarse-grained provenance in different precision. We base our design of s2p on the fact that, for a mature online DSP system, the abnormal results are rare, and the results that require a detailed analysis are even rarer. We also consider state transition in our provenance explanation. We implement s2p on Apache Flink named as s2p-flink and conduct three experiments to evaluate its scalability, efficiency, and overhead from end-to-end cost, throughput, and space overhead. Our evaluation shows that s2p-flink incurs a 13% to 32% cost overhead, 11% to 24% decline in throughput, and few additional space costs in the online provenance phase. Experiments also demonstrates the s2p-flink can scale well. A case study is presented to demonstrate the feasibility of the whole s2p solution.


2021 ◽  
Author(s):  
Paolo Carbone ◽  
Guido De Angelis ◽  
Valter Pasku ◽  
Alessio De Angelis ◽  
Marco Dionigi ◽  
...  

<div><div><div><p>This paper describes the design and realization of a Magnetic Indoor Positioning System. The system is entirely realized using off-the-shelf components and is based on inductive coupling between resonating coils. Both system-level architecture and realization details are described along with experimental results. The realized system exhibits a maximum positioning error of less than 10 cm in an indoor environment over a 3×3 m2 area. Extensive experiments in larger areas, in non-line-of-sight conditions, and in unfavorable geometric configurations, show sub-meter accuracy, thus validating the robustness of the system with respect to other existing solutions.</p></div></div></div>


Author(s):  
G. I. Odnokopylov ◽  
Z. R. Galyautdinov ◽  
V. B. Maksimov

The paper presents the experimental results of strength and deformability of reinforced concrete slabs on yielding supports arranged along the perimeter under the dynamic loading. Crushable ring-shaped inserts deforming at the elastic, plastic and curing stages are considered as yielding supports. The displacement, velocity and acceleration are evaluated depending on the deformation stage of yielding supports. The high efficiency is shown for the use of yielding supports, which leads to a significant reduction in the structure displacement, strain, and stress.


2021 ◽  
Author(s):  
Paolo Carbone

<div> <div> <div> <p>This paper describes the design and realization of a 5.6 GHz ultra–wide bandwidth based position measurement system. The system has been entirely made using off–the– shelf components and achieves centimeter level accuracy in an indoor environment. It is based on asynchronous modulated pulse round–trip–time measurements. Both system level and realization details are described along with experimental results including estimates of measurement uncertainties. </p> </div> </div> </div>


2021 ◽  
Vol 20 (5s) ◽  
pp. 1-22
Author(s):  
Wei-Ming Chen ◽  
Tei-Wei Kuo ◽  
Pi-Cheng Hsiu

Intermittent systems enable batteryless devices to operate through energy harvesting by leveraging the complementary characteristics of volatile (VM) and non-volatile memory (NVM). Unfortunately, alternate and frequent accesses to heterogeneous memories for accumulative execution across power cycles can significantly hinder computation progress. The progress impediment is mainly due to more CPU time being wasted for slow NVM accesses than for fast VM accesses. This paper explores how to leverage heterogeneous cores to mitigate the progress impediment caused by heterogeneous memories. In particular, a delegable and adaptive synchronization protocol is proposed to allow memory accesses to be delegated between cores and to dynamically adapt to diverse memory access latency. Moreover, our design guarantees task serializability across multiple cores and maintains data consistency despite frequent power failures. We integrated our design into FreeRTOS running on a Cypress device featuring heterogeneous dual cores and hybrid memories. Experimental results show that, compared to recent approaches that assume single-core intermittent systems, our design can improve computation progress at least 1.8x and even up to 33.9x by leveraging core heterogeneity.


2018 ◽  
Vol 29 (13) ◽  
pp. 2754-2765 ◽  
Author(s):  
Shengli Tian ◽  
Xiaoan Chen ◽  
Ye He ◽  
Tianchi Chen ◽  
Peiming Li

A high-speed dynamic loading test is a key step when testing the dynamic performance and running quality of a high-speed motorized spindle. A loading test is very difficult to perform at high speeds. Based on the rheological behavior of the magnetorheological fluid, a novel high-speed dynamic loading system for a high-speed motorized spindle was designed, fabricated, and tested. The working principles and structure of this loading system are described. The torque model of the loader was derived based on the Herschel–Bulkley model and electromagnetic simulation using the finite element method. In addition, the torque–current relationship under different speeds was analyzed by experiments, and we found non-linear relationships between the viscosity and shear stress of the magnetorheological fluid with the shear rate. The Herschel–Bulkley model was corrected by fitting for the experimental results. The loading torque, calculated by the modified model, complied with the experimental results. This lays the foundation for the design of a high-speed transmission device based on the magnetorheological shear principle. Experiments of torque stability, temperature stability, and reusability verified the feasibility and accuracy of the proposed loading system. It provides a novel method to test the dynamic loading performance of high-speed motorized spindles.


Sign in / Sign up

Export Citation Format

Share Document