Integrated GPU: Latest Research Papers

Straightforward Heterogeneous Computing with the oneAPI Coexecutor Runtime

Electronics ◽

10.3390/electronics10192386 ◽

2021 ◽

Vol 10 (19) ◽

pp. 2386

Author(s):

Raúl Nozal ◽

Jose Luis Bosque

Keyword(s):

Energy Efficiency ◽

High Performance ◽

Heterogeneous Computing ◽

Programming Model ◽

Heterogeneous Systems ◽

Ease Of Use ◽

Embedded Devices ◽

Computing Systems ◽

Key Points ◽

Integrated Gpu

Heterogeneous systems are the core architecture of most computing systems, from high-performance computing nodes to embedded devices, due to their excellent performance and energy efficiency. Efficiently programming these systems has become a major challenge because of the complexity of their architectures and the effort required to provide co-execution capabilities that let applications fully exploit them. There are many proposals to simplify the programming and management of acceleration devices and multi-core CPUs. However, in many cases, portability and ease of use compromise the efficiency of different devices, even more so when co-executing. Intel oneAPI, a new and powerful standards-based unified programming model built on top of SYCL, addresses these issues. In this paper, oneAPI is provided with co-execution strategies to run the same kernel across different devices, enabling the exploitation of static and dynamic policies. This work evaluates performance and energy efficiency for a well-known set of regular and irregular HPC benchmarks, using two heterogeneous systems composed of an integrated GPU and a CPU. Static and dynamic load balancers are integrated and evaluated, highlighting single-device and co-execution strategies and the most significant key points of this promising technology. Experimental results show that co-execution is worthwhile when using dynamic algorithms and improves efficiency even further when using unified shared memory.

Efficient ROS-Compliant CPU-iGPU Communication on Embedded Platforms

Journal of Low Power Electronics and Applications ◽

10.3390/jlpea11020024 ◽

2021 ◽

Vol 11 (2) ◽

pp. 24

Author(s):

Mirco De Marchi ◽

Francesco Lumpp ◽

Enrico Martini ◽

Michele Boldo ◽

Stefano Aldegheri ◽

...

Keyword(s):

Energy Savings ◽

Communication Model ◽

Embedded Devices ◽

Performance Loss ◽

Zero Copy ◽

System Memory ◽

Easy Integration ◽

Integrated Gpu ◽

The Cost ◽

Advanced Model

Many modern programmable embedded devices contain CPUs and a GPU that share the same system memory on a single die. Such a unified memory architecture (UMA) allows programmers to implement different communication models between the CPU and the integrated GPU (iGPU). The simpler model guarantees implicit synchronization at the cost of performance, whereas the more advanced model uses the zero-copy paradigm to eliminate explicit data copying between CPU and iGPU, significantly improving performance and energy savings. Meanwhile, the Robot Operating System (ROS) has become a de facto reference standard for developing robotic applications. It allows for application reuse and the easy integration of software blocks in complex cyber-physical systems. Although ROS compliance is strongly required for software portability and reuse, it can lead to performance loss and forfeit the benefits of zero-copy communication. In this article we present efficient techniques to implement CPU–iGPU communication while guaranteeing compliance with the ROS standard. We show how the key features of each communication model are maintained and what overhead ROS compliance introduces.

Performance and energy consumption of a Gram–Schmidt process for vector orthogonalization on a processor integrated GPU

Sustainable Computing Informatics and Systems ◽

10.1016/j.suscom.2020.100456 ◽

2020 ◽

pp. 100456

Author(s):

Thomas Jakobs ◽

Lukas Reinhardt ◽

Gudula Rünger

Keyword(s):

Energy Consumption ◽

Integrated Gpu

iGPU Leak: An Information Leakage Vulnerability on Intel Integrated GPU

2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC) ◽

10.1109/asp-dac47756.2020.9045745 ◽

2020 ◽

Author(s):

Wenjian HE ◽

Wei Zhang ◽

Sharad Sinha ◽

Sanjeev Das

Keyword(s):

Information Leakage ◽

Integrated Gpu

A Study of Geodesic Distance Kernel on an Integrated GPU

10.2172/1576565 ◽

2019 ◽

Author(s):

Zheming Jin

Keyword(s):

Geodesic Distance ◽

Integrated Gpu

Towards an Integrated GPU Accelerated SoC as a Flight Computer for Small Satellites

2019 IEEE Aerospace Conference ◽

10.1109/aero.2019.8741765 ◽

2019 ◽

Cited By ~ 1

Author(s):

Caleb Adams ◽

Allen Spain ◽

Jackson Parker ◽

Matthew Hevert ◽

James Roach ◽

...

Keyword(s):

Small Satellites ◽

Integrated Gpu ◽

Flight Computer

What is driving the TSV business: Market & Technology Trends

Additional Conferences (Device Packaging HiTEC HiTEN & CICMT) ◽

10.4071/2380-4491-2019-dpc-presentation_wp1_060 ◽

2019 ◽

Vol 2019 (DPC) ◽

pp. 000808-000833

Author(s):

Santosh Kumar

Keyword(s):

Mobile Communications ◽

Autonomous Driving ◽

Image Sensor ◽

Rf Filters ◽

Business Market ◽

End Products ◽

Communications Protocol ◽

Integrated Gpu ◽

5G Mobile Communications ◽

Technology Trends

TSV-interconnect-based 3D/2.5D packaging has gained significant attention since its introduction in FPGAs (for die partitioning) and HBM-integrated GPU modules (for gaming applications). The performance potential offered by this technology is unequalled by any other packaging platform today. High-end applications like deep learning, datacenter networking, AR/VR, and autonomous driving are becoming real, thereby pushing the limits of other current packaging platforms. Fueled by increasing bandwidth needs for moving data in cloud-computing and supercomputing applications, performance-driven markets have adopted 3D stacked technologies one after another. Imaging, as the first market adopter of 3D integration, is propelling the market with an increasing number of sensors in smartphones and tablets, including 3D imaging. TSV-based products can be classified in three ranges: low, middle, and high-end. The middle and high-end product markets, such as CMOS image sensors, memory cubes, and interposers, are based on a via-middle process. In low-end products, we can also find TSVs based on via-middle (e.g., in Apple's fingerprint sensor), but for cost reasons the MEMS industry essentially uses a via-last process, which is cheaper than a via-middle process. TSV's penetration rate in low-end products will remain stable, with the main source of growth being RF filters in smartphone front-end modules, whose number keeps increasing in order to support the different frequency bands used in the 5G mobile communications protocol. This presentation will discuss the market and technology trends of TSV-based 3D/2.5D packaging.

General purpose Arithmetic computation on AMD Platform, APU Based with Integrated GPU and finding Empirical results of GPGPU Acceleration

2018 4th International Conference for Convergence in Technology (I2CT) ◽

10.1109/i2ct42659.2018.9058254 ◽

2018 ◽

Author(s):

Sneha Shetty R. ◽

Raghavendra Swamy

Keyword(s):

General Purpose ◽

Empirical Results ◽

Arithmetic Computation ◽

Integrated Gpu

Performance Characterisation and Simulation of Intel's Integrated GPU Architecture

2018 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) ◽

10.1109/ispass.2018.00027 ◽

2018 ◽

Cited By ~ 6

Author(s):

Prasun Gera ◽

Hyojong Kim ◽

Hyesoon Kim ◽

Sunpyo Hong ◽

Vinod George ◽

...

Keyword(s):

Integrated Gpu ◽

Gpu Architecture

A statistic approach for power analysis of integrated GPU

Soft Computing ◽

10.1007/s00500-017-2786-1 ◽

2017 ◽

Vol 23 (3) ◽

pp. 827-836 ◽

Cited By ~ 2

Author(s):

Qiong Wang ◽

Ning Li ◽

Li Shen ◽

Zhiying Wang

Keyword(s):

Power Analysis ◽

Integrated Gpu ◽

Statistic Approach
