knights landing Latest Research Papers

В настоящее время поиск похожих подпоследовательностей требуется в широком спектре приложений интеллектуального анализа временных рядов: моделирование климата, финансовые прогнозы, медицинские исследования и др. В большинстве указанных приложений при поиске используется мера схожести Dynamic Time Warping (DTW), поскольку на сегодняшний день научное сообщество признает меру DTW одной из лучших для большинства предметных областей. Мера DTW имеет квадратичную вычислительную сложность относительно длины искомой подпоследовательности, в силу чего разработан ряд параллельных алгоритмов ее вычисления на устройствах FPGA и многоядерных ускорителях с архитектурами GPU и Intel MIC. В настоящей статье предлагается новый параллельный алгоритм для поиска похожих подпоследовательностей в сверхбольших временных рядах на кластерных системах с узлами на базе многоядерных процессоров Intel Xeon Phi поколения Knights Landing (KNL). Вычисления распараллеливаются на двух уровнях: на уровне всех узлов кластера - с помощью технологии MPI и в рамках одного узла кластера - с помощью технологии OpenMP. Алгоритм предполагает использование дополнительных структур данных и избыточных вычислений, позволяющих эффективно задействовать возможности векторизации вычислений на процессорных системах Phi KNL. Эксперименты, проведенные на синтетических и реальных наборах данных, показали хорошую масштабируемость алгоритма. Nowadays, the subsequence similarity search is required in a wide range of time series mining applications: climate modeling, financial forecasts, medical research, etc. In most of these applications, the Dynamic Time Warping (DTW) similarity measure is used, since DTW is empirically confirmed as one of the best similarity measures for the majority of subject domains. Since the DTW measure has a quadratic computational complexity with respect to the length of query subsequence, a number of parallel algorithms for various many-core architectures are developed, namely FPGA, GPU, and Intel MIC. In this paper we propose a new parallel algorithm for subsequence similarity search in very large time series on computer cluster systems with nodes based on Intel Xeon Phi Knights Landing (KNL) many-core processors. Computations are parallelized on two levels as follows: by MPI at the level of all cluster nodes and by OpenMP within a single cluster node. The algorithm involves additional data structures and redundant computations, which make it possible to efficiently use the capabilities of vector computations on Phi KNL. Experimental evaluation of the algorithm on real-world and synthetic datasets shows that the proposed algorithm is highly scalable.

Download Full-text

Optimization strategies for geophysics models on manycore systems

The International Journal of High Performance Computing Applications ◽

10.1177/1094342018824150 ◽

2019 ◽

Vol 33 (3) ◽

pp. 473-486 ◽

Cited By ~ 4

Author(s):

Matheus S Serpa ◽

Eduardo HM Cruz ◽

Matthias Diener ◽

Arthur M Krause ◽

Philippe OA Navaux ◽

...

Keyword(s):

Wave Propagation ◽

High Performance ◽

Oil And Gas ◽

State Of The Art ◽

Propagation Model ◽

Cache Memory ◽

Performance Scaling ◽

Knights Landing ◽

The Impact ◽

Performance Computing

Many software mechanisms for geophysics exploration in oil and gas industries are based on wave propagation simulation. To perform such simulations, state-of-the-art high-performance computing architectures are employed, generating results faster with more accuracy at each generation. The software must evolve to support the new features of each design to keep performance scaling. Furthermore, it is important to understand the impact of each change applied to the software to improve the performance as most as possible. In this article, we propose several optimization strategies for a wave propagation model for six architectures: Intel Broadwell, Intel Haswell, Intel Knights Landing, Intel Knights Corner, NVIDIA Pascal, and NVIDIA Kepler. We focus on improving the cache memory usage, vectorization, load balancing, portability, and locality in the memory hierarchy. We analyze the hardware impact of the optimizations, providing insights of how each strategy can improve the performance. The results show that NVIDIA Pascal outperforms the other considered architectures by up to 8.5[Formula: see text].

Download Full-text

Optimization of elastodynamic finite integration technique on Intel Xeon Phi Knights Landing processors

Journal of Computational Physics ◽

10.1016/j.jcp.2018.07.049 ◽

2018 ◽

Vol 374 ◽

pp. 550-562 ◽

Cited By ~ 3

Author(s):

William C. Schneck ◽

Elizabeth D. Gregory ◽

Cara A.C. Leckey

Keyword(s):

Xeon Phi ◽

Intel Xeon Phi ◽

Integration Technique ◽

Finite Integration Technique ◽

Knights Landing ◽

Intel Xeon

Download Full-text

Multiobjective Evaluation and Optimization of CMT-bone on Intel Knights Landing

2018 Ninth International Green and Sustainable Computing Conference (IGSC) ◽

10.1109/igcc.2018.8752152 ◽

2018 ◽

Author(s):

Mohamed Gadou ◽

Tania Banerjee ◽

Meena Arunachalam ◽

Galen Shipman ◽

Sanjay Ranka

Keyword(s):

Knights Landing

Download Full-text

Performance Implications of Global Virtual Time Algorithms on a Knights Landing Processor

2018 IEEE/ACM 22nd International Symposium on Distributed Simulation and Real Time Applications (DS-RT) ◽

10.1109/distra.2018.8600923 ◽

2018 ◽

Author(s):

Ali Eker ◽

Barry Williams ◽

Nitesh Mishra ◽

Dushyant Thakur ◽

Kenneth Chiu ◽

...

Keyword(s):

Virtual Time ◽

Knights Landing

Download Full-text

Performance and Energy Evaluation of SAR Reconstruction on Intel Knights Landing

2018 Ninth International Green and Sustainable Computing Conference (IGSC) ◽

10.1109/igcc.2018.8752136 ◽

2018 ◽

Author(s):

Adeesha Wijayasiri ◽

Tania Banerjee ◽

Sanjay Ranka ◽

Sartaj Sahni ◽

Mark Schmalz

Keyword(s):

Energy Evaluation ◽

Knights Landing

Download Full-text

knights landing
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Improving blocked matrix-matrix multiplication routine by utilizing AVX-512 instructions on intel knights landing and xeon scalable processors

Scaling Deep Learning workloads: NVIDIA DGX-1/Pascal and Intel Knights Landing

Tour 6B. Junction with US 395–Portola–Blairsden–Quincy–Rich Bar–Oroville–Marysville–Knights Landing–Woodland; 203.1 m. State 24

Scalability of Hybrid SpMV on Intel Xeon Phi Knights Landing

The use of MPI and OpenMP technologies for subsequence similarity search in very long time series on a computer cluster system with nodes based on the Intel Xeon Phi Knights Landing many-core processor

Optimization strategies for geophysics models on manycore systems

Optimization of elastodynamic finite integration technique on Intel Xeon Phi Knights Landing processors

Multiobjective Evaluation and Optimization of CMT-bone on Intel Knights Landing

Performance Implications of Global Virtual Time Algorithms on a Knights Landing Processor

Performance and Energy Evaluation of SAR Reconstruction on Intel Knights Landing

Export Citation Format

knights landingRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Improving blocked matrix-matrix multiplication routine by utilizing AVX-512 instructions on intel knights landing and xeon scalable processors

Scaling Deep Learning workloads: NVIDIA DGX-1/Pascal and Intel Knights Landing

Tour 6B. Junction with US 395–Portola–Blairsden–Quincy–Rich Bar–Oroville–Marysville–Knights Landing–Woodland; 203.1 m. State 24

Scalability of Hybrid SpMV on Intel Xeon Phi Knights Landing

The use of MPI and OpenMP technologies for subsequence similarity search in very long time series on a computer cluster system with nodes based on the Intel Xeon Phi Knights Landing many-core processor

Optimization strategies for geophysics models on manycore systems

Optimization of elastodynamic finite integration technique on Intel Xeon Phi Knights Landing processors

Multiobjective Evaluation and Optimization of CMT-bone on Intel Knights Landing

Performance Implications of Global Virtual Time Algorithms on a Knights Landing Processor

Performance and Energy Evaluation of SAR Reconstruction on Intel Knights Landing

knights landing
Recently Published Documents