High-Performance Computing Probabilistic Fracture Mechanics Implementation for Gas Turbine Rotor Disks On Distributed Architectures Including Graphics Processing Units (GPUS)

Author(s):  
Mrugesh Gajjar ◽  
Christian Amann ◽  
Kai Kadau

Abstract We present an efficient Monte Carlo (MC) based probabilistic fracture mechanics simulation implementation on heterogeneous high-performance (HPC) architectures including CPUs and GPUs for large heavy-duty gas turbine rotor components for the energy sector. A reliable probabilistic risk quantification requires simulating millions to billions of MC samples. We apply a modified Runge-Kutta algorithm to solve numerically the fatigue crack growth for this large number of cracks for varying initial crack sizes, locations, material and service conditions. This compute intensive simulation was demonstrated to perform efficiently and scalable on parallel and distributed architectures with hundreds of CPUs utilizing Message Passing Interface (MPI). In this work, we include GPUs in parallelization strategy. We develop a load distribution scheme to share one or more GPUs on compute nodes distributed over network. We detail technical challenges and strategies in performing the simulations on GPUs efficiently. We show that the key computation of the modified Runge-Kutta integration step speeds up over two orders of magnitude on a typical GPU compared to a single threaded CPU supported by use of GPU textures for efficient interpolation of multi-dimensional tables. We demonstrate weak and strong scaling of our GPU implementation, i.e., that we can efficiently utilize large number of GPUs/CPUs to solve for more MC samples, or reduce the computational turnaround time, respectively. On seven different GPUs spanning four generations, our probabilistic fracture mechanics simulation tool ProbFM achieves speedups ranging from 16.4x to 47.4x compared to single threaded CPU implementation.

2021 ◽  
Author(s):  
Mrugesh Gajjar ◽  
Christian Amann ◽  
Kai Kadau

Abstract We present an efficient Monte Carlo based probabilistic fracture mechanics simulation implementation for heterogeneous high-performance (HPC) architectures including CPUs and GPUs. The specific application focuses on large heavy-duty gas turbine rotor components for the energy sector. A reliable probabilistic risk quantification requires the simulation of millions to billions of Monte Carlo (MC) samples. We apply a modified Runge-Kutta algorithm in order to solve numerically the fatigue crack growth for this large number of cracks for varying initial crack sizes, locations, material and service conditions. This compute intensive simulation has already been demonstrated to perform efficiently and scalable on parallel and distributed HPC architectures including hundreds of CPUs utilizing the Message Passing Interface (MPI) paradigm. In this work, we go a step further and include GPUs in our parallelization strategy. We develop a load distribution scheme to share one or more GPUs on compute nodes distributed over a network. We detail technical challenges and solution strategies in performing the simulations on GPUs efficiently. We show that the key computation of the modified Runge-Kutta integration step speeds up over two orders of magnitude on a typical GPU compared to a single threaded CPU. This is supported by our use of GPU textures for efficient interpolation of multi-dimensional tables utilized in the implementation. We demonstrate weak and strong scaling of our GPU implementation, i.e., that we can efficiently utilize a large number of GPUs/CPUs in order to solve for more MC samples, or reduce the computational turn-around time, respectively. On seven different GPUs spanning four generations, the presented probabilistic fracture mechanics simulation tool ProbFM achieves a speed-up ranging from 16.4x to 47.4x compared to single threaded CPU implementation.


Author(s):  
Kai Kadau ◽  
Phillip W. Gravett ◽  
Christian Amann

We developed and successfully applied a direct simulation Monte-Carlo scheme to quantify the risk of fracture for heavy duty rotors commonly used in the energy sector. The developed Probabilistic Fracture Mechanics high-performance computing methodology and code ProbFM routinely assesses relevant modes of operation for a component by performing billions of individual fracture mechanics simulations. The methodology can be used for new design and life-optimization of components, as well as for the risk of failure quantification of in service rotors and their re-qualifications in conjunction with non-destructive examination techniques, such as ultrasonic testing. The developed probabilistic scheme integrates material data, ultra-sonic testing information, duty-cycle data, and finite element analysis in order to determine the risk of failure. The methodology provides an integrative and robust measure of the fitness for service and allows for a save and reliable operation management of heavy duty rotating equipment.


2018 ◽  
Vol 140 (6) ◽  
Author(s):  
Kai Kadau ◽  
Phillip W. Gravett ◽  
Christian Amann

We developed and successfully applied a direct simulation Monte Carlo (MC) scheme to quantify the risk of fracture for heavy-duty rotors commonly used in the energy sector. The developed probabilistic fracture mechanics (FM), high-performance computing methodology, and code ProbFM routinely assess relevant modes of operation for a component by performing billions of individual FM simulations. The methodology can be used for new design and life optimization of components, as well as for the risk of failure RoF quantification of in service rotors and their requalifications in conjunction with nondestructive examination techniques, such as ultrasonic testing (UT). The developed probabilistic scheme integrates material data, UT information, duty-cycle data, and finite element analysis (FEA) in order to determine the RoF. The methodology provides an integrative and robust measure of the fitness for service and allows for a save and reliable operation management of heavy-duty rotating equipment.


Author(s):  
Alan Gray ◽  
Kevin Stratford

Leading high performance computing systems achieve their status through use of highly parallel devices such as NVIDIA graphics processing units or Intel Xeon Phi many-core CPUs. The concept of performance portability across such architectures, as well as traditional CPUs, is vital for the application programmer. In this paper we describe targetDP, a lightweight abstraction layer which allows grid-based applications to target data parallel hardware in a platform agnostic manner. We demonstrate the effectiveness of our pragmatic approach by presenting performance results for a complex fluid application (with which the model was co-designed), plus separate lattice quantum chromodynamics particle physics code. For each application, a single source code base is seen to achieve portable performance, as assessed within the context of the Roofline model. TargetDP can be combined with Message Passing Interface (MPI) to allow use on systems containing multiple nodes: we demonstrate this through provision of scaling results on traditional and graphics processing unit-accelerated large scale supercomputers.


Author(s):  
Indar Sugiarto ◽  
Doddy Prayogo ◽  
Henry Palit ◽  
Felix Pasila ◽  
Resmana Lim ◽  
...  

This paper describes a prototype of a computing platform dedicated to artificial intelligence explorations. The platform, dubbed as PakCarik, is essentially a high throughput computing platform with GPU (graphics processing units) acceleration. PakCarik is an Indonesian acronym for Platform Komputasi Cerdas Ramah Industri Kreatif, which can be translated as “Creative Industry friendly Intelligence Computing Platform”. This platform aims to provide complete development and production environment for AI-based projects, especially to those that rely on machine learning and multiobjective optimization paradigms. The method for constructing PakCarik was based on a computer hardware assembling technique that uses commercial off-the-shelf hardware and was tested on several AI-related application scenarios. The testing methods in this experiment include: high-performance lapack (HPL) benchmarking, message passing interface (MPI) benchmarking, and TensorFlow (TF) benchmarking. From the experiment, the authors can observe that PakCarik's performance is quite similar to the commonly used cloud computing services such as Google Compute Engine and Amazon EC2, even though falls a bit behind the dedicated AI platform such as Nvidia DGX-1 used in the benchmarking experiment. Its maximum computing performance was measured at 326 Gflops. The authors conclude that PakCarik is ready to be deployed in real-world applications and it can be made even more powerful by adding more GPU cards in it.


Author(s):  
H. R. Millwater ◽  
Y.-T. Wu ◽  
J. W. Cardinal ◽  
G. G. Chell

This paper describes the application of an advanced probabilistic fracture mechanics computational algorithm with inspection simulation to the probabilistic life assessment of a turbine blade attachment, sometimes referred to as a steeple or fir tree. The life of the steeple is limited by high cycle fatigue. The methodology utilized combines structural finite element analysis, stochastic fatigue crack growth, and crack inspection and repair. The resulting information provides the engineer with an assessment of the probability of failure of the structure as a function of operating time and the effect of the inspection procedure. This information can form the basis of inspection planning and retirement-for-cause decisions.


Author(s):  
Scott Keller

The failure of vital components is not uncommon in the gas turbine industry. In the event excessive degradation occurs within a component, e.g. extensive cracking in a turbine blade or vane, solutions exist to either repair or replace defective parts. Such parts are readily accessible and mostly exchangeable in the field, limiting the amount of outage time and assessment required for defective parts. When more critical components exhibit extreme wear or cracking, e.g. a crack in a rotor disk, repairs typically necessitate a complete rotor destack and refurbishment or have the potential to require the replacement of individual disks. In extreme cases, defects found in rotor disks can be known to retire an entire compressor or turbine rotor. The OEM solution of replacing disks puts a substantial cost on the customer, thus providing an incentive for characterization and advanced analyses to determine the residual life in critical rotating components. Considered an advanced analysis, linear elastic fracture mechanics (LEFM) provides the theory and fundamental structure to conduct crack growth analyses in components that exhibit nominally elastic behavior. Successful implementation of LEFM requires extensive characterization of the material, engine operating boundary conditions, and high fidelity finite element models. Upon the detection of a flaw, whether an internal or external indication, the results from finite element analyses can be used to derive the crack tip stress field and subsequent crack tip driving parameters. These parameters are then utilized in a comprehensive crack propagation model, calibrated to temperature- and load-dependent material data, to determine the number of cycles to unstable propagation. As a result, the remaining life of a component with a given indication is readily obtained, enabling our engineering team to provide a thorough life assessment of critical rotating components. An overview of the linear elastic fracture mechanics crack growth analyses conducted is presented, with a special emphasis on compressor and turbine disks.


Author(s):  
Héctor Martínez ◽  
Sergio Barrachina ◽  
Maribel Castillo ◽  
Joaquín Tárraga ◽  
Ignacio Medina ◽  
...  

The advances in genomic sequencing during the past few years have motivated the development of fast and reliable software for DNA/RNA sequencing on current high performance architectures. Most of these efforts target multicore processors, only a few can also exploit graphics processing units, and a much smaller set will run in clusters equipped with any of these multi-threaded architecture technologies. Furthermore, the examples that can be used on clusters today are all strongly coupled with a particular aligner. In this paper we introduce an alignment framework that can be leveraged to coordinately run any “single-node” aligner, taking advantage of the resources of a cluster without having to modify any portion of the original software. The key to our transparent migration lies in hiding the complexity associated with the multi-node execution (such as coordinating the processes running in the cluster nodes) inside the generic-aligner framework. Moreover, following the design and operation in our Message Passing Interface (MPI) version of HPG Aligner RNA BWT, we organize the framework into two stages in order to be able to execute different aligners in each one of them. With this configuration, for example, the first stage can ideally apply a fast aligner to accelerate the process, while the second one can be tuned to act as a refinement stage that further improves the global alignment process with little cost.


Author(s):  
Christian Amann ◽  
Kai Kadau ◽  
Peter Gumbsch

In the development of heavy duty gas turbines for the energy sector oftentimes the majority of the design work is performed in either the 50 Hz or the 60 Hz size. Many aspects of the designed engine for one market (50 Hz as an example) can then be used to design with significantly reduced effort for the other market (i.e. 60 Hz). For example, most dimensions of rotor components can be geometrically scaled such that centrifugal forces in those massively rotating parts are conserved. This article investigates the transferability of probabilistic fracture mechanics results from one market to the other one. Or in other words: can we perform probabilistic fracture mechanics for the 50 Hz rotor design and deduce the risk of failure for the scaled 60 Hz design (or vice versa)? Multiple challenges must be considered in the transferability including the different volume, surface to volume ratio, as well as the different transient behavior for the smaller 60 Hz design. We address that challenge by building up complexity for a generic rotor design in order to separate the different effects and associated design features. We then discuss several Siemens rotor designs with respect to transferability of probabilistic fracture mechanics results and correlations to deterministic approaches. This work enables the creation of design rules to avoid unnecessary work for scaled 50 Hz/60 Hz market engines and therefore supports the reduction of product development costs.


Sign in / Sign up

Export Citation Format

Share Document