Experimental Analysis of Chiller Cooling Failure in a Small Size Data Center Environment Using Wireless Instrumentation

Author(s):  
Mohammad Tradat ◽  
Bahgat Sammakia ◽  
Husam Alissa ◽  
Kourosh Nemati

Given the vital rule of data center availability and since the inlet temperature of the IT equipment increase rapidly until reaching a certain threshold value after which IT starts throttling or shut down because of overheat during cooling system failure. Hence, it is especially important to understand failures and their effects. This study presented experimental investigation and analysis of a facility-level cooling system failure scenario in which chilled water interruption introduced to the data center. Quantitative instrumentation tools including wireless technology such as wireless temperature and pressure sensors were used to measure the discrete air inlet temperature and pressure differential though cold aisle enclosure, respectively. In addition, Intelligent Platform Management Interface (IPMI) and cooling system data during failure/recovery were reported. Furthermore, the IT equipment performance and response for opened and contained environments were simulated and compared. Finally, an experiment based analysis of the Ride Through Time (RTT) of servers during chilled water interruption of the cooling infrastructure presented as well. The results showed that for all three classes of servers tested during the cooling failure, CAC helped keep the server’s cooler for longer. The containment provided a barrier between the hot and cold air streams and caused slight negative pressure to build up, which allowed the servers to pull cold air from the underfloor plenum. In addition, the results show that the effect of CAC in containment solutions on the IT equipment performance and response could vary and depend on the server’s airflow, generation and hence types of servers deployed in cold aisle enclosure. Moreover, it was shown that when compared to the discrete sensors, the IPMI inlet temperature sensors underestimate the Ride Through Time (RTT) by 42% and 12% for the CAC and opened cases, respectively.

Energies ◽  
2019 ◽  
Vol 12 (15) ◽  
pp. 2996 ◽  
Author(s):  
Jinkyun Cho ◽  
Beungyong Park ◽  
Yongdae Jeong

If a data center experiences a system outage or fault conditions, it becomes difficult to provide a stable and continuous information technology (IT) service. Therefore, it is critical to design and implement a backup system so that stability can be maintained even in emergency (unforeseen) situations. In this study, an actual 20 MW data center project was analyzed to evaluate the thermal performance of an IT server room during a cooling system outage under six fault conditions. In addition, a method of organizing and systematically managing operational stability and energy efficiency verification was identified for data center construction in accordance with the commissioning process. Up to a chilled water supply temperature of 17 °C and a computer room air handling unit air supply temperature of 24 °C, the temperature of the air flowing into the IT server room fell into the allowable range specified by the American Society of Heating, Refrigerating, and Air-Conditioning Engineers standard (18–27 °C). It was possible to perform allowable operations for approximately 320 s after cooling system outage. Starting at a chilled water supply temperature of 18 °C and an air supply temperature of 25 °C, a rapid temperature increase occurred, which is a serious cause of IT equipment failure. Due to the use of cold aisle containment and designs with relatively high chilled water and air supply temperatures, there is a high possibility that a rapid temperature increase inside an IT server room will occur during a cooling system outage. Thus, the backup system must be activated within 300 s. It is essential to understand the operational characteristics of data centers and design optimal cooling systems to ensure the reliability of high-density data centers. In particular, it is necessary to consider these physical results and to perform an integrated review of the time required for emergency cooling equipment to operate as well as the backup system availability time.


Author(s):  
Tianyi Gao ◽  
James Geer ◽  
Russell Tipton ◽  
Bruce Murray ◽  
Bahgat G. Sammakia ◽  
...  

The heat dissipated by high performance IT equipment such as servers and switches in data centers is increasing rapidly, which makes the thermal management even more challenging. IT equipment is typically designed to operate at a rack inlet air temperature ranging between 10 °C and 35 °C. The newest published environmental standards for operating IT equipment proposed by ASHARE specify a long term recommended dry bulb IT air inlet temperature range as 18°C to 27°C. In terms of the short term specification, the largest allowable inlet temperature range to operate at is between 5°C and 45°C. Failure in maintaining these specifications will lead to significantly detrimental impacts to the performance and reliability of these electronic devices. Thus, understanding the cooling system is of paramount importance for the design and operation of data centers. In this paper, a hybrid cooling system is numerically modeled and investigated. The numerical modeling is conducted using a commercial computational fluid dynamics (CFD) code. The hybrid cooling strategy is specified by mounting the in row cooling units between the server racks to assist the raised floor air cooling. The effect of several input variables, including rack heat load and heat density, rack air flow rate, in row cooling unit operating cooling fluid flow rate and temperature, in row coil effectiveness, centralized cooling unit supply air flow rate, non-uniformity in rack heat load, and raised floor height are studied parametrically. Their detailed effects on the rack inlet air temperatures and the in row cooler performance are presented. The modeling results and corresponding analyses are used to develop general installation and operation guidance for the in row cooler strategy of a data center.


Energies ◽  
2021 ◽  
Vol 14 (12) ◽  
pp. 3575
Author(s):  
Yu-Jin Kim ◽  
Ju-Wan Ha ◽  
Kyung-Soon Park ◽  
Young-Hak Song

The degree of integration of IT devices and consumption of cooling energy are consistently increasing owing to developments in the data center industry. Hence, to ensure the smooth operation and fault prevention of IT devices, the energy consumption of cooling systems has increased, leading to active research on improvements in cooling system performance for reducing energy consumption. This study examines the reduction in cooling energy consumption using a simulation by applying chilled water control and a water-side economizer (WSE) system to enhance the cooling system efficiency. The simulation results showed that the energy consumption was reduced by 1.8% when the chilled water temperature was set to 11 °C in a conventional system and by up to 19.6% when WSE was also applied. Furthermore, when the changes in chilled water temperature were applied for efficient operation of WSE, the energy consumption was reduced by up to 30.1% compared to that in conventional energy systems.


Author(s):  
Betsegaw Gebrehiwot ◽  
Nikhil Dhiman ◽  
Kasturi Rajagopalan ◽  
Dereje Agonafer ◽  
Naveen Kannan ◽  
...  

An information technology (IT) container needs to be supplied with cold air to cool IT equipment housed in it. The type of cooling system to be used depends on many factors including geographical location of the modular data center. Data centers located in regions where the climate is cold benefit from use of air-side economization (ASE) and those located in hot and dry climate benefit from use of direct and/or indirect evaporative cooling (DIEC) systems. In terms of energy saving, ASE, direct evaporative cooling (DEC) system, and indirect evaporative (IEC) systems are better than compressor based cooling systems such as computer room air conditioning (CRAC) units and air handling units (AHU). In this study, an existing DIEC unit which can also be operated in ASE mode is modeled in a computational fluid dynamics (CFD) tool. The cooling unit is intended to be used for supplying cold air to a containerized data center with specified volume flow rate, dry-bulb temperature and relative humidity. The CFD model is compared with published data of the cooling unit to see how well the CFD model represents the actual system and few design improvement ideas are tested by modifying the CFD model and running simulations. Results show that supplying air horizontally or as a downdraft to an IT container has negligible effect on the overall system. Results also show that orientation of dampers and placement of blanking panels inside the mixing chamber could affect the lifespan of air filters.


Author(s):  
Tianyi Gao ◽  
Marcelo del Valle ◽  
Alfonso Ortega ◽  
Bahgat G. Sammakia

The cross flow heat exchanger is at the heart of most cooling systems for data centers. Air/Water or air/refrigerant heat exchangers are the principal component in Central Room Air Conditioning (CRAC) units that condition data room air that is delivered through an underfloor plenum. Liquid/air heat exchangers are also increasingly deployed in close-coupled cooling systems such as rear door heat exchangers, in-row coolers, and overhead coolers. In all cases, the performance of liquid/air heat exchangers in both steady state and transient scenarios are of principal concern. Transient scenarios occur either by the accidental failure of the cooling system or by intentional dynamic control of the cooling system. In either scenario, transient boundary conditions involve time-dependent air or liquid inlet temperatures and mass flow rates that may be coupled in any number of potential combinations. Understanding and characterizing the performance of the heat exchanger in these transient scenarios is of paramount importance for designing better thermal solutions and improving the operational efficiency of existing cooling systems. In this paper, the transient performance of water to air cross flow heat exchangers is studied using numerical modeling and experimental measurements. Experimental measurements in 12 in. × 12 in. heat exchanger cores were performed, in which the liquid (water) mass flow rate or inlet temperature are varied in time following controlled functional forms (step jump, ramp). The experimental data were used to validate a transient numerical model developed with traditional assumptions of space averaging of heat transfer coefficients, and volume averaging of thermal capacitances. The complete numerical model was combined with the transient effectiveness methodology in which the traditional heat exchanger effectiveness approach is extended into a transient domain, and is then used to model the heat exchanger transient response. Different transient scenarios were parametrically studied to develop an understanding of the impact of critical variables such as, the fluid inlet temperature variation and the fluid mass flow rate variation, and a more comprehensive understanding of the characteristics of the transient effectiveness. Agreement between the novel transient effectiveness modeling approach and the experimental measurements enable use of the models as verified predictive design tools. Several studies are designed based on the practical problems related to data center thermal environments and the results are analyzed.


Author(s):  
Uschas Chowdhury ◽  
Walter Hendrix ◽  
Thomas Craft ◽  
Willis James ◽  
Ankit Sutaria ◽  
...  

Abstract In a data center, electronic equipment such as server and switches dissipate heat and the corresponding cooling systems contribute to typically 25–35% of total energy consumption. The heat load continues to increase as there is a greater need for miniaturization and convergence. In 2014, data centers in the U.S. consumed an estimated 70 billion kWh, representing about 1.8% of total U.S. electricity consumption. Based on current trend estimates, U.S. data centers are projected to consume approximately 73 billion kWh in 2020 [1]. Many research and strategies are adopted to minimize energy cost. The recommended dry bulb temperature for long-term operation and reliability for air cooling is between 18–27°C and the largest allowable inlet temperature range to operate at is between 5°C and 45°C with American Society of Heating, Refrigeration, and Air-Conditioning Engineers (ASHRAE) enabling much broader allowable zones) [2]. But understanding a proper cooling system is very important especially for thermal management of IT equipment with high heat loads such as 1U or 2U multi-core, high-end servers and blade servers which provide more computing per watt. Many problems like high inlet temperature due to the mixing of hot air with cold air, local hot spots, lower system reliability, increased failure, and downtime may occur. Among many other approaches to managing high-density racks, in-row coolers are used in between racks to provide cold air and minimize local hot spots. This paper describes a computational study being performed by applying in-row coolers for different rack power configuration with and without aisle containment. The power, as well as the number of racks, are varied to study the effect of raised inlet temperature for the IT equipment in a Computational Fluid Dynamics (CFD) model developed in 6SigmaRoom with the help of built-in library items. A comparative analysis is also performed for a typical small-sized non-raised facility to investigate the efficacy and limitations of in-row coolers in thermal management of IT equipment with variation in rack heat load and containment. Several other aspects like a parametric study of variable opening areas of duct between racks and in-row coolers, the variation of operating flow rate and failure scenarios are also studied to find proper flow distribution, uniformity of outlet temperature and predict better performance, energy savings and reliability. The results are presented for general guidance for flexible and quick installation and safe operation of in-row coolers to improve thermal efficiency.


Author(s):  
Sami Alkharabsheh ◽  
Bharath Ramakrishnan ◽  
Bahgat Sammakia

In this paper, the impact of direct liquid cooling (DLC) system failure on the IT equipment is studied experimentally. The main factors that are anticipated to affect the IT equipment response during failure are the CPU utilization, coolant set point temperature (SPT) and the server type. These factors are varied experimentally and the IT equipment response is studied in terms of chip temperature and power, CPU utilization and total server power. It was found that failure of the cooling system is hazardous and can lead to data center shutdown in less than a minute. Additionally, the CPU frequency throttling mechanism was found to be vital to understand the change in chip temperature, power, and utilization. Other mechanisms associated with high temperatures were also observed such as the leakage power and the fans speed change. Finally, possible remedies are proposed to reduce the probability and the consequences of the cooling system failure.


Sign in / Sign up

Export Citation Format

Share Document