Packaging the Blue Gene/L supercomputer

2005 ◽  
Vol 49 (2.3) ◽  
pp. 213-248 ◽  
Author(s):  
P. Coteus ◽  
H. R. Bickford ◽  
T. M. Cipolla ◽  
P. G. Crumley ◽  
A. Gara ◽  
...  
Keyword(s):  
2020 ◽  
Author(s):  
Bo Zhang ◽  
Hongyu Zhang ◽  
Pablo Moscato

<div>Complex software intensive systems, especially distributed systems, generate logs for troubleshooting. The logs are text messages recording system events, which can help engineers determine the system's runtime status. This paper proposes a novel approach named ADR (stands for Anomaly Detection by workflow Relations) that employs matrix nullspace to mine numerical relations from log data. The mined relations can be used for both offline and online anomaly detection and facilitate fault diagnosis. We have evaluated ADR on log data collected from two distributed systems, HDFS (Hadoop Distributed File System) and BGL (IBM Blue Gene/L supercomputers system). ADR successfully mined 87 and 669 numerical relations from the logs and used them to detect anomalies with high precision and recall. For online anomaly detection, ADR employs PSO (Particle Swarm Optimization) to find the optimal sliding windows' size and achieves fast anomaly detection.</div><div>The experimental results confirm that ADR is effective for both offline and online anomaly detection. </div>


Author(s):  
Jan Stoess ◽  
Udo Steinberg ◽  
Volkmar Uhlig ◽  
Jens Kehne ◽  
Jonathan Appavoo ◽  
...  

2008 ◽  
Author(s):  
Sudip Seal ◽  
Michael Moody ◽  
Anna Ceguerra ◽  
Simon Ringer ◽  
Krishna Rajan ◽  
...  

2016 ◽  
Vol 43 (3S) ◽  
pp. 144-157 ◽  
Author(s):  
Takuya Nakaike ◽  
Rei Odaira ◽  
Matthew Gaudet ◽  
Maged M. Michael ◽  
Hisanobu Tomari

2013 ◽  
Vol 57 (1/2) ◽  
pp. 2:1-2:13 ◽  
Author(s):  
P. W. Coteus ◽  
S. A. Hall ◽  
T. Takken ◽  
R. A. Rand ◽  
S. Tian ◽  
...  
Keyword(s):  

Author(s):  
John W. Romein ◽  
Jan David Mol ◽  
Rob V. van Nieuwpoort ◽  
P. Chris Broekema
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document