DSpace
 

University of Jos Institutional Repository >
Natural Sciences >
Computer Science >

Please use this identifier to cite or link to this item: http://hdl.handle.net/123456789/2819

Title: Identifying Recovery Patterns from Resource Usage Data of Cluster Systems
Authors: Gurumdimma, Nentawe
Bibu, Gideon Dadik
Bisandu, Desmond Bala
Alams, Mammuan Titus
Keywords: Change point detection
recovery sequence
detection
large-scale HPC systems
Issue Date: 2018
Publisher: Science World Journal
Series/Report no.: Vol.13;No.4; Pp 87-94
Abstract: Failure of Cluster Systems has proven to be of adverse effect and it can be costly. System administrators have employed divide and conquer approach to diagnosing the root-cause of such failure in order to take corrective or preventive measures. Most times, event logs are the source of the information about the failures. Events that characterized failures are then noted and categorized as causes of failure. However, not all the ’causative’ events lead to eventual failure, as some faults sequence experience recovery. Such sequences or patterns constitute challenge to system administrators and failure prediction tools as they add to false positives. Their presence are always predicted as “failure causing“, while in reality, they will not. In order to detect such recovery patterns of events from failure patterns, we proposed a novel approach that utilizes resource usage data of cluster systems to identify recovery and failure sequences. We further propose an online detection approach to the same problem. We experiment our approach on data from Ranger Supercomputer System and the results are positive.
URI: http://hdl.handle.net/123456789/2819
ISSN: 1597-6343
Appears in Collections:Computer Science

Files in This Item:

File Description SizeFormat
18907-Article Text-76145-1-10-20181230.pdf404.82 kBAdobe PDFView/Open
View Statistics

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback