Paper Abstract and Keywords |
Presentation |
2022-05-20 13:50
Proposal for a Method of Estimating IT System Failure Locations Using Alert Scoring
Reiko Kondo, Kazutaka Ogihara, Takashi Shiraishi (FUJITSU), ICM2022-8 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
In systems where services (microservices) composed of many cooperating applications are deployed across multiple virtual and physical machines, the devices depend on one another, so a single failure triggers alerts from a wide range of devices and makes the failure location difficult to identify. Alert thresholds are set to help identify the failure location, but if a threshold is set inappropriately, the affected device cannot be identified as the failure location. Moreover, because the number of devices is large, several distinct failures can occur within a short period, and alerts originating from different failure locations may be investigated as if they belonged to the same failure.
We therefore propose a failure location estimation method that scores, from the alerts of each device, how likely that device is to be the failure location. The method has three features. First, by integrating and analyzing the dependencies between applications and infrastructure, it can estimate the failure location from alerts raised across a wide range of devices, from applications down to infrastructure. Second, by reflecting the propagation of alerts between devices in the score, it can estimate as a failure location even a device that raises no alert, for example because of a threshold setting error. Third, by grouping related alerts based on configuration information and the dependencies between devices, it is expected to be applicable to the classification of multiple failures. With the proposed technology, operators can investigate the highest-scoring device first, which is expected to shorten the time required for fault recovery. |
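The abstract does not give the scoring formula, so the following Python sketch is only an illustration of the general idea, not the authors' method. It assumes a hypothetical dependency graph (`DEPENDS_ON`), a hypothetical list of alerting components (`ALERTS`), and an invented `decay` parameter: each alert adds 1 point to its own component and a geometrically decayed share to everything that component depends on, and alerts are grouped when their components are connected through the dependency graph.

```python
from collections import defaultdict, deque

# Hypothetical configuration: each key depends on the listed components.
DEPENDS_ON = {
    "frontend": ["cart", "catalog"],
    "cart": ["node-1"],
    "catalog": ["node-1"],
    "billing": ["node-2"],
}
# Hypothetical alerts; node-1 is silent (e.g. a badly chosen threshold).
ALERTS = ["frontend", "cart", "catalog", "billing"]


def score_components(depends_on, alerts, decay=0.8):
    """Add decay**hops to every component reachable from each alert source."""
    scores = defaultdict(float)
    for source in alerts:
        distance = {source: 0}  # hop count from the alerting component
        queue = deque([source])
        while queue:
            comp = queue.popleft()
            scores[comp] += decay ** distance[comp]
            for dep in depends_on.get(comp, []):
                if dep not in distance:
                    distance[dep] = distance[comp] + 1
                    queue.append(dep)
    return dict(scores)


def group_alerts(depends_on, alerts):
    """Group alerts whose components are linked by dependencies (ignoring direction)."""
    neighbours = defaultdict(set)
    for comp, deps in depends_on.items():
        for dep in deps:
            neighbours[comp].add(dep)
            neighbours[dep].add(comp)
    groups, assigned = [], set()
    for alert in alerts:
        if alert in assigned:
            continue
        reachable, queue = {alert}, deque([alert])
        while queue:
            for nxt in neighbours[queue.popleft()]:
                if nxt not in reachable:
                    reachable.add(nxt)
                    queue.append(nxt)
        group = [a for a in alerts if a in reachable]
        assigned.update(group)
        groups.append(group)
    return groups


scores = score_components(DEPENDS_ON, ALERTS)
for comp in sorted(scores, key=scores.get, reverse=True):
    print(f"{comp}: {scores[comp]:.2f}")
print(group_alerts(DEPENDS_ON, ALERTS))
```

In this toy configuration the silent `node-1` receives the highest score because three alerting components depend on it, and the `billing` alert lands in its own group, which mirrors the second and third features described above. A real deployment would presumably derive the graph from Kubernetes and Istio configuration rather than a hand-written dictionary.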
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
Container / Operation / System configuration / Scoring / Multiple failures / Docker / Kubernetes / Istio |
Reference Info. |
IEICE Tech. Rep., vol. 122, no. 32, ICM2022-8, pp. 36-41, May 2022. |
Paper # |
ICM2022-8 |
Date of Issue |
2022-05-12 (ICM) |
ISSN |
Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Conference Information |
Committee |
ICM IPSJ-CSEC IPSJ-IOT |
Conference Date |
2022-05-19 - 2022-05-20 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
|
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
|
Paper Information |
Registration To |
ICM |
Conference Code |
2022-05-ICM-CSEC-IOT |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Proposal for a Method of Estimating IT System Failure Locations Using Alert Scoring |
Sub Title (in English) |
|
Keyword(1) |
Container |
Keyword(2) |
Operation |
Keyword(3) |
System configuration |
Keyword(4) |
Scoring |
Keyword(5) |
Multiple failures |
Keyword(6) |
Docker |
Keyword(7) |
Kubernetes |
Keyword(8) |
Istio |
1st Author's Name |
Reiko Kondo |
1st Author's Affiliation |
FUJITSU LIMITED (FUJITSU) |
2nd Author's Name |
Kazutaka Ogihara |
2nd Author's Affiliation |
FUJITSU LIMITED (FUJITSU) |
3rd Author's Name |
Takashi Shiraishi |
3rd Author's Affiliation |
FUJITSU LIMITED (FUJITSU) |
Speaker |
Author-1 |
Date Time |
2022-05-20 13:50:00 |
Presentation Time |
25 minutes |
Registration for |
ICM |
Paper # |
ICM2022-8 |
Volume (vol) |
vol.122 |
Number (no) |
no.32 |
Page |
pp.36-41 |
#Pages |
6 |
Date of Issue |
2022-05-12 (ICM) |
|