Logs have been widely employed to ensure the reliability of distributed systems, because logs are often the only data available that records system runtime information. Compared with logs generated by traditional standalone systems, distributed system logs are often large-scale and of great complexity, invalidating many existing log management methods. To address this problem, the paper describes and envisions an end-to-end log management framework for distributed systems. Specifically, this framework includes strategic logging placement, log collection, log parsing, interleaved logs mining, anomaly detection, and problem identification.
Xiaoying BaiWei‐Tek TsaiRobin PaulTecheng ShenBing Li
Y. K. KimSung-Hwan LeeYonghyun KimChung-Kil Hur
Denise J. EcklundVera GoebelThomas PlagemannEarl F. Ecklund
Denise J. EcklundVera GoebelThomas PlagemannEarl F. Ecklund