Adaptive and Efficient Log Parsing as a Cloud Service
By: Zeyan Li , Jie Song , Tieying Zhang and more
Potential Business Impact:
Cleans up computer messages 840% faster.
Logs are a critical data source for cloud systems, enabling advanced features like monitoring, alerting, and root cause analysis. However, the massive scale and diverse formats of unstructured logs pose challenges for adaptable, efficient, and accurate parsing methods. This paper introduces ByteBrain-LogParser, an innovative log parsing framework designed specifically for cloud environments. ByteBrain-LogParser employs a hierarchical clustering algorithm to allow real-time precision adjustments, coupled with optimizations such as positional similarity distance, deduplication, and hash encoding to enhance performance. Experiments on large-scale datasets show that it processes 229,000 logs per second on average, achieving an 840% speedup over the fastest baseline while maintaining accuracy comparable to state-of-the-art methods. Real-world evaluations further validate its efficiency and adaptability, demonstrating its potential as a robust cloud-based log parsing solution.
Similar Papers
A Word is Worth 4-bit: Efficient Log Parsing with Binary Coded Decimal Recognition
Computation and Language
Finds hidden computer problems by reading code details.
System Log Parsing with Large Language Models: A Review
Machine Learning (CS)
Helps computers understand computer error messages better.
InferLog: Accelerating LLM Inference for Online Log Parsing via ICL-oriented Prefix Caching
Software Engineering
Makes computer logs understandable much faster.