Agentic OpsBot for 24/7 Production-Support Ticket Resolution
Keywords:
OpsBot, LLMs, telemetry, incident triage, log analysis, autonomous remediation, ticket resolution, after-hours supportAbstract
The objective of this paper is to automate production-support system Agentic OpsBot handles corporate IT help questions using LLMs, system telemetry, and log analytics which describes its performance. Using scripts and contextual awareness, OpsBot solves 60% of after-hours production difficulties which includes problem identification, solution, and implementation are emphasised.
Downloads
References
A. Gulenko, M. Wallschlager, and C. Momm, "Automated IT incident ticket classification using supervised learning," 2014 IEEE International Conference on Big Data (Big Data), Washington, DC, 2014, pp. 312-317.
T. Nguyen, A. S. Namin, and J. A. Perez, "Deep learning for log data mining: An application to anomaly detection," 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, USA, 2018, pp. 5021–5023.
S. E. Coull, J. Branch, B. Szymanski, and E. Breimer, "Intrusion detection: A bioinformatics approach," IEEE Transactions on Systems, Man, and Cybernetics, Part C, vol. 39, no. 3, pp. 263–275, May 2009.
J. Oliner, A. Aiken, and J. Stearley, "Alert detection in system logs," Proceedings of the 2008 IEEE International Conference on Data Mining, Pisa, 2008, pp. 959–964.
D. Lo, L. Mariani, and M. Pezzè, "Automatic Steering of Behavioral Model Inference," Automated Software Engineering, vol. 16, no. 3-4, pp. 273–304, Sep. 2009.
A. Bagnall, P. Lines, A. Bostrom, J. Large, and E. Keogh, "The Great Time Series Classification Bake Off: A Review and Experimental Evaluation of Recent Algorithmic Advances," Data Mining and Knowledge Discovery, vol. 31, no. 3, pp. 606–660, May 2017.
A. M. Turing, "Computing Machinery and Intelligence," Mind, vol. LIX, no. 236, pp. 433–460, Oct. 1950.
J. Dean et al., "Large scale distributed deep networks," Advances in Neural Information Processing Systems, vol. 25, pp. 1223–1231, 2012.
A. Vaswani et al., "Attention Is All You Need," Advances in Neural Information Processing Systems, vol. 30, pp. 5998–6008, 2017.
D. Sculley et al., "Hidden technical debt in machine learning systems," Advances in Neural Information Processing Systems, vol. 28, pp. 2503–2511, 2015.
B. Chandramouli et al., "Trill: A High-Performance Incremental Query Processor for Diverse Analytics," Proceedings of the VLDB Endowment, vol. 8, no. 4, pp. 401–412, Dec. 2014.
A. Eckert et al., "LogClass: An unsupervised algorithm for log message classification," IEEE Transactions on Dependable and Secure Computing, vol. 15, no. 4, pp. 532–545, July-Aug. 2018.
P. Bodik, M. Goldszmidt, A. Fox, D. B. Woodard, and H. Andersen, "Fingerprinting the datacenter: Automated classification of performance crises," Proceedings of the 5th European Conference on Computer Systems (EuroSys), Paris, 2010, pp. 111–124.
M. K. Sharma, N. Lathwal, A. Verma, and A. Sharma, "Intelligent Ticketing System using NLP and Classification Techniques," 2018 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC), Madurai, India, 2018, pp. 1–5.
A. Adhikari, C. Chen, and S. Das, "LogLens: A Real-time Log Analysis System," Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA, 2019, pp. 2791–2799.
D. Breunig, A. Maedche, and A. Morana, "Designing Autonomous Agents for Digital Work: A Review and Research Agenda," Proceedings of the 40th International Conference on Information Systems (ICIS), Munich, Germany, 2019.
M. Du, F. Li, G. Zheng, and V. Srikumar, "DeepLog: Anomaly Detection and Diagnosis from System Logs through Deep Learning," Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, Dallas, TX, 2017, pp. 1285–1298.
A. Marcus, D. A. Marco, and D. P. Sbaraglia, Enterprise Systems Backup and Recovery: A Corporate Insurance Policy, Wiley, 2003.
R. R. Harmon, N. Raffo, and S. G. Faulk, "Standards and interoperability for automation of enterprise IT services," 2008 IEEE International Conference on Industrial Engineering and Engineering Management, Singapore, 2008, pp. 989–993.
C. He, J. Zhu, P. He, and M. R. Lyu, "Loghub: A large collection of system log datasets towards automated log analytics," 2018 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW), Memphis, TN, 2018, pp. 265–269.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.