Speeding up incident response with DagKnows (Part 2)
- Published Oct 9, 2024
- In this video, I demonstrate how to create a basic diagnostic workflow using DagKnows AI and trigger it upon receiving a Grafana alert. Here’s what’s covered:
- Setting up a webhook in Grafana
- Introducing an error in the demo app to fire an alert
- Interacting with the AI agent to handle the alert:
  - Create or fetch a Jira ticket
  - Get the k8s cluster and pod statuses and add them to the ticket
  - Get logs from ELK and add them to the ticket
  - Conditionally restart pods and record the result in the ticket
- Creating a reusable workflow from the AI agent interaction
- Re-executing the workflow automatically on the next alert
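
The diagnostic steps listed above can be sketched in plain Python. This is only an illustration of the flow, not DagKnows' generated workflow or its API: the `Ticket` class, `firing_alerts`, `run_diagnostics`, and the injected callables (`get_pod_statuses`, `get_logs`, `restart_pod`) are all hypothetical names, and the payload is assumed to follow the Grafana webhook shape with an `alerts` list whose entries carry `status` and `labels`.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Ticket:
    """In-memory stand-in for a Jira ticket (hypothetical, not the Jira API)."""
    key: str
    comments: List[str] = field(default_factory=list)


def firing_alerts(payload: dict) -> List[dict]:
    """Return the alerts in a webhook payload whose status is 'firing'."""
    return [a for a in payload.get("alerts", []) if a.get("status") == "firing"]


def run_diagnostics(
    alert: dict,
    tickets: Dict[str, Ticket],
    get_pod_statuses: Callable[[], Dict[str, str]],
    get_logs: Callable[[str], List[str]],
    restart_pod: Callable[[str], None],
) -> Ticket:
    """Run the diagnostic steps for one alert and record them in a ticket."""
    name = alert.get("labels", {}).get("alertname", "unknown")

    # Step 1: create the ticket, or fetch the existing one for this alert name.
    if name not in tickets:
        tickets[name] = Ticket(key=f"DEMO-{len(tickets) + 1}")
    ticket = tickets[name]

    # Step 2: collect pod statuses and add them to the ticket.
    statuses = get_pod_statuses()
    ticket.comments.append(f"pod statuses: {statuses}")

    # Step 3: collect logs and add them to the ticket.
    ticket.comments.append(f"logs: {get_logs(name)}")

    # Step 4: restart only pods that are not running, and record each restart.
    for pod, status in statuses.items():
        if status != "Running":
            restart_pod(pod)
            ticket.comments.append(f"restarted pod {pod} (was {status})")

    return ticket
```

In a real setup the three callables would wrap the Kubernetes, ELK, and Jira clients, and a small HTTP endpoint registered as the Grafana webhook would pass each firing alert to `run_diagnostics`. Keeping them as injected functions means the flow can be exercised without a cluster.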