Speeding up incident response with DagKnows (Part 2)

Поділитися
Вставка
  • Опубліковано 9 жов 2024
  • In this video, I demonstrate how to create a basic diagnostic workflow using DagKnows AI and trigger it upon receiving a Grafana alert. Here’s what’s covered:
    - Setting up a webhook in Grafana
    - Introducing an error in the demo app to fire an alert
    - Interact with AI agent to deal with the alert
    Create or fetch a Jira ticket
    Get k8s cluster and pod statuses and dump in the ticket
    Get logs from ELK and add to the ticket
    Conditionally restart pods and add the info to the ticket
    - Create a reusable workflow from the AI agent interaction
    - Re-execute the workflow automatically on the next alert.

КОМЕНТАРІ •