DevOps Agent - Argonaut
Argonaut is an LLM-powered CLI and conversational AI tool that automatically diagnoses, recommends, and helps fix issues with ArgoCD applications — directly within a Slack channel or command line interface or Google chat.
It acts as an intelligent assistant that listens to alerts and user queries, recommends the exact commands needed to debug or resolve an ArgoCD issue, executes them (automatically or on demand), analyzes the output, and continues the diagnostic loop — all within a single Slack thread/ Google chat thread, maintaining full context throughout the conversation. Argonaut bridges the gap between ArgoCD alerts firing and engineers knowing exactly what to do next — replacing guesswork with AI-guided, executable recommendations.
Why Use Argonaut in OpsMx?
ArgoCD is the backbone of GitOps-based continuous delivery for Kubernetes applications. However, when ArgoCD applications fail — sync errors, OutOfSync states, health degradations, missing resources — engineers must manually investigate across multiple tools (kubectl, argocd CLI, git, Prometheus, Elasticsearch) to diagnose and fix the issue.
Argonaut solves this by:
Eliminating manual triage — no more guessing which ArgoCD or kubectl command to run
Closing the loop between alerts and action — Prometheus, ArgoCD, and Elasticsearch can post directly into Slack; Argonaut picks up those alerts and immediately recommends corrective commands
Keeping context across the conversation — the entire diagnostic session lives in a Slack thread, so every recommendation is informed by what came before
Enabling LLM-guided GitOps fixes — Argonaut can suggest changes to files in Git, create a PR, and merge it via shortcuts — all from Slack
Supporting both manual and automated remediation — teams can run with human approval or enable AUTO-RUN once confident
Scope for the Users
1. Instant, Actionable Diagnostics
Engineers no longer need to know every ArgoCD or kubectl command by heart. Argonaut recommends the exact command — including complex compound commands using && and | — to diagnose and fix the issue immediately.
2. Reduced Mean Time to Resolution (MTTR)
The continuous loop of recommend → execute → analyze → recommend shortens the time from "alert fired" to "issue resolved," reducing the manual investigation cycle that typically spans multiple tools and terminal sessions.
3. Context-Aware Assistance
Because the entire session lives in a Slack thread, every subsequent recommendation is informed by the full conversation history — Argonaut doesn't ask you to repeat yourself or lose track of what has already been tried.
4. Works Where Engineers Already Are
No new portal to log into. Argonaut operates directly in Slack — where alerts already arrive and where engineering teams already collaborate — making adoption frictionless.
5. GitOps Fix Automation
With GIT-FIX, GIT-PR, and GIT-MERGE, engineers can make configuration corrections, raise a PR, and merge it — all from within the same Slack thread — without switching to a code editor or GitHub UI.
6. Flexible Automation Level
Teams can start with full human approval (manual RUN) and graduate to AUTO-RUN as confidence in Argonaut's recommendations grows — giving organizations control over how much autonomy they grant the tool.
7. Full Alert-to-Fix Traceability
With ArgoCD, Prometheus, and Elasticsearch all posting into Slack, Argonaut creates a single, traceable thread from alert fired → diagnosis → commands run → fix applied — a built-in audit trail for every incident.
How It Works — The Argonaut Flow
Key conversation rules:
All Argonaut responses begin with
NAUT— to clearly identify AI responsesAll tool execution outputs begin with
TOOLArgonaut never responds to messages starting with
NAUT— preventing infinite response loopsUsers can bypass Argonaut and talk to each other by prefixing messages with
NAUTUsers can run their own commands via:
RUN argocd <your command>
Integrations
Built-in CLI Tools
Argonaut has native access to the following executables out of the box:
argocd
ArgoCD application management and diagnostics
kubectl
Kubernetes resource inspection and management
git
Repository operations
gh
GitHub CLI for PR creation and management
yq
YAML processing and editing
jq
JSON parsing and filtering
bash
Standard shell commands and compound operations
User Interface
Slack — primary interface for all interactions, alert ingestion, command recommendations, and output display
Email — notification delivery
LLM Interface
OpenAI GPT-4o and GPT-4.1 via OpenAI APIs — powers all command recommendation and output analysis
Conversation Database
File-based — lightweight, local conversation history
Elasticsearch — scalable conversation storage for larger deployments
Alert Sources (Slack-integrated)
ArgoCD — application sync and health alerts
Prometheus — infrastructure and application metric alerts
Elasticsearch — log-based alerts
Key Features & Commands
AUTO-RUN
Automatically executes Argonaut's recommended commands without requiring the user to type RUN — ideal once the team has gained confidence in Argonaut's recommendations
RUN
Manual trigger to execute the command recommended by Argonaut in the Slack thread
GIT-FIX
Shortcut to make changes to files in a Git repository
GIT-PR
Shortcut to create a pull request from the current changes
GIT-MERGE
Shortcut to merge an open pull request
SUMMARIZE
Summarizes the entire diagnostic conversation in the current Slack thread
HELP
Lists all available keywords, shortcuts, and concepts
Limitations
Configuration and performance of ArgoCD itself — server setup, ArgoCD controller tuning, infrastructure-level concerns cannot be solved by Argonaut.
Last updated