Intelligent agents that reason, evaluate, and adapt.
Multi-agent orchestration with defense-in-depth security.
ReAct reasoning, cost-aware model routing, self-evaluation, A2A interop, WASM-sandboxed plugins, and enterprise compliance — all in Rust.
18 utility skills (calculator, JSON query, web search, prompt guard, secret scanner, diff…) plus a guardrails pipeline that blocks PII and prompt injection in real time.
16 real tool executions + 4 guardrail tests — zero API keys, zero mocks.
An Argentor agent running an automated 8-step DevOps pipeline — real tool execution, no API keys, no mocks.
Interactive player — click to pause, scroll to rewind.
Most agent frameworks trade security for flexibility, or intelligence for simplicity. Argentor gives you all three.
ReAct reasoning loops, self-evaluation with quality scoring, cost-aware model routing, and adaptive memory that learns across sessions.
WASM sandboxing, capability-based permissions, encrypted credentials, SSRF prevention, RBAC, and human-in-the-loop approval gates.
4498 tests, 0 failures. Persistent state, control plane, web dashboard, OpenTelemetry, and A2A protocol for agent interop.
| Capability | Argentor | CrewAI | AutoGPT | LangGraph |
|---|---|---|---|---|
| ReAct Reasoning Engine | ✓ | ✗ | Basic | Manual |
| Self-Evaluation Loop | ✓ | ✗ | ✗ | ✗ |
| Cost-Aware Model Routing | ✓ | ✗ | ✗ | ✗ |
| A2A Protocol (Agent Interop) | ✓ | ✗ | ✗ | ✗ |
| WASM Sandboxed Plugins | ✓ | ✗ | ✗ | ✗ |
| Multi-Agent Patterns (6+) | ✓ | 2 | ✗ | ✓ |
| Compliance (GDPR, ISO) | ✓ | ✗ | ✗ | ✗ |
| Control Plane + Dashboard | ✓ | ✗ | ✗ | Plugin |
| Memory-Safe (Rust) | ✓ | Python | Python | Python |
Not just tool calling — structured reasoning, self-correction, and cost optimization built into every agent.
Structured Think → Act → Observe → Reflect cycle. Agents decompose complex tasks into reasoning steps, track confidence per step, and know when to stop or ask for clarification.
TF-IDF relevance scoring filters tools before sending to the LLM, reducing token waste by up to 80%. Tracks per-tool success rates and adapts selection strategy automatically.
Every response is scored on 4 dimensions: relevance, consistency, completeness, and clarity. Below-threshold responses trigger automatic refinement loops before delivery.
Routes simple tasks to fast/cheap models and complex tasks to powerful ones. 7-factor complexity estimation with 4 strategies: CostOptimized, QualityOptimized, Balanced, and Tiered.
Cross-session memory that auto-extracts facts, tool patterns, and error resolutions. Importance decay over time keeps memory relevant. Keyword-based recall with configurable relevance thresholds.
Claude, GPT-4, Gemini, Ollama, Mistral, xAI, Azure OpenAI, Cerebras, Together, DeepSeek, vLLM, OpenRouter, Groq, and more. Automatic failover across backends.
14 crates, 187K+ lines of code, 50+ built-in skills, every component tested and documented.
Structured reasoning with Think/Act/Observe/Reflect cycle. Quality scoring on 4 dimensions with automatic refinement loops. Agents know when their answer isn't good enough.
IntelligenceRoute simple tasks to Haiku/GPT-4o-mini and complex ones to Opus/o1. Budget tracking, 7-factor complexity estimation, and 4 configurable routing strategies.
IntelligenceSkills run in WebAssembly (wasmtime + WASI) with capability-based permissions. No skill can escape its sandbox. SSRF prevention, path traversal blocking, and shell injection blocking.
Security10 specialized agent roles with DAG task queue, dependency resolution, inter-agent messaging via A2A MessageBus, and dynamic replanning with 6 recovery strategies.
CoreGoogle Agent-to-Agent interop via JSON-RPC 2.0. AgentCard discovery, task send/get/cancel/list, streaming SSE for real-time updates. Your agents talk to the world.
NewCentralized MCP proxy with intelligent routing (round-robin, least-loaded, pattern-based), circuit breaker, credential vault with rotation, and token pool management.
CoreHybrid BM25 + embedding vector search for long-term memory. Adaptive memory auto-extracts facts, tool patterns, and error resolutions across sessions.
IntelligenceGDPR, ISO 27001, ISO 42001, and DPGA compliance modules. Audit logging, consent tracking, bias monitoring, and automated reporting.
API scaffold generator (Rust/Axum, Python/FastAPI, Node/Express). IaC generator (Docker, Helm, Terraform, GitHub Actions). Git operations, code analysis, test runner.
17 REST API endpoints for deployment management. Web dashboard with real-time status, agent registry with 9 default definitions, health monitoring with auto-recovery.
NewPrometheus /metrics, OpenTelemetry traces, token budget tracking per agent, structured audit logging, and real-time agent monitoring with health state machine.
Deploy, scale, and monitor agents from the terminal. A2A discovery and task management. Compliance reports, config hot-reload, and skill management.
New10 new cognitive modules that make agents think deeper, recover faster, and learn continuously.
Chain-of-thought reasoning with configurable thinking budgets. Agents break complex problems into deliberate reasoning steps before acting.
ReasoningAgents review their own outputs against quality criteria, identify weaknesses, and revise before delivery. Multi-pass refinement built in.
QualityIntelligent summarization that compresses long conversations while preserving critical decisions and facts. Keeps agents coherent across long sessions.
MemoryDynamic tool registry with semantic search. Agents discover and compose tools at runtime based on task requirements, not static configuration.
ToolsStructured delegation between agents with context transfer. Agents know when to escalate, what context to pass, and how to resume.
OrchestrationSave and restore agent execution state at any point. Enables rollback, branching, and resumption of interrupted workflows.
ReliabilityFull execution traces with reasoning steps, tool calls, and decision points. Export to OpenTelemetry or view in the built-in dashboard.
ObservabilityAgents generate new tools on-the-fly from natural language descriptions. Created tools are sandboxed, tested, and optionally persisted for reuse.
ToolsStep-level reward signals that evaluate each reasoning step, not just the final answer. Guides agents toward better intermediate decisions.
ReasoningContinuous improvement from execution outcomes. Agents store success and failure patterns, adapting their strategies across sessions.
LearningFirst-class SDKs for Python and TypeScript. Build agents in your language of choice with full type safety.
Security is not a feature — it is the architecture. Every layer enforces isolation, authentication, and auditability.
Orchestrator-Workers pattern with intelligent core, centralized MCP proxy, A2A interop, and full control plane.
Google's Agent-to-Agent protocol for cross-platform agent communication. Your Argentor agents can discover, delegate, and collaborate with any A2A-compatible agent.
argentor a2a discover --url http://agentChoose the right multi-agent pattern for your task. Mix and match within a single orchestration.
Sequential stages. Each agent transforms and passes output to the next. Ideal for linear workflows.
Fan out to parallel workers, then aggregate results. Great for divide-and-conquer problems.
Agents argue opposing positions. A judge agent synthesizes the best answer from the discourse.
Multiple agents solve independently. Results are combined via voting, ranking, or weighted merge.
A supervisor agent monitors workers, re-assigns tasks on failure, and enforces quality gates.
Fully decentralized. Agents self-organize via a shared message bus with emergent coordination.
Generate production-ready scaffolds, infrastructure, and CI/CD pipelines from agent conversations.
Know exactly what your agents are doing, how much they cost, and when they need attention.
Built-in /metrics endpoint. Track latency, tokens, errors, and agent health in Grafana.
Distributed tracing with OTLP export. Instrument the full request path from gateway to tool execution.
Per-agent token budgets with real-time estimation. Track spend across 14 providers with cost-aware routing.
Every tool call, LLM request, and decision logged as structured JSONL. Query API with time-range filters.
Modular architecture. Use only what you need, or the full stack.
| Crate | Description |
|---|---|
| argentor-core | Core types, errors, Message, ToolCall, ToolResult, OpenTelemetry integration |
| argentor-agent | Agent runner, ReAct engine, self-evaluator, model router, tool selector, adaptive memory, 14 LLM backends, failover, streaming |
| argentor-security | Capabilities, RBAC, rate limiting, SSRF/path traversal prevention, audit, TLS/mTLS, encrypted store |
| argentor-orchestrator | Multi-agent engine, 6 collaboration patterns, DAG task queue, deployment manager, health checker, agent registry |
| argentor-mcp | MCP client, server, proxy with credential vault, token pool, circuit breaker, intelligent routing |
| argentor-a2a | Google A2A protocol: JSON-RPC 2.0 server/client, AgentCard, task management, SSE streaming |
| argentor-skills | Skill trait, WASM sandbox runtime (wasmtime + WASI), plugin registry, vetting pipeline |
| argentor-memory | Semantic vector memory, hybrid BM25 + embedding search, query expansion, JSONL persistence |
| argentor-builtins | Built-in skills: shell, file I/O, HTTP fetch, memory, browser automation, Docker sandbox, code gen |
| argentor-gateway | HTTP/WS gateway, REST API, control plane (17 endpoints), web dashboard, proxy management, Prometheus metrics |
| argentor-channels | Channel bridges for Slack, Discord, Telegram, and webchat integration |
| argentor-session | Session management, file and database persistence, conversation transcripts |
| argentor-compliance | GDPR, ISO 27001, ISO 42001, DPGA modules with automated reporting and hooks |
| argentor-cli | CLI binary: serve, deploy, agents, health, a2a, skill, compliance — full control from the terminal |
Up and running in under a minute. The demo requires no API keys.