agent autonomy

Squad para auditar, criar, diagnosticar e otimizar agentes autônomos. Aplica frameworks como Weng's 3 Pillars, ACI, ReAct/Reflexion, e IMPACT para garantir autonomia real dos agentes.

14installs
SAFE
Squads are published by third parties. squads.sh does not guarantee their safety or functionality. Use at your own risk. Read Terms
6Agents7Tasks2Workflows

Agent Autonomy Squad

Squad para auditar, criar, diagnosticar e otimizar agentes autônomos. Baseado em 14 elite minds da pesquisa e prática de agentes de IA.

Quick Start

text
/agent-autonomy:chief        Ativa o orchestrador (triage automático)
/agent-autonomy:audit         Auditar um agente existente
/agent-autonomy:create        Criar agente autônomo
/agent-autonomy:diagnose      Diagnosticar falhas de autonomia
/agent-autonomy:teach         Ensinar reasoning patterns
/agent-autonomy:find          Pesquisar libs/repos/benchmarks

Arquitetura

text
Orchestrator    autonomy-chief ──────── triage + routing

Tier 0          autonomy-auditor ────── diagnóstico + scoring (3 Pilares + L1-L5)

Tier 1          agent-architect ─────── design + otimização (ACI, Context Engineering)
                reasoning-engineer ──── patterns + teaching (ReAct, Reflexion, ToT, LATS)

Tier 2          tool-smith ──────────── build tools/scripts/docs (IMPACT, Lethal Trifecta)
                ecosystem-scout ─────── research open-source (Exa MCP, evaluation rubric)

Use Cases

IDUse CaseAgente Primário
UC1Auditar agente existenteautonomy-auditor
UC2Criar agente autônomoagent-architect
UC3Diagnosticar falhas de autonomiaautonomy-auditor
UC4Otimizar agente existenteagent-architect + reasoning-engineer
UC5Sugerir construção de toolstool-smith
UC6Recomendar repos open-sourceecosystem-scout
UC7Ensinar COMO o agente deve atuarreasoning-engineer
UC8Análise determinístico vs probabilísticoautonomy-auditor + agent-architect
UC9Construção de docs .md auxiliarestool-smith
UC10Construção de scripts auxiliarestool-smith
UC11Buscar bibliotecas Python para autonomiaecosystem-scout

Frameworks Core

FrameworkAutorUso no Squad
3 Pilares (Planning, Memory, Tool Use)Lilian WengDiagnóstico de autonomia
ACI (Agent-Computer Interface)Erik SchluntzDesign de tools
Autonomy Levels L1-L5Knight InstituteClassificação de agentes
ReAct / Reflexion / ToT / LATSYao, Shinn, ZhouReasoning patterns
IMPACTswyxAgent engineering
Lethal TrifectaswyxSecurity check
Context EngineeringHarrison ChaseOtimização de context window
Rule Maker PatternSchluntzSeparação det vs prob

Estrutura de Arquivos

text
squads/agent-autonomy/
├── agents/
│   ├── autonomy-chief.md          Orchestrador — triage e routing
│   ├── autonomy-auditor.md        Tier 0 — diagnóstico e scoring
│   ├── agent-architect.md         Tier 1 — design e otimização
│   ├── reasoning-engineer.md      Tier 1 — reasoning patterns
│   ├── tool-smith.md              Tier 2 — construção de tools
│   └── ecosystem-scout.md         Tier 2 — pesquisa open-source
├── tasks/
│   ├── audit-agent.md             Auditar agente (9 critérios + 4 FMs)
│   ├── create-autonomous-agent.md Criar agente L3+
│   ├── diagnose-autonomy-failure.md Diagnóstico com 5 Whys
│   ├── optimize-agent.md          Otimizar por impact score
│   ├── suggest-tools.md           Buscar/construir tools ACI
│   ├── search-ecosystem.md        Pesquisar libs com rubric
│   └── teach-reasoning.md         Ensinar patterns de reasoning
├── workflows/
│   ├── audit-optimize-cycle.md    Ciclo audit → optimize (max 3 iter)
│   └── create-agent-flow.md       Fluxo completo de criação
├── checklists/
│   └── autonomy-checklist.md      18 items — threshold L3/L4/L5
├── data/
│   └── agent-autonomy-kb.md       Knowledge base consolidada
├── config.yaml                    Configuração do squad
├── README.md
└── CHANGELOG.md

Quality Gates

IDNomeOwnerTipo
QG-001Request Classificationautonomy-chiefrouting
QG-002Diagnosis Completeautonomy-auditorblocking
QG-003Architecture Reviewagent-architectblocking
QG-004Reasoning Validatedreasoning-engineerblocking
QG-005Tool Qualitytool-smithblocking
QG-006Final Validationautonomy-chiefblocking

Autonomy Checklist (resumo)

18 items across 5 categorias:

  • Planning (peso 0.35): Task Decomposition, Self-Reflection, Goal Persistence
  • Memory (peso 0.30): Working Memory, Long-Term Memory, Cross-Agent Memory
  • Tool Use (peso 0.35): Tool Coverage, Tool Quality (ACI), Error Recovery
  • Failure Modes: Context Saturation, Tool Brittleness, Reasoning Drift, Evaluator Absence
  • Autonomia Geral: 80%+ tasks sem intervenção, det vs prob separados, halt condition, escalation criteria, security check

Thresholds: >= 13/18 para L3+, >= 15/18 para L4+, >= 17/18 para L5.

Elite Minds (14)

MenteContribuição
Lilian Weng3 Pilares (Planning, Memory, Tool Use)
Erik SchluntzACI — 5 princípios de tool design
Harrison ChaseContext Engineering
Shunyu YaoReAct, Tree of Thoughts
Noah ShinnReflexion
Andy ZhouLATS (Language Agent Tree Search)
Andrew Ng4 Agentic Design Patterns, Eval-Driven Dev
João MouraCrewAI, Multi-Agent Orchestration
Beth BarnesMETR — Agent Evaluation
Shawn Wang (swyx)IMPACT Framework, Lethal Trifecta
Simon WillisonAgentic Engineering Patterns
Chi WangAutoGen, Multi-Agent Conversations
Yohei NakajimaBabyAGI, Task-Driven Agents
Karthik NarasimhanWebArena, Agent Benchmarks

Validação

yaml
validated: true
score: 7.4/10
date: 2026-03-01
type: Expert Squad
result: PASS

Reviews

0 reviews

Write a review

No reviews yet. Be the first to review this squad!