agent autonomy

Squad for auditing, creating, diagnosing, and optimizing autonomous agents. Applies frameworks such as Weng's 3 Pillars, ACI, ReAct/Reflexion, and IMPACT to ensure real agent autonomy.

6 agents, 7 tasks, 2 workflows

Agent Autonomy Squad

Squad for auditing, creating, diagnosing, and optimizing autonomous agents. Based on 14 elite minds from AI agent research and practice.

Quick Start

text
/agent-autonomy:chief         Activate the orchestrator (automatic triage)
/agent-autonomy:audit         Audit an existing agent
/agent-autonomy:create        Create an autonomous agent
/agent-autonomy:diagnose      Diagnose autonomy failures
/agent-autonomy:teach         Teach reasoning patterns
/agent-autonomy:find          Search for libs/repos/benchmarks

Architecture

text
Orchestrator    autonomy-chief ──────── triage + routing

Tier 0          autonomy-auditor ────── diagnosis + scoring (3 Pillars + L1-L5)

Tier 1          agent-architect ─────── design + optimization (ACI, Context Engineering)
                reasoning-engineer ──── patterns + teaching (ReAct, Reflexion, ToT, LATS)

Tier 2          tool-smith ──────────── build tools/scripts/docs (IMPACT, Lethal Trifecta)
                ecosystem-scout ─────── research open-source (Exa MCP, evaluation rubric)

Use Cases

| ID | Use Case | Primary Agent |
|---|---|---|
| UC1 | Audit an existing agent | autonomy-auditor |
| UC2 | Create an autonomous agent | agent-architect |
| UC3 | Diagnose autonomy failures | autonomy-auditor |
| UC4 | Optimize an existing agent | agent-architect + reasoning-engineer |
| UC5 | Suggest tools to build | tool-smith |
| UC6 | Recommend open-source repos | ecosystem-scout |
| UC7 | Teach HOW the agent should act | reasoning-engineer |
| UC8 | Deterministic vs probabilistic analysis | autonomy-auditor + agent-architect |
| UC9 | Build auxiliary .md docs | tool-smith |
| UC10 | Build auxiliary scripts | tool-smith |
| UC11 | Find Python libraries for autonomy | ecosystem-scout |
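The use-case table above amounts to a lookup from use-case ID to primary agent(s). A minimal sketch of that lookup follows. The names `PRIMARY_AGENTS` and `route` are hypothetical, and the fallback to `autonomy-chief` for unknown IDs is an assumption based on its triage role, not documented squad behavior.

```python
# Hypothetical routing table: use-case ID -> primary agent(s), copied from the table above.
PRIMARY_AGENTS = {
    "UC1": ["autonomy-auditor"],
    "UC2": ["agent-architect"],
    "UC3": ["autonomy-auditor"],
    "UC4": ["agent-architect", "reasoning-engineer"],
    "UC5": ["tool-smith"],
    "UC6": ["ecosystem-scout"],
    "UC7": ["reasoning-engineer"],
    "UC8": ["autonomy-auditor", "agent-architect"],
    "UC9": ["tool-smith"],
    "UC10": ["tool-smith"],
    "UC11": ["ecosystem-scout"],
}


def route(use_case_id: str) -> list[str]:
    """Return the primary agent(s) for a use case.

    Assumption: unknown IDs fall back to the orchestrator for triage.
    """
    return PRIMARY_AGENTS.get(use_case_id.upper(), ["autonomy-chief"])
```

For example, `route("UC4")` yields both `agent-architect` and `reasoning-engineer`, matching the two primary agents listed for that row.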

Frameworks Core

| Framework | Author | Use in the Squad |
|---|---|---|
| 3 Pillars (Planning, Memory, Tool Use) | Lilian Weng | Autonomy diagnosis |
| ACI (Agent-Computer Interface) | Erik Schluntz | Tool design |
| Autonomy Levels L1-L5 | Knight Institute | Agent classification |
| ReAct / Reflexion / ToT / LATS | Yao, Shinn, Zhou | Reasoning patterns |
| IMPACT | swyx | Agent engineering |
| Lethal Trifecta | swyx | Security check |
| Context Engineering | Harrison Chase | Context window optimization |
| Rule Maker Pattern | Schluntz | Deterministic vs probabilistic separation |

File Structure

text
squads/agent-autonomy/
├── agents/
│   ├── autonomy-chief.md          Orchestrator: triage and routing
│   ├── autonomy-auditor.md        Tier 0: diagnosis and scoring
│   ├── agent-architect.md         Tier 1: design and optimization
│   ├── reasoning-engineer.md      Tier 1: reasoning patterns
│   ├── tool-smith.md              Tier 2: tool construction
│   └── ecosystem-scout.md         Tier 2: open-source research
├── tasks/
│   ├── audit-agent.md             Audit an agent (9 criteria + 4 failure modes)
│   ├── create-autonomous-agent.md Create an L3+ agent
│   ├── diagnose-autonomy-failure.md Diagnosis with 5 Whys
│   ├── optimize-agent.md          Optimize by impact score
│   ├── suggest-tools.md           Find/build ACI tools
│   ├── search-ecosystem.md        Search for libs with a rubric
│   └── teach-reasoning.md         Teach reasoning patterns
├── workflows/
│   ├── audit-optimize-cycle.md    Audit → optimize cycle (max 3 iterations)
│   └── create-agent-flow.md       Complete creation flow
├── checklists/
│   └── autonomy-checklist.md      18 items, thresholds for L3/L4/L5
├── data/
│   └── agent-autonomy-kb.md       Consolidated knowledge base
├── config.yaml                    Squad configuration
├── README.md
└── CHANGELOG.md
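
The `audit-optimize-cycle.md` workflow listed above is described only as an audit → optimize loop capped at 3 iterations. A minimal sketch of such a loop follows. The function name, the `audit` and `optimize` callables, and the default target of 13 (borrowed from the checklist's L3+ threshold) are all assumptions for illustration; the workflow file itself may define the stopping rule differently.

```python
from typing import Any, Callable

MAX_ITERATIONS = 3  # cap stated for audit-optimize-cycle.md


def audit_optimize_cycle(
    agent: Any,
    audit: Callable[[Any], int],
    optimize: Callable[[Any], Any],
    target: int = 13,  # assumption: checklist items needed for L3+
) -> tuple[Any, int, int]:
    """Audit the agent, then optimize and re-audit while the score is below target.

    Stops after MAX_ITERATIONS optimize passes at most.
    Returns (agent, final score, optimize passes used).
    """
    score = audit(agent)
    iterations = 0
    while score < target and iterations < MAX_ITERATIONS:
        agent = optimize(agent)
        score = audit(agent)
        iterations += 1
    return agent, score, iterations
```

Capping the loop keeps a stubborn agent from being optimized indefinitely; the caller can inspect the returned score to decide whether to escalate.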

Quality Gates

| ID | Name | Owner | Type |
|---|---|---|---|
| QG-001 | Request Classification | autonomy-chief | routing |
| QG-002 | Diagnosis Complete | autonomy-auditor | blocking |
| QG-003 | Architecture Review | agent-architect | blocking |
| QG-004 | Reasoning Validated | reasoning-engineer | blocking |
| QG-005 | Tool Quality | tool-smith | blocking |
| QG-006 | Final Validation | autonomy-chief | blocking |
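
One way to read the gate table is as an ordered pipeline in which a failed blocking gate halts progress. The sketch below models that reading. The `Gate` dataclass, `run_gates`, and the `check` callback are hypothetical, and the semantics (routing gates never halt, blocking gates halt on failure) are an assumption, since the table names the types without defining them.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Gate:
    gate_id: str
    name: str
    owner: str
    kind: str  # "routing" or "blocking", as listed in the table


GATES = [
    Gate("QG-001", "Request Classification", "autonomy-chief", "routing"),
    Gate("QG-002", "Diagnosis Complete", "autonomy-auditor", "blocking"),
    Gate("QG-003", "Architecture Review", "agent-architect", "blocking"),
    Gate("QG-004", "Reasoning Validated", "reasoning-engineer", "blocking"),
    Gate("QG-005", "Tool Quality", "tool-smith", "blocking"),
    Gate("QG-006", "Final Validation", "autonomy-chief", "blocking"),
]


def run_gates(check: Callable[[Gate], bool]) -> tuple[bool, list[str]]:
    """Evaluate gates in order and stop at the first failed blocking gate.

    Returns (passed, IDs of the gates that were evaluated).
    Assumption: a failed routing gate does not halt the pipeline.
    """
    evaluated: list[str] = []
    for gate in GATES:
        evaluated.append(gate.gate_id)
        if not check(gate) and gate.kind == "blocking":
            return False, evaluated
    return True, evaluated
```

Here `check` stands in for whatever evaluation each gate's owner performs; the list of evaluated IDs shows where a run stopped.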

Autonomy Checklist (summary)

18 items across 5 categories:

  • Planning (weight 0.35): Task Decomposition, Self-Reflection, Goal Persistence
  • Memory (weight 0.30): Working Memory, Long-Term Memory, Cross-Agent Memory
  • Tool Use (weight 0.35): Tool Coverage, Tool Quality (ACI), Error Recovery
  • Failure Modes: Context Saturation, Tool Brittleness, Reasoning Drift, Evaluator Absence
  • General Autonomy: 80%+ of tasks completed without intervention, deterministic vs probabilistic logic separated, halt condition, escalation criteria, security check

Thresholds: >= 13/18 for L3+, >= 15/18 for L4+, >= 17/18 for L5.
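
The thresholds and pillar weights above can be expressed as two small helpers. This is a sketch under stated assumptions: the function names are hypothetical, the label "below L3" is invented for counts under 13, and the summary does not say how the pillar weights combine with the 18-item count, so the weighted score is shown as a separate calculation over per-pillar scores in the range 0 to 1.

```python
# Pillar weights as given in the checklist summary.
PILLAR_WEIGHTS = {"planning": 0.35, "memory": 0.30, "tool_use": 0.35}


def autonomy_level(items_passed: int) -> str:
    """Map the number of checklist items passed (0-18) to a level label."""
    if not 0 <= items_passed <= 18:
        raise ValueError("items_passed must be between 0 and 18")
    if items_passed >= 17:
        return "L5"
    if items_passed >= 15:
        return "L4+"
    if items_passed >= 13:
        return "L3+"
    return "below L3"  # hypothetical label; the summary names no lower band


def weighted_pillar_score(pillar_scores: dict[str, float]) -> float:
    """Weighted sum of per-pillar scores.

    Assumption: each pillar score is normalized to the range 0 to 1.
    """
    return sum(PILLAR_WEIGHTS[p] * pillar_scores[p] for p in PILLAR_WEIGHTS)
```

Because the three weights sum to 1.0, a perfect score on every pillar gives a weighted score of 1.0 (up to floating-point rounding).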

Elite Minds (14)

| Mind | Contribution |
|---|---|
| Lilian Weng | 3 Pillars (Planning, Memory, Tool Use) |
| Erik Schluntz | ACI: 5 tool design principles |
| Harrison Chase | Context Engineering |
| Shunyu Yao | ReAct, Tree of Thoughts |
| Noah Shinn | Reflexion |
| Andy Zhou | LATS (Language Agent Tree Search) |
| Andrew Ng | 4 Agentic Design Patterns, Eval-Driven Dev |
| João Moura | CrewAI, Multi-Agent Orchestration |
| Beth Barnes | METR: Agent Evaluation |
| Shawn Wang (swyx) | IMPACT Framework, Lethal Trifecta |
| Simon Willison | Agentic Engineering Patterns |
| Chi Wang | AutoGen, Multi-Agent Conversations |
| Yohei Nakajima | BabyAGI, Task-Driven Agents |
| Karthik Narasimhan | WebArena, Agent Benchmarks |

Validation

yaml
validated: true
score: 7.4/10
date: 2026-03-01
type: Expert Squad
result: PASS
