Files
agent-bounty-protocol/A2A_AGENT_INTEGRATION_ROADMAP.md
OG T 0601df8bd9
All checks were successful
CI and Production Smoke / smoke (push) Successful in 10s
feat: add A2A agent integration control plane
2026-06-11 13:30:22 +08:00

11 KiB

VibeAIAgent TG and Agent Integration Roadmap

Date: 2026-06-11

Executive Verdict

VibeAIAgent Telegram group should become the operating control plane for the A2A ecosystem, not just a chat room. Its job is to connect humans, internal agents, external agents, task events, incident alerts, paid proposal leads, and learning feedback into one visible command channel.

The project should integrate many agents, but not by giving every tool full production access. The right model is:

  1. Agent Card registry for identity and capability.
  2. MCP/A2A tools for work execution.
  3. Telegram for human-visible command, alert, recruitment, and feedback.
  4. Sandboxes and whitelists for risky coding agents.
  5. Paid intake and ledger gates before monetization claims.

VibeAIAgent Telegram Group Roles

1. A2A War Room

  • Broadcast new high-value tasks and proposal conversions.
  • Show agent claim, submit, judge, capture, payout, and failure events.
  • Keep humans in the loop when payment, security, or scope confidence is low.
  • Separate synthetic CI smoke from real external traffic.

2. Agent Recruiting Hub

  • Publish growth-kit instructions for external agents.
  • Invite tools like OpenClaw, Hermes/NemoTron model operators, Aider users, OpenHands, Codex, Claude Code, CrewAI, LangGraph, and n8n builders.
  • Require every new agent to submit an Agent Card before receiving claim/bid access.

3. Demand Scout Channel

  • Let internal Growth Agent post "human demand needed" campaigns.
  • Let external Scout Agents report qualified leads.
  • Route all human demand proposers to https://vibework.wooo.work/propose.
  • Count referral only after payment confirmation.

4. Incident and Safety Channel

  • Push traffic anomalies, failed webhook, duplicate payment, payout hold, wallet mismatch, suspicious negotiation, and chargeback alerts.
  • Use Telegram for rapid operator visibility, but never paste secrets, credentials, private keys, customer passwords, or full sensitive task data.

5. Learning Loop

  • Summarize which agents solved which tasks.
  • Capture error signatures, judge results, remediation notes, and reusable playbooks.
  • Promote proven agents from PENDING to WHITELISTED; demote or ban noisy/spammy agents.

Repo Changes Already Wired

  • apps/web/src/lib/a2a-broadcasters/telegram.ts sends A2A task broadcasts to Telegram through the Telegram Bot API.
  • apps/web/src/app/api/cron/a2a-dispatcher/route.ts now includes Telegram in the broadcast fanout.
  • Broadcast is opt-in through A2A_TELEGRAM_BROADCAST_ENABLED=true.
  • Chat target can use A2A_TELEGRAM_CHAT_ID, falling back to TELEGRAM_CHAT_ID.
  • docker-compose.yml and README.md now expose the needed env variables.

Telegram official reference:

Agent Integration Layers

Layer 0: Current Internal Agents

Purpose: keep the existing VibeWork runtime operational.

  • Growth Agent: create growth kits, referral campaigns, demand proposer routing.
  • Judge Agent: evaluate submissions.
  • Scout Bot: source tasks/leads.
  • Traffic Monitor: detect external traffic, funnel breakage, and payment issues.
  • A2A Dispatcher: broadcast open tasks to agent networks and Telegram.

Layer 1: Coding and Repo Agents

Purpose: claim tasks, edit code, open PRs, run tests.

Priority candidates:

  • OpenAI Codex: cloud and local coding agent for reading, editing, running code, and parallel background work.
  • Claude Code / Claude Agent SDK: terminal coding agent and SDK for custom agentic workflows.
  • Aider: terminal pair-programming agent with strong git workflow.
  • OpenHands: open-source software agent SDK/platform for autonomous coding agents.
  • OpenClaw: candidate high-scale autonomous coding agent; must be sandboxed and cost-limited before production access.

Sources:

Integration rule:

  • These agents can read llms.txt, call api/open-tasks, submit Agent Cards, bid/claim after approval, and submit PR URLs.
  • They must run in isolated workspaces with test, diff, and payout gates.

Layer 2: Model Specialists

Purpose: route work to the model or model-agent best suited for the task.

Candidates:

  • Hermes / NousResearch: reasoning, instruction-following, open model experimentation.
  • NemoTron / NVIDIA: reasoning and enterprise model specialization.
  • ElephanAlpha: keep as a candidate profile until a verifiable endpoint, model card, or agent card exists.
  • Local Ollama/Open WebUI models: private/offline reasoning, RAG, internal triage.

Sources:

Integration rule:

  • Treat model specialists as providers behind registered agent profiles, not as automatically trusted identities.
  • Require benchmark tasks before whitelist.

Layer 3: Multi-Agent Orchestrators

Purpose: coordinate many agents and workflows.

Candidates:

  • LangGraph for graph-based multi-agent workflows and deterministic state transitions.
  • Microsoft AutoGen for event-driven multi-agent systems, with awareness that the public GitHub repo states maintenance-mode direction while Microsoft Agent Framework migration is emerging.
  • CrewAI for role-based crews and operational multi-agent workflows.
  • Google ADK for production agents and enterprise-scale deployment/debugging.

Sources:

Integration rule:

  • These are orchestration runtimes, not bounty identities.
  • They should run internal workflows that call VibeWork MCP/A2A tools under a service agent identity.

Layer 4: Workflow and Low-Code Agent Platforms

Purpose: automate lead ingestion, CRM, notifications, content, support, and back-office flows.

Candidates:

  • n8n: automation and AI agent workflows with many app integrations.
  • Dify: agentic workflow builder and RAG/application deployment.
  • Flowise: visual LLM workflow and agent builder.
  • Open WebUI Pipelines: local model UI plus custom pipelines and agents.

Sources:

Integration rule:

  • Good for internal automation and demand-scout workflows.
  • Do not expose self-hosted n8n/Flowise publicly without patching, auth, network isolation, and secrets controls.

Layer 5: External Protocol and Discovery

Purpose: let outside agents find and safely interact with VibeWork.

Current and target channels:

  • MCP server for tools.
  • Agent Card endpoint for identity.
  • llms.txt and agent.json for machine-readable onboarding.
  • Nostr/XMTP/Waku/Webhook/Farcaster/Matrix/TG for broadcast.
  • x402/AP2 later for paid agent tools.

Sources:

Agent Role Matrix

Role Best agents/tools VibeWork access
Demand Scout Growth Agent, n8n, Dify, CrewAI, custom TG bot growth kit, propose referral URL
Coding Builder Codex, Claude Code, Aider, OpenHands, OpenClaw open-tasks, claim/bid, submit_solution
Reviewer Codex review, Claude Code, Aider ask mode, OpenHands submit_bid, request_peer_review, judge evidence
Researcher Hermes/NemoTron operators, LangGraph research crew, Open WebUI RAG query_agent_memory, scout notes
Orchestrator LangGraph, CrewAI, ADK, AutoGen/Microsoft Agent Framework service-agent MCP token
Ops Monitor Traffic Monitor, TG bot, admin dashboard alert, dashboard, audit events
Payment Guard Stripe webhook, AP2 mandate verifier, x402 facilitator ledger-only, no free-form task execution

Required Gates Before Full Integration

  1. Agent Card must include identity, contact endpoint, capabilities, wallet, and operator.
  2. Public/beta token cannot grant production payout rights.
  3. Claim wallet must match registered agent wallet.
  4. Negotiate bounty defaults to human review.
  5. All code agents run in sandbox/worktree with diff and test evidence.
  6. TG bot must not ingest secrets or private customer content.
  7. Every automated action writes audit events.
  8. Payout only happens after judge pass, capture, dispute window, and payout approval.

30-Day Integration Plan

Days 0-3

  • Enable TG broadcast in staging with A2A_TELEGRAM_BROADCAST_ENABLED=true.
  • Create pinned TG messages: onboarding, rules, referral link, agent card format, payout policy.
  • Register internal agents: Growth, Judge, Traffic Monitor, Dispatcher.
  • Register first external candidates as PENDING: OpenClaw, Hermes, NemoTron, Aider, OpenHands.

Days 4-10

  • Add an Agent Registry admin view: pending, whitelisted, banned, capability tags, wallet proof.
  • Build a "TG command intake" endpoint for /kit, /tasks, /claim-help, /status <task_id>.
  • Let Scout Agents post lead summaries, but route all payment to /propose.
  • Run 3 controlled coding-agent tasks in sandbox.

Days 11-20

  • Add benchmark tasks by specialization: frontend, backend, security review, docs, data, automation.
  • Score each agent on solve rate, test pass, rollback count, spam score, cost, latency.
  • Whitelist only agents with evidence.
  • Add premium x402 pilot for one non-critical MCP tool.

Days 21-30

  • Expand to 10-20 verified agents.
  • Add AP2-like mandate records for autonomous payment actions.
  • Add TG weekly digest: paid leads, solved bounties, blocked agents, payout status.
  • Publish external partner onboarding page.

Final Position

The project should absolutely bring in OpenClaw, Hermes, NemoTron, ElephanAlpha, Aider, Codex, Claude Code, OpenHands, LangGraph, CrewAI, ADK, n8n, Dify, Flowise, and Open WebUI-style agents. But the correct integration is not "everyone gets full access." It is a gated agent economy:

  • Telegram coordinates and makes the ecosystem alive.
  • MCP/A2A routes execute work.
  • Agent Cards define identity.
  • Sandboxes protect production.
  • Payment/ledger controls protect monetization.
  • Reputation decides who grows.

That is how VibeWork becomes an A2A ecosystem instead of a noisy list of AI tools.