wooo/awoooi

Files

Your Name 889b7b4229

Code Review / ai-code-review (push) Successful in 15s

Details

CD Pipeline / tests (push) Successful in 1m42s

Details

CD Pipeline / build-and-deploy (push) Successful in 3m58s

Details

CD Pipeline / post-deploy-checks (push) Has been cancelled

Details

Ansible / Reboot Recovery Contract / validate (push) Has been cancelled

Details

2026-06-26 11:55:21 +08:00

AI Agent 市場雷達與近期變更盤點

近期變更盤點

優先級	工作線	狀態	進度	下一步
`P0`	Product Governance Owner Response Dashboard / handoff 收斂	`read_model_ready_runtime_blocked`	`100%`	Owner questions 與 boundary acknowledgements 仍需逐項回覆。
`P0`	Status Cleanup Dashboard read-only API 正式化	`blocked_status_cleanup_apply_not_authorized`	`100%`	apply_allowed=false 前不得更新 project status 或 memory。
`P0`	Wazuh / IwoooS 可視性邊界	`blocked_waiting_manager_agent_registry_readback`	`35%`	等待 manager agent registry readback 與 live route readback。
`P0`	AI Agent market watch 最新 primary-source refresh	`market_refresh_done_integration_blocked`	`100%`	更新 scorecard 並進入 offline replay gate，不得直接替換。
`P1`	日報 / 週報 / 月報數據化報告	`report_contract_defined_runtime_delivery_blocked`	`65%`	接 Agent 工作量、Telegram receipt 與 human-review queue。
`P1`	工具 / 套件 / 服務 / 主機版本新鮮度	`read_only_inventory_defined_update_execution_blocked`	`55%`	定期產生版本 freshness snapshot；中低風險可 auto proposal，高風險維持人工審核。

做法	AWOOOI 判定	下一步
多 Agent handoff / specialist delegation	`partially_modeled`	將 OpenClaw / Hermes / NemoTron handoff 事件寫入可讀 timeline。
Tracing / tool call / guardrail observability	`missing_unified_trace`	建立 Agent run trace id，串接報告、Telegram receipt 與 replay outcome。
Durable execution / persistence / human-in-the-loop	`needed_for_incident_loop`	優先把 incident workflow kernel 設計成可暫停、恢復、審核與重放。
MCP / A2A / enterprise multi-agent interoperability	`watch_and_design`	MCP server 先做 read-only tool registry，再開 write adapter。
Evaluation / replay / profiling before integration	`strong_fit_for_nemotron`	NemoTron 維持 smoke / replay / evaluator，不直接接 production routing。
Agent SDK as programmable code/ops remediator	`candidate_for_remediation_lane`	只允許 no-write replay 與 patch proposal，禁止自動 merge / deploy。
Enterprise-scale ADK with evaluation and observability	`candidate_for_google_stack_review`	先納入 weekly watch，成本與資料邊界審核後才可 adapter。

Agent / 候選	建議角色	Gate 狀態	下一步
OpenClaw incumbent	生產仲裁者 / production decision core	`production_baseline`	formal_replacement_adr_and_promotion_gate_required
NVIDIA NeMo Agent Toolkit + Nemotron Fabric	離線 replay、模型能力評估、合約輸出 smoke gate	`integration_blocked`	refresh_source_evidence_then_5_record_smoke_only
NousResearch Hermes Agent	知識記憶、證據草稿、長期技能庫候選	`watch_only_blocked`	continue_watch_only_until_primary_source_evidence_is_sufficient
OpenAI Agents SDK Coordinator	Coordinator / handoff / tracing / guardrail 候選	`registered_no_review`	continue_weekly_primary_source_market_watch
LangGraph Incident Kernel	durable incident workflow kernel 候選	`registered_no_review`	continue_weekly_primary_source_market_watch
Claude Agent SDK Remediator	DevOps / code remediation patch proposal 候選	`integration_blocked`	refresh_scorecard_then_offline_replay_or_promotion_gate
Microsoft Agent Framework	MCP / A2A enterprise workflow 候選	`registered_no_review`	continue_weekly_primary_source_market_watch
Google Agent Development Kit Stack	Gemini / Vertex agent stack 候選	`registered_no_review`	continue_weekly_primary_source_market_watch
CrewAI Flows + Crews	快速多 Agent prototype 候選	`integration_blocked`	create_no_sdk_no_api_adapter_then_offline_replay

順序	工作	風險	自動化模式	完成定義
1	固定每週 AI Agent market watch 並產生治理 snapshot	`low`	`agent_auto_read_only`	每週一 09:00 Asia/Taipei 有 watch / integration / discovery / promotion / governance 五份 artifacts。
2	刷新 market capability scorecard	`medium`	`agent_propose_owner_review`	OpenAI / LangGraph / NeMo-Nemotron / Claude / Microsoft / Google / CrewAI 均有新版官方來源與分數差異。
3	建立 50 筆歷史 incident offline replay queue	`medium`	`agent_auto_prepare_human_approve_run`	replay fixture 不含 secret，候選結果可與 OpenClaw baseline 比較。
4	Agent 溝通 / 學習 / 成長可視化 readback	`medium`	`agent_auto_read_model`	每個 Agent 的 handoff、decision、learning writeback、review score 與 blocked action 可被前端和報告讀到。
5	Telegram Bot 報告與高風險審核橋接	`high`	`human_approve_before_send_or_action`	低中風險只告警回報，高風險需要 Telegram approval token / owner response 才能執行。
6	工具、套件、服務、主機版本自動 freshness 盤點	`medium`	`agent_auto_scan_agent_propose`	套件、服務、主機、MCP、AI provider、模型版本都有 stale / upgrade / rollback / approval gate。