feat(Phase 3): 學習閉環重建 — 三根因修復 + 2x EWMA + Evolver Agent
Some checks failed
CD Pipeline / build-and-deploy (push) Failing after 19m7s
Type Sync Check / check-type-sync (push) Failing after 1m18s

ADR-083 Phase 3 學習閉環重建:

**三根因修復**
- approval_execution.py: fire-and-forget create_task → await asyncio.wait_for(timeout=30) × 2
  (成功路徑 L265 + 失敗路徑 L353,超時記錄 learning_trigger_timeout metric,主流程不 crash)
- models/approval.py: ApprovalRequestBase 新增 matched_playbook_id 欄位
- decision_manager.py: _auto_execute 建立 ApprovalRequest 時填充 matched_playbook_id
- learning_service.py: 雙路徑查找 _matched_pb_id(matched_playbook_id + metadata fallback)

**2x EWMA 負向強化**
- models/playbook.py: 新增 trust_score: float = 0.3(EWMA 動態信任度欄位)
- repositories/playbook_repository.py: update_stats 加 EWMA
  成功: trust = 0.9 × old + 0.1 × 1.0
  失敗: trust = 0.8 × old + 0.2 × 0.0(衰減速度 2x)
  trust < 0.1 → log warning,等 Evolver 封存

**Evolver Agent(新建)**
- services/playbook_evolver.py: 三功能全靜態規則
  1. 低信任封存: trust < 0.1 → DEPRECATED
  2. 休眠封存: 30d 未使用 AND trust < 0.5 → DEPRECATED
  3. 相似合併: 症狀 Jaccard > 0.9 → 保留高 trust,封存低 trust
  AIOPS_P3_EVOLVER_ENABLED=False 預設關閉

**文件**
- ADR-083 學習閉環重建
- MASTER §8 Phase 3 完工記錄

AIOPS_P3_ENABLED=False(預設),骨架就位等統帥批准開啟

Co-Authored-By: Claude Sonnet 4.6(亞太)<noreply@anthropic.com>
This commit is contained in:
OG T
2026-04-15 14:01:28 +08:00
parent 7edb298a75
commit 7da64eaad2
9 changed files with 565 additions and 20 deletions

View File

@@ -143,6 +143,9 @@ class ApprovalRequestBase(BaseModel):
# 2026-04-14 Claude Sonnet 4.6: 上移 incident_id 到 Base
# 讓 ApprovalRequestCreate 也能攜帶(修 9b9ff5b 的 NoneAttr bug
incident_id: str | None = Field(default=None, description="關聯的 Incident ID")
# ADR-083 Phase 3: 命中的 Playbook ID學習迴路必填
# 2026-04-15 ogt + Claude Sonnet 4.6(亞太): Phase 3 matched_playbook_id 傳遞修復
matched_playbook_id: str | None = Field(default=None, description="命中的 Playbook ID供學習服務 EWMA 更新")
class ApprovalRequestCreate(ApprovalRequestBase):

View File

@@ -205,6 +205,12 @@ class Playbook(BaseModel):
success_count: int = Field(default=0, ge=0, description="成功執行次數")
failure_count: int = Field(default=0, ge=0, description="失敗執行次數")
last_used_at: datetime | None = Field(None, description="最後使用時間")
# ADR-083 Phase 3: EWMA 信任度0.0-1.0,初值 0.3
# 成功: trust_new = 0.9 * trust_old + 0.1 * 1.0
# 失敗: trust_new = 0.8 * trust_old + 0.2 * 0.02x 衰減)
# trust < 0.1 → 自動封存(由 Evolver Agent 處理)
# 2026-04-15 ogt + Claude Sonnet 4.6(亞太): Phase 3 EWMA 負向強化
trust_score: float = Field(default=0.3, ge=0.0, le=1.0, description="EWMA 動態信任度Phase 3 新增)")
# === 人工標記 ===
approved_by: str | None = Field(None, description="核准者")