OG T
604e38cf07
docs: Phase 14 紅區治理 + Skills 01/03 更新
...
- CLAUDE.md: 紅區治理章節
- Skills 01/03: 版本更新
- ADR/Architecture: 標準化
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-26 09:55:47 +08:00
OG T
9ea246c7c2
docs(logbook): Phase 12.4 取消 + 狀態更新
...
統帥裁定: 現有 Ollama→Gemini→Claude fallback 已足夠
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-26 09:55:09 +08:00
OG T
163d94a35b
docs(skills): Phase 14.2 CI/CD 架構審查 + dependency-cruiser 整合
...
- Skill 04: Runner 殭屍進程修復 + cancel-in-progress: false
- Skill 05: 新增 SRE QA 內容
- Skill 06: dependency-cruiser 依賴治理 (Layer Model + ADR-014)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-26 09:53:56 +08:00
OG T
45c3656004
fix(api): 修正 langfuse_client import 排序 (I001)
...
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-26 09:37:09 +08:00
OG T
46ab6a838a
fix(api): 修復 ruff lint 錯誤
...
- langfuse_client.py: import Callable from collections.abc
- telemetry.py: import block 格式化
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-26 09:27:00 +08:00
OG T
0172dad197
feat(ci): Phase 14.2 dependency-cruiser 整合
...
- 新增 pnpm dep-check 腳本
- CI lint job 新增 Dependency Check 步驟
- 修復 tsPreCompilationDeps (monorepo 相容)
83 模組、57 依賴、0 違規 ✅
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-26 09:18:51 +08:00
OG T
31f554962e
fix(ci): 改用 cancel-in-progress: false 避免 Runner 衝突
...
Runner 被取消時不會清理 _diag/pages,導致下一次 run 檔案衝突
改為排隊等待而非取消
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-26 01:08:07 +08:00
OG T
ac294c1e3c
fix(ci): 清理 _diag/pages 避免 log 檔衝突
...
Runner 並行執行時 _diag/pages/*.log 會產生衝突
新增清理該目錄的步驟
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-26 01:07:07 +08:00
OG T
8ee2437a7f
fix(ci): Runner 暫存目錄清理 - 永久修復
...
- 每個 Job 開始前清理 $RUNNER_TEMP/*
- 新增 crontab 每小時自動清理
- 新增 ~/bin/runner-cleanup.sh 腳本
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-26 01:05:49 +08:00
OG T
df6ba33a1d
fix(k8s): NetworkPolicy 新增 Langfuse LLMOps 連線規則
...
Phase 15.1 必要: 允許 Pod 連接 Langfuse (192.168.0.110:3100)
變更:
- 新增 port 3100 (Langfuse HTTP API)
- 更新版本 v1.0 → v1.1
- 更新註解說明
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-26 01:01:20 +08:00
OG T
414e59d55f
docs: Phase 15 觀測架構更新 Skills 04/05
...
Skill 04 v1.5:
- 新增 Phase 15 三層觀測架構章節
- Deep Linking URL 格式說明
- Trace Context 傳遞架構圖
Skill 05 v1.3:
- 新增 Phase 15 觀測性測試章節
- 三系統健康檢查腳本
- Trace Context 驗證測試
參考:
- project_phase15_langfuse.md (Phase 15 完整記錄)
- project_phase17_tech_debt.md (技術債規劃)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-26 00:58:27 +08:00
OG T
60f8d770dd
docs: Phase 15 首席架構師審查通過 + Phase 17 技術債規劃
...
審查結果:
- 架構分層: ✅ 通過
- leWOOOgo 5 問: ✅ 通過
- 依賴注入: ✅ 通過
- 測試: 46/46 通過
Phase 17 技術債 (統帥批准):
- agents.py: Router 直接存取 Redis
- metrics.py: Router 直接存取 DB
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-26 00:54:40 +08:00
OG T
b6cff31653
feat(api): Phase 15.3 Deep Linking 三系統互連
...
實現 Sentry ↔ SignOz ↔ Langfuse 零斷鏈觀測:
新增 deep_linking.py:
- SignOz Trace URL 生成器
- Langfuse Trace URL 生成器
- Sentry Issue URL 生成器
- get_all_links() 統一取得所有連結
整合點:
- main.py: Sentry before_send 注入 otel_trace_id + signoz_trace_url
- langfuse_client.py: 自動注入 OTEL trace_id 到 metadata
- openclaw.py: SignOz span 記錄 langfuse.trace_id 反向連結
架構圖:
┌─────────┐ trace_id ┌─────────┐ trace_id ┌──────────┐
│ Sentry │◄────────►│ SignOz │◄────────►│ Langfuse │
│ Errors │ │ Traces │ │ LLMOps │
└─────────┘ └─────────┘ └──────────┘
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-26 00:48:28 +08:00
OG T
0d31ccb911
feat(api): Phase 15.2 Redis Trace Context 傳遞
...
實現 Redis Streams 跨服務追蹤零斷鏈:
- telemetry.py: 新增 get_trace_context() + restore_trace_context()
- webhooks.py: Producer 注入 _trace_id, _span_id 到 Redis
- signal_worker.py: Consumer 還原 Trace Context 建立子 Span
架構: API → Redis Streams → Worker 完整追蹤鏈
格式: W3C Trace Context (traceparent)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-26 00:40:20 +08:00
OG T
1ac8965a7a
feat(api): Phase 15.1 Langfuse LLMOps 整合 + 模型升級
...
## 新功能
- Langfuse 自建部署 (192.168.0.110:3100)
- langfuse_client.py - LLM 呼叫追蹤包裝
- OpenClaw 整合 Langfuse trace
## 模型升級 (統帥批准)
- 生產預設: llama3.2:3b → qwen2.5:7b-instruct
- 摘要任務: llama3.2:3b (速度優先)
## 配置更新
- requirements.txt: +langfuse>=2.0.0
- config.py: +LANGFUSE_* 設定
- models.json: 更新 Ollama 模型配置
- K8s: Secret + ConfigMap 更新
## 審查通過
- 模組化檢查 ✅
- 核心測試 31/31 ✅
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-26 00:32:19 +08:00
OG T
31fabe8d61
fix(ci): 修復 CI 失敗問題
...
- lewooogo-core: 新增 placeholder 測試檔 (vitest)
- api: 修復 I001 import 排序 (ruff --fix)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 23:57:24 +08:00
OG T
2fb011470e
refactor(api): Phase 16 R3.4 完整 Repository 層整合
...
- incident_repository: 新增 get_status(), update_status() 方法
- incidents.py: feedback + debug 端點全面改用 Repository
- 消除所有 Router 層直接 DB 存取 (符合積木化鐵律)
- trust_engine.py: 修復 import 順序 lint 警告
- pre-commit hook: 修正誤判問題 (排除刪除行+註解行)
- LOGBOOK: 更新 Phase 16 完成狀態
驗證結果: 31/31 測試通過
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 23:47:01 +08:00
OG T
e0584bc181
refactor(api): Phase 16 R2 封存死代碼 + RiskLevel 統一
...
封存 (866 行):
- routes/approvals.py → _archived/routes/ (477 行,未註冊死代碼)
- services/approval.py → _archived/services/ (389 行,僅被死代碼使用)
合併 RiskLevel:
- models/approval.py 新增 HIGH (從 trust_engine.py 合併)
- trust_engine.py 改 import from models/approval.py
- 保留舊定義為註解供回滾
更新 services/__init__.py:
- 移除已封存模組的 import (註解保留回滾路徑)
驗證:
- RiskLevel 統一: models 與 trust_engine 使用同一 class
- 24 個 action_parsing 測試通過
回滾指令見 _archived/README.md
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 23:14:24 +08:00
OG T
0afaea63f8
fix(api): Phase 16 R4 測試修復 - ParsedOperation 向後兼容
...
問題:
- test_action_parsing.py 導入路徑未更新 (舊: approvals.py)
- ParsedOperation dataclass 不支援 tuple 解包
修復:
- 更新測試導入至 src.services.operation_parser
- 新增 ParsedOperation.__iter__() 支援 tuple 解包
測試: 24/24 passed (100%)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 23:00:03 +08:00
OG T
4b3d98cd0b
fix(api): 修復 Repository 層 lint 錯誤
...
- 移除未使用的 imports
- 修正 import 排序
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 22:25:52 +08:00
OG T
663b80ab29
chore: 加強 .gitignore 防止 Claude Code 干擾 CI/CD
...
新增忽略:
- .claude/scheduled_tasks.lock
- .cursor/
- .agents/memory/
防止 worktrees 等 AI 工具暫存檔干擾 checkout
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 22:17:08 +08:00
OG T
27f20f4155
fix(git): 移除誤提交的 .claude/worktrees 目錄
...
此目錄已在 .gitignore 中,但之前被意外 commit
這導致 CI/CD 的 checkout 步驟出現 submodule 錯誤
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 22:12:54 +08:00
OG T
716b94f60a
feat(api): Phase 16 R4.2 抽取 ApprovalExecutionService
...
Strangler Fig Pattern: 從 approvals.py 抽取執行編排邏輯
新增:
- src/services/approval_execution.py (271 行)
- ApprovalExecutionService class
- 整合 OperationParser + Executor + Timeline + Notifications
瘦身成果:
- approvals.py: 1097 → 787 行 (-310 行)
- R4 總計: 移除 310 行內嵌業務邏輯
CI/CD 修復:
- 移除危險的 rm -f ~/actions-runner-* 指令
- 改用 checkout clean: true + workspace 內清理
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 22:04:15 +08:00
OG T
aefd351e20
docs: 更新 LOGBOOK Phase 16 R4.1 進度
...
- OperationParser 抽取完成
- approvals.py 瘦身 1097 → 988 行
- Runner diag log 衝突問題記錄
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 21:57:38 +08:00
OG T
31cf2ddbe7
feat(api): Phase 16 R4.1 抽取 OperationParser 模組
...
Strangler Fig Pattern: 從 approvals.py 抽取操作解析邏輯
新增:
- src/services/operation_parser.py
- ParsedOperation dataclass
- 支援中英文指令解析 (kubectl/自然語言)
瘦身 approvals.py: 移除 117 行內嵌邏輯
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 21:52:27 +08:00
OG T
39eca4535b
fix(ci): 清理 Runner diag logs 避免 "file already exists" 衝突
...
Pre-flight Check 加入清理步驟:
- rm -f ~/actions-runner-awoooi/_diag/pages/*.log
- rm -f ~/actions-runner-awoooi-2/_diag/pages/*.log
同時修復 CI 和 CD workflow
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 21:49:17 +08:00
OG T
0e22680547
fix(cd): 清理 worktree 目錄避免 submodule 衝突
...
Deploy job 增加 rm -rf .claude/worktrees 清理步驟
解決 "no submodule mapping found" 錯誤
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 21:46:51 +08:00
OG T
f6a28d235c
feat(api): Phase 16 R3.4 ApprovalDBService DI 重構
...
變更:
- ApprovalDBService 新增 __init__(repository) 建構子
- get_approval() 支援 Repository 注入
- get_pending_approvals() 支援 Repository 注入
- get_approval_service(use_repository=True) 啟用 DI
絞殺者模式:
- use_repository=False (預設): 內嵌 DB 操作
- use_repository=True: 使用 ApprovalDBRepository
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 21:26:23 +08:00
OG T
bfda353270
fix(ci): 清理 .claude/worktrees 防止 submodule 錯誤
...
問題: Runner 上的 .claude/worktrees 被誤認為 submodule
解決: 在 checkout 前清理 worktrees 目錄
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 21:24:08 +08:00
OG T
75ef8fee0c
feat(api): Phase 16 R3.3 Repository 實作 + CI 修復
...
新增:
- ApprovalDBRepository: Approval CRUD 操作
- IncidentDBRepository: Incident CRUD 操作
- get_approval_repository/get_incident_repository 函數
修復:
- .gitignore 新增 .claude/worktrees/ (防止 CI 失敗)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 21:22:02 +08:00
OG T
fe76d0b108
feat(api): Phase 16 R3.1-R3.2 Repository 介面定義
...
新增:
- IApprovalRepository Protocol
- IIncidentRepository Protocol
- ITimelineRepository Protocol
設計: DI 友好的 Protocol 介面,Service 層只依賴抽象
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 21:17:51 +08:00
OG T
14dc77e4ad
chore(api): Phase 16 R2 封存舊版代碼
...
封存:
- incident_memory_v1.py (483 行) - 絞殺者模式前版本
- incident_engine_v1.py (657 行) - 絞殺者模式前版本
策略: 90 天後無問題才刪除 (2026-06-24)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 16:08:49 +08:00
OG T
ef12228cc7
docs: Phase 16 絞殺者模式啟用 - 48hr 驗證開始
...
- USE_NEW_ENGINE=true 已在 Production 啟用
- 驗證期: 2026-03-26 16:04 → 2026-03-27 16:04
- 所有組件 healthy
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 16:06:32 +08:00
OG T
708ea4686e
fix(cd): 修復 Build 跳過時的 ImagePullBackOff 問題
...
問題: 當 Build Web/API 被跳過時,Deploy 仍更新 image tag 到不存在的版本
解決: 根據 build job 結果條件性更新 image
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 16:02:44 +08:00
OG T
485cce8c01
docs: 更新 Phase 16 R1.2-R1.3 完成狀態
...
- LOGBOOK: 記錄絞殺者模式完成 + 架構圖
- 下一步: 部署驗證 → USE_NEW_ENGINE=true → 48hr 監控
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 15:55:48 +08:00
OG T
2637263093
feat(api): Phase 16 R1.3 IncidentEngine 絞殺者模式
...
新增:
- IncidentMemoryAdapter: 實作 IIncidentMemory Protocol
- BlastRadiusAdapter: 實作 IBlastRadiusAnalyzer Protocol
- get_incident_engine() 雙軌切換 (USE_NEW_ENGINE)
絞殺者模式設計:
- 預設 USE_NEW_ENGINE=false (使用內嵌版)
- 設為 true 時使用 lewooogo-brain IncidentEngine
- 回滾: kubectl set env deployment/awoooi-api USE_NEW_ENGINE=false
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 15:47:52 +08:00
OG T
21ecedded2
fix(api): 修復 incident_memory import 排序 (I001)
...
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 15:41:42 +08:00
OG T
b097567819
chore: Runner 穩定性 + 封存目錄結構
...
Runner 穩定性:
- 新增 setup-runner-watchdog.sh (5分鐘 Watchdog)
- 新增 setup-runner-2.sh (第二個 Runner 安裝)
封存策略:
- 建立 _archived/ 目錄結構
- 新增 ARCHIVE_LOG.md 封存紀錄模板
統帥裁示: 不要只是臨時解決,要徹底解決!
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 15:38:29 +08:00
OG T
20984fd354
feat(api): Phase 16 R1.2 完善 PostgreSQL 整合 + 封存策略
...
lewooogo-brain:
- 新增 IIncidentDbAdapter Protocol (DI 模式)
- load_incident 支援 Episodic Memory 回填
- persist_incident 透過 db_adapter 執行
apps/api:
- 新增 IncidentDbAdapter 實現 (SQLAlchemy 操作封裝)
- 絞殺者模式完整整合 lewooogo-brain + PostgreSQL
Skill 06 v1.4:
- 新增「封存而非刪除」策略 (統帥裁示)
- 封存目錄結構 + ARCHIVE_LOG.md 格式
- 90 天保留期 + 48hr 驗證期
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 15:31:03 +08:00
OG T
a202a2693a
feat(api): Phase 16 R1.2 絞殺者模式 (Strangler Fig Pattern)
...
- 新增 USE_NEW_ENGINE 設定開關 (預設 False)
- incident_memory.py 雙軌切換: 內嵌版本 ↔ lewooogo-brain
- 自動降級: lewooogo-brain 不可用時回退內嵌版本
- 回滾指令: kubectl set env deployment/awoooi-api USE_NEW_ENGINE=false
統帥批准 2026-03-26 立即執行
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 15:23:03 +08:00
OG T
cdbd6f0fa6
fix(api): 修復 MCP providers lint 錯誤
...
- interfaces.py: 修正 import 排序
- signoz_provider.py: 移除未使用變數
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 14:44:03 +08:00
OG T
643946e60c
refactor(api): ADR-015 MCP 模組化架構重構
...
## 重構內容
符合 leWOOOgo 積木化原則:
- 新增 interfaces.py: MCPToolProvider ABC 定義
- 新增 registry.py: Provider 註冊中心 (DI 模式)
- 新增 providers/: K8s, SignOz, Database 具體實作
- 重構 mcp_bridge.py: 透過 ProviderRegistry 委派執行
## 修復 Code Review 問題
- 🔴 移除 _execute_stdio logging 敏感 parameters
- 🔴 修復 conversational-view.tsx i18n 硬編碼
## 新增檔案
- apps/api/src/plugins/mcp/interfaces.py
- apps/api/src/plugins/mcp/registry.py
- apps/api/src/plugins/mcp/providers/__init__.py
- apps/api/src/plugins/mcp/providers/k8s_provider.py
- apps/api/src/plugins/mcp/providers/signoz_provider.py
- apps/api/src/plugins/mcp/providers/database_provider.py
- docs/adr/ADR-015-mcp-modular-architecture.md
- .dependency-cruiser.cjs (Phase 14.2 準備)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 14:31:32 +08:00
OG T
c0ad8f8686
fix(api): 方案 C - Incident 解析相容舊格式 Enum
...
問題: Redis 存有舊 Enum 值 (status='open', severity='critical')
導致 Pydantic 驗證失敗
解法:
- normalize_status(): 'open' → 'investigating'
- normalize_severity(): 'critical' → 'P0' 等
- 應用於 get_from_working_memory, get_active_incidents, _record_to_incident
優點:
- 零資料風險 (不動 Redis)
- 回滾 = git revert (秒級)
- 新舊格式都能讀
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 14:14:58 +08:00
OG T
805d353892
feat(web): Phase 11.3 響應式設計 - Mobile/Tablet 支援
...
ConversationalView 響應式改造:
- Mobile: 全屏詳情面板 + 返回按鈕
- Tablet: 64px 側邊欄 (w-64)
- Desktop: 80px 側邊欄 (w-80)
- i18n: backToList 翻譯
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 13:46:15 +08:00
OG T
7a8f869104
feat(api): Phase 13.2 #81 PostgreSQL MCP Tool 整合
...
整合 Approval/Incident/Timeline 查詢到 MCP Bridge:
- list_approvals: 列出授權請求 (可依狀態篩選)
- get_approval: 取得單一授權詳情
- list_incidents: 列出 Incident (可依狀態篩選)
- list_timeline: 列出最近時間軸事件
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 12:46:52 +08:00
OG T
23b753dbec
feat(api): Phase 13.2 #79 SignOz MCP Tool 整合
...
整合真實 SignOzClient 到 MCP Bridge:
- gold_metrics: RPS + Error Rate + P99 Latency
- trace_url: 動態 Trace URL 生成
- system_metrics: CPU/Disk 系統指標
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 12:44:46 +08:00
OG T
e36dab1aee
fix(ci): add Python and uv setup to Ollama test job
...
The self-hosted runner doesn't have uv pre-installed.
Add setup-python and setup-uv steps before running pytest.
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 12:30:58 +08:00
OG T
d31160f4e1
feat(api): Phase 13.2 #80 Kubernetes MCP Tool real implementation
...
- Integrate real ActionExecutor instead of mock responses
- kubectl_get: Execute real kubectl get with JSON output
- kubectl_delete: Dry-run validation + actual pod deletion
- kubectl_scale: Real kubectl scale command
- kubectl_restart: Deployment rollout restart with validation
- Database query placeholder for #81
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 12:30:13 +08:00
OG T
b8f9cd315c
fix(ci): replace jq with python3 for JSON parsing in Ollama test
...
The self-hosted runner doesn't have jq installed.
Use Python's json module as a portable alternative.
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 12:07:23 +08:00
OG T
e87ac11f4f
feat(web): Phase 11 UX improvements for approval card
...
- Change dashed border buttons to solid filled style for better clickability
- Add signature progress bar with visual indicator
- Add signed users list showing who has already signed
- Convert Blast Radius section to collapsible panel (auto-open for CRITICAL)
- Convert Dry-Run Checks to collapsible panel with pass/fail summary badge
- Add slide-in animations for expanded content
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com >
2026-03-25 11:45:04 +08:00