From 6737a3d48bacff2ba37e8b818d7d458aaff643a0 Mon Sep 17 00:00:00 2001
From: Your Name
Date: Sun, 31 May 2026 19:20:51 +0800
Subject: [PATCH] docs(logbook): record web health probe rollout [skip ci]

---
 docs/LOGBOOK.md | 37 +++++++++++++++++++++++++++++++++++--
 1 file changed, 35 insertions(+), 2 deletions(-)

diff --git a/docs/LOGBOOK.md b/docs/LOGBOOK.md
index e4f88a2e..d2ad341e 100644
--- a/docs/LOGBOOK.md
+++ b/docs/LOGBOOK.md
@@ -31,9 +31,42 @@ local production server:
   GET http://127.0.0.1:3107/ -> 307 /zh-TW
 ```
 
-**Current overall progress (pre-deploy)**:
+**Gitea / Production deploy**:
 
-- Web rollout / Smoke stability: about 82%; the probe surface fix is done, pending verification after the Gitea deploy that restartCount no longer increases.
+```text
+56c8a41e fix(web): add cheap health probe endpoint
+
+CI/CD:
+  build-and-deploy=success
+  post-deploy=success
+  summary=API=✅; Web=✅; AlertChain=✅; SourceLink=✅; Monitoring=✅; Smoke=✅
+
+k8s awoooi-prod:
+  awoooi-web image=192.168.0.110:5000/awoooi/web:56c8a41e5b20c2e785c61c3d57b9496a598524a7
+  liveness=/api/health
+  readiness=/api/health
+  startup=/api/health
+  pod awoooi-web-6d4f748d5d-jr4ss restarts=0 ready=true
+  pod awoooi-web-6d4f748d5d-w8gjh restarts=0 ready=true
+  rollout status deploy/awoooi-web -> successfully rolled out
+
+pod-internal probe:
+  awoooi-web-6d4f748d5d-jr4ss GET 127.0.0.1:3000/api/health -> 200 {"status":"ok","service":"awoooi-web"}
+  awoooi-web-6d4f748d5d-w8gjh GET 127.0.0.1:3000/api/health -> 200 {"status":"ok","service":"awoooi-web"}
+
+public smoke:
+  https://awoooi.wooo.work/zh-TW/awooop/work-items?project_id=awoooi&incident_id=INC-20260530-0DD83C -> 200
+```
+
+**Deployment caveat**:
+
+- When GitOps first applied the probe path to the old `59b4943b` image, the transitional pod returned 404 because the old image did not yet have `/api/health`, and it briefly restarted 2 times; the old ready pods kept the service available.
+- After the `56c8a41e` image went live, both pods in the new replica set show `restarts=0`, and Smoke went from `⚠️` in the previous round back to `✅`.
+- Public `https://awoooi.wooo.work/api/health` is still routed to the FastAPI `/api/*` routes and returns 404; this is not the web probe endpoint. The K8s probes hit `:3000/api/health` inside the pod.
+
+**Current overall progress (post-deploy)**:
+
+- Web rollout / Smoke stability: about 96%; the probe surface fix is done, production Smoke=✅, and the new web pods have restartCount=0.
 - Frontend AI automation management UI sync: about 89%; Work Items is verified in production, and the probe fix keeps the next UI rollout from causing brief 502s.
 - Overall AI automation flywheel: about 74%; this round converges deployment stability and does not change the 24h auto-repair claim.
 - 24h full AI Agent auto-repair production claim: 0%; a fully automated repair loop still cannot be claimed.
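
The pod-internal probe results recorded in this logbook entry can be re-checked with a small script. The following is only a sketch, not part of the patch: the helper names `is_healthy` and `probe` are hypothetical, and the expected body shape is taken solely from the logged response `{"status":"ok","service":"awoooi-web"}`. It is deliberately stricter than a Kubernetes `httpGet` probe, which treats any status from 200 to 399 as success and does not inspect the body.

```python
import json
import urllib.error
import urllib.request


def is_healthy(status_code: int, body: str, service: str = "awoooi-web") -> bool:
    """Return True only for a 200 response whose JSON body matches the logged shape."""
    if status_code != 200:
        return False
    try:
        payload = json.loads(body)
    except json.JSONDecodeError:
        return False
    if not isinstance(payload, dict):
        return False
    return payload.get("status") == "ok" and payload.get("service") == service


def probe(url: str, timeout: float = 2.0) -> bool:
    """Fetch url and apply is_healthy; HTTP errors (e.g. 404) and network errors count as unhealthy."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return is_healthy(resp.status, resp.read().decode("utf-8"))
    except (urllib.error.URLError, OSError, UnicodeDecodeError):
        return False
```

Run from inside a web pod, `probe("http://127.0.0.1:3000/api/health")` targets the same path the K8s probes use. Pointing it at the public `https://awoooi.wooo.work/api/health` would be expected to report unhealthy, since the caveat above notes that this URL is routed to FastAPI and returns 404.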