整合三大問題的完整解決方案: 1. Prometheus 規則未部署 (13條→40+條,含SentryDown/AlertChain) 2. 日誌收集但無log-based alerting 3. 自動修復只限K8s層,無Host Docker/systemd修復能力 包含: - 統一標籤規範 (layer/component/team/host) - Sprint 1: 規則部署+Sentry啟動+CD同步 - Sprint 2: SigNoz log alert + Sentry整合 - Sprint 3: SSH HostRepairAgent + Playbooks - SOP v4.0整合更新點 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>