OpenClaw Development Digest: Enhancing Reliability, Diagnostics, and Prompt Efficiency
Merged PRs
- fix(ios): recover rotated gateway certificates Original PR
- fix(gateway): avoid sync restart sentinel startup probes Original PR
- fix(heartbeat): prevent ack-only pending delivery loops Original PR
- fix(slack): include API error details in log messages Original PR
- [AI-assisted] fix(cron): surface nested lane timeout result Original PR
- [AI-assisted] fix(slack): enable native socket reconnect Original PR
- fix(feishu): fall back from missing thread replies Original PR
- fix(tasks): route group acp completions through parent Original PR
- fix(context): persist Codex run context maps Original PR
- [Fix] Wait for Pi abort settle before cleanup Original PR
- fix gateway message action media roots Original PR
- [AI-assisted] fix(agents): remove sessions_yield spawn guidance Original PR
- fix Telegram stale reply context Original PR
- fix(codex): normalize thread id/sessionId cross-fill before schema validation Original PR
- [codex] Persist external CLI OAuth refreshes Original PR
- fix(imessage): WARN-log when private API bridge is unavailable Original PR
- fix(doctor): don't flag the live compatibility agent dir as orphan Original PR
- fix(status): surface model-pricing health degradation Original PR
- [codex] Expose active model plugin context Original PR
- refactor: trim OpenClaw prompt guidance Original PR
- fix(cli): surface durable delivery status Original PR
- fix(codex): mark native tools active for diagnostics Original PR
- Add /context map treemap Original PR
- fix(auto-reply): suppress stale foreground replies Original PR
- fix(telegram): preserve spacing before numbered sections Original PR
Key Changes
Core Infrastructure & Reliability
Several PRs targeted critical stability issues and event-loop performance. A notable fix addressed a regression where update-sentinel caused synchronous filesystem probes during gateway startup, leading to significant event-loop delays (up to 9 seconds) on certain environments. Additionally, the heartbeat system was patched to prevent "death loops" where ack-only pending deliveries would perpetually block future heartbeats.
In the agent orchestration layer, a fix was introduced to ensure that embedded Pi aborts wait for a bounded settle barrier before releasing session write locks, preventing race conditions where subsequent turns could write to a transcript while a previous abort was still unwinding.
Codex & Model Integration
Codex integration received significant updates to improve protocol compatibility and observability. A critical fix was implemented to normalize thread.id and thread.sessionId before schema validation, resolving an issue where agents failed to boot due to missing required properties in the app-server response. To improve diagnostics, Codex-native tools (like bash) now emit trusted diagnostic events, preventing the watchdog from incorrectly aborting long-running native tools as stale embedded runs.
Prompt Optimization & UX
In a major refactor, the canonical agent prompt guidance was trimmed across skills, tooling, and safety sections. This resulted in a token reduction of approximately 10-12% for common Codex happy-path prompts. For example, Telegram direct prompts saw a reduction from 19,414 to 17,422 tokens.
User experience improvements include:
- Telegram Context Maps: The addition of
/context mapprovides a WinDirStat-style treemap image of session context contributors. - Stale Reply Suppression: A freshness guard now suppresses older foreground auto-reply finals if a newer inbound message has already started for the same session target, preventing out-of-order responses in channels like WhatsApp.
- Feishu Fallback: Feishu group replies now fall back to top-level sends if a thread reply targets a withdrawn or missing message.
Diagnostics & Tooling
Diagnostic visibility was expanded across several areas. The openclaw status and health commands now surface degraded model-pricing states, and Slack API errors now include structured details (scopes, error codes) rather than generic error messages. The openclaw doctor command was also updated to stop falsely flagging the legacy compatibility agent directory as an orphan.
Impact
These changes collectively reduce operational overhead and improve the robustness of agent-user interactions. The prompt trimming directly lowers token costs and reduces latency for every turn. The resolution of the heartbeat and sentinel probes removes silent failures and event-loop saturation that previously rendered the system unusable in specific network or OS environments.
From a user perspective, the suppression of stale replies and the improved Feishu fallback ensure a more coherent conversational flow, while the new context mapping tools provide deeper insight into how the model is utilizing session history. The Codex normalization and diagnostic updates ensure that high-capability agents can boot reliably and execute long-running tasks without premature termination.