OpenClaw Issue Digest: Critical Regressions in Codex Runtimes and Channel Stability
Open Issues
Recent activity in the OpenClaw repository reveals a series of critical regressions and stability challenges, particularly affecting the Codex app-server runtime and various messaging channel integrations. The most severe issues involve data loss during configuration migrations and silent message drops in production channels.
Codex Runtime & Context Stability
Several high-severity issues are impacting the Codex app-server. A critical bug (#84086) causes native Codex threads to churn repeatedly in Discord multi-agent channels, leading to a loss of accumulated cache state. This is compounded by a failure in the legacy mirrored-history fallback (#84084), which ignores the contextTokenBudget and caps high-window sessions at approximately 24k rendered characters, severely limiting the agent's memory during thread rebuilds.
Furthermore, a regression in the Codex app-server (#84110) is causing prompt rewrites during tool-call continuation turns. This busts the OpenAI prompt cache, dropping the cache ratio from ~93% to ~47% and increasing effective per-token costs by roughly 3.5x.
Channel & Delivery Regressions
Messaging channels are experiencing significant delivery failures. WhatsApp users are reporting that long or complex responses (over ~500 characters or containing markdown tables) are silently dropped in version 2026.5.18 (#84092). Similarly, Discord users are seeing a total failure of channel initialization after upgrading to the same version (#83972), leaving bots completely unresponsive.
Other channel-specific issues include:
- Feishu: Tools like
feishu_docare not being injected into agent tool lists during DM sessions (#84095), and interactive card content is failing to extract text when using post-format fallback content (#60380). - Telegram: A regression in version 4.5 causes DeepSeek preamble/reasoning text to leak into Telegram messages (#62121), and voice output remains inconsistent in normal reply flows (#61590).
- Mattermost: The plugin fails to detect file attachments because the WebSocket
postedevent fires before file linkage is finalized (#59576).
Configuration & Infrastructure Failures
Several "P1" severity issues are affecting the core gateway and CLI:
- Data Loss: The
openclaw doctor --fixcommand is corrupting Signal multi-account configs by inventing phantomaccounts.defaultblocks (#62763) and silently migrating intentionalopenai-codex/configs toopenai/, breaking PI+OAuth runtimes (#84038). - Auth Failures: The Codex harness is ignoring documented main-to-agent auth inheritance, treating populated auth profiles as unusable and bootstrapping from binaries instead (#84114).
- System Stability: Windows users are experiencing CLI crashes (stack overflow/heap OOM) due to the large ESM module graph in v2026.4.5 (#62055), and some users report a global 30-minute gateway stall on WSL2 (#61616).
Key Themes
1. The "Fragility of Migration"
There is a recurring theme of doctor --fix and version upgrades causing destructive changes. From Signal config corruption to the silent loss of cron jobs in v2026.4.2 (#60799), the automation intended to simplify maintenance is frequently introducing critical failures.
2. Context & Cache Degradation
Across the Codex and general agent runtimes, there is a struggle to maintain stable context. Whether it is the prompt-cache busting in Codex (#84110) or the redundant metadata injection in Telegram DMs causing hallucinations (#62077), the overhead of maintaining state is becoming a primary source of both cost and instability.
3. Silent Failures in Delivery
Many of the reported bugs involve "silent drops"—where the system logs no error, but the user receives no message. This is evident in the WhatsApp long-message drop (#84092) and the message tool's failure to normalize SendMessage arguments for Anthropic models (#84079), which leads to rejected calls without delivery.
Action Required
Immediate Attention Required (P1/High Severity)
- Fix
doctor --fixdestructive migrations: Immediate patches are needed for #62763 and #84038 to prevent further user data loss during config repairs. - Restore WhatsApp/Discord Delivery: Resolve the silent message drops in WhatsApp (#84092) and the initialization failure in Discord (#83972) to restore basic service for those channels.
- Address Codex Thread Churn: Investigate the dynamic-tool fingerprinting in #84086 to stop the repeated abandonment of native Codex threads.
Blocked or High-Impact Issues
- Codex Auth Inheritance: #84114 needs a fix to ensure secondary agents can correctly inherit auth profiles from the main agent, as currently documented.
- Windows CLI Stability: The V8 stack size issue (#62055) is blocking Windows users from using heavy CLI commands like
dashboard. - Cron Job Recovery: A loader fix is required for #60799 to prevent the silent clobbering of legacy
jobs.jsonfiles during upgrades.