| Plan | Author / Model | Page |
|---|---|---|
| Claude Code | This session, Plan Mode output | view |
| Cursor | Cursor Plan Mode | view |
| Copilot | Copilot Chat (VS Code) | view |
| Kimi | Moonshot Kimi (IDE agent) | view |
| Codex | OpenAI Codex | view |

| Plan | Headline verdict | Strongest class | Rollout aggressiveness | Real-money gate |
|---|---|---|---|---|
| Claude Code | NOT READY: 3 data-layer CRITICALs (n=0, walk-forward folds=0, dashboard 40.7h stale) block every downstream verdict | Unclear: agent's per-class baseline read returned n=0 across all classes | Block everything until §1/§2 fixed | Skill freshness PASS + populated payload + ≥1 fabrication-flag |
| Cursor | Fast-track EQUITY/COMMODITY/ETF; contain FOREX/CRYPTO/BOND | COMMODITY (PF 3.92), EQUITY (PF 1.60), ETF (PF 1.48, near-T2) | Moderate: 72h triage then phased scale | ≥2 classes sustain Tier-2 on rolling window + drift cleared |
| Copilot | Refresh-and-lock first; ETF + COMMODITY rollout first, CRYPTO curated sleeve parallel | ETF (cleanest OOS), COMMODITY (PF edge) | Conservative pilot: 2 consecutive weekly Tier-2 passes | 2 consecutive weekly Tier-2 snapshots per class |
| Kimi | Start real-money pilot in EQUITY+COMMODITY NOW at 1% risk; FOREX hard-stop; quarantine 3 toxic systems | COMMODITY (PF 2.08), EQUITY (PF 1.42) | Aggressive: 1% risk pilot this week | 7-check Go/No-Go gate (currently 0/7); items 1-4 are 1-hour fixes |
| Codex | Block all classes until all six clean (per user's stated "all classes first" preference); fix truth layer before edge work | EQUITY (strongest broad candidate) | Most conservative: no class until ALL six SHADOW-ready | Class state-machine: BLOCKED→REHAB→OOS_READY→SHADOW→LIVE_ELIGIBLE; 14-30d shadow per class |

All 5 plans claim to source `dashboard_data.json::performance.asset_class_health`. The numbers diverge enough that at least one plan is mis-reading the JSON:

| Class | Claude Code | Cursor | Kimi | Combined plan (peer 2026-05-11 20:00Z) |
|---|---|---|---|---|
| COMMODITY | n=0 / WR 67.2 / PF 3.97 | n=408 / WR 67.4 / PF 3.92 | n=816 / WR 48.7 / PF 2.08 | n=408 / WR 67.4 / PF 3.92 |
| EQUITY | n=0 / WR 53.7 / PF 1.58 | n=443 / WR 54.0 / PF 1.60 | n=428 / WR 52.8 / PF 1.42 | n=443 / WR 54.0 / PF 1.60 |
| CRYPTO | n=0 / WR 48.0 / PF 1.40 | n=7875 / WR 47.4 / PF 1.39 | n=8166 / WR 44.8 / PF 1.26 | n=7875 / WR 47.4 / PF 1.39 |
| FOREX | n=0 / WR 41.7 / PF 0.27 | n=1825 / WR 41.8 / PF 0.28 | n=1249 / WR 45.6 / PF 0.28 | n=1825 / WR 41.8 / PF 0.28 |
| ETF | — | n=100 / WR 60.0 / PF 1.48 | n=88 / WR 53.4 / PF 1.20 | n=100 / WR 60.0 / PF 1.48 |
| BOND | n=0 / WR 54.5 / PF 0.66 | n=11 / WR 54.5 / PF 0.66 | n=18 / WR 55.6 / PF 1.72 | n=11 / WR 54.5 / PF 0.66 |

- (a) Kimi read `by_asset_class.raw` instead of `performance.asset_class_health`; (b) Kimi read at a different moment between two refresh cycles. Note Kimi's CRYPTO n=8166 is higher than Cursor's 7875, consistent with a later read.
- Claude Code's agent read `asset_class_health[CLASS].closed_picks=0` from the wrong sub-block (possibly `walkforward.by_class[CLASS].n`), then misreported it as the verdict-grade n. This invalidates Claude Code's §1 finding ("STRUCTURAL BUG: all n=0") but does NOT invalidate the rest of its plan: the dragger list, baby_strats overfit flag, drift state, and BLOCKED list state were all confirmed by the second Explore agent against the source-of-truth files.

The §1 baseline in the Claude Code plan should be replaced with the Cursor / combined-plan numbers. The downstream P0 "fix asset_class_health.n=0 bug" action is not real: the bug is in the agent's read, not the data. Replace P0 #1 + P0 #2 with: verify Cursor / Kimi divergence by running both reads side-by-side; if Cursor is canonical, drop the n=0 P0s.

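The side-by-side read proposed above can be sketched in a few lines of Python. This is a minimal sketch, not the project's tooling: the three sub-block names come from the comparison, but the exact nesting, the `n` leaf name under `by_asset_class.raw`, and the helper names `dig`, `read_sample_sizes` and `divergent_classes` are assumptions made for illustration.

```python
import json
from pathlib import Path

CLASSES = ("COMMODITY", "EQUITY", "CRYPTO", "FOREX", "ETF", "BOND")

# Candidate locations for the per-class sample size. The sub-block names are
# taken from the comparison; the nesting and leaf names are assumptions.
CANDIDATE_PATHS = {
    "asset_class_health": ("performance", "asset_class_health", "{cls}", "closed_picks"),
    "by_asset_class.raw": ("by_asset_class", "raw", "{cls}", "n"),
    "walkforward.by_class": ("walkforward", "by_class", "{cls}", "n"),
}


def dig(node, keys):
    """Walk nested dicts; return None as soon as a key is missing."""
    for key in keys:
        if not isinstance(node, dict) or key not in node:
            return None
        node = node[key]
    return node


def read_sample_sizes(payload):
    """Return {class: {sub_block_label: n or None}} for every candidate path."""
    table = {}
    for cls in CLASSES:
        table[cls] = {
            label: dig(payload, [part.format(cls=cls) for part in path])
            for label, path in CANDIDATE_PATHS.items()
        }
    return table


def divergent_classes(table):
    """Classes for which the sub-blocks that are present disagree on n."""
    flagged = []
    for cls, reads in table.items():
        present = {value for value in reads.values() if value is not None}
        if len(present) > 1:
            flagged.append(cls)
    return flagged


if __name__ == "__main__":
    path = Path("audit_dashboard/data/dashboard_data.json")
    if path.exists():
        table = read_sample_sizes(json.loads(path.read_text()))
        for cls, reads in table.items():
            print(cls, reads)
        print("divergent:", divergent_classes(table))
```

Printing all three reads per class makes it visible at a glance which sub-block a plan's numbers most likely came from; a `None` means the assumed path does not exist in the payload and the path table needs adjusting.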
- `hf_stats.concept_drift.drift_alert=true` as a hard gate. Codex makes it the #1 blocker ("fix truth layer first").
- `crypto_soc_*`).

| Most aggressive | → | Most conservative |
|---|---|---|
| Kimi (1% risk pilot THIS WEEK on EQUITY+COMMODITY) | Cursor (Phase 1 triage 72h, then phased scale) ≈ Copilot (2-week conservative gate) | Codex (block ALL classes until all six SHADOW-ready) |

Codex notes the user "chose all classes first", which puts the conservative end as the user's stated preference.

- `alpha_engine/walkforward_validator.py`; capital-gate scaffold
- `kimi_signal_tracking`, `crypto_soc_*`, FOREX): 1-hour fixes
- `readiness.by_class`, `leaders.by_class`, `draggers.by_class`, `capped_vs_raw_pnl_gap`, `single_symbol_concentration`). Biggest payload-schema change.
- `/audit` DB-lineage telemetry card; backtests-vs-live consistency check.

Adopt user-stated preference (Codex's "all classes first") as the rollout posture. Mix in Kimi's 1-hour fixes for immediate P0 action and Codex's payload-contract extension for the structural fix. Use Cursor's measurable success criteria as the gate definitions.

- `kimi_signal_tracking` via `alpha_engine/config.py:216` `BLACKLISTED_STRATEGIES` (Kimi). Memory `feedback_gate_at_execution_not_generation`: verify enforcement at exec gate, not just intake.
- `baby_strats:crypto_soc_*` family via per-strategy `BLOCKED_ASSET_STRATEGY_PAIRS` at `audit_trail/quality_gates.py:1499` (Claude Code + Kimi + Cursor; existing proposal at `reports/baby_strats_overfit_quarantine_proposal_2026_05_10.md`).
- `docs/MUTATION_THREE_AXIS_PROTOCOL.md`).
- `multi_asset_cot` PF=19.19 via DB query against `ejaguiar1_stocks` (Claude Code): data integrity smoke test.
- `claude_gainer_st` winner-vs-blacklist contradiction (Claude Code; system shows PF 6.12 / n=3472 in systems yet appears in `BLACKLISTED_STRATEGIES` at `alpha_engine/config.py:216`).
- `alpha_engine/walkforward_validator.py` (Cursor); surface in `audit_trail/dashboard_generator.py`.
- `readiness.by_class` payload block (class state-machine fields: `stage`, `blockers`, `n_cumulative`, `oos_sharpe`, `oos_consistency`, `system_concentration`, `symbol_concentration`, `data_trust_ok`).
- `hf_stats.concept_drift.KS_D` uncomputed-zero bug + refresh 19-day stale `hf_stats`. Wire drift→auto-pause sizing when D > 0.10.
- `/audit` threshold text with `docs/PERFORMANCE_CHARTER.md` v1.0 (Codex).
- `last_signal_date` to systems payload (Claude Code), currently absent for all top-6 winners.

Adopt the strictest of the 5: Codex's all-classes-first state machine. No class receives live capital until ALL six major classes (CRYPTO/EQUITY/ETF/FOREX/COMMODITY/BOND) reach SHADOW state for 14 consecutive days, AND DB-health is green on all 6 sub-checks, AND `drift_alert` is false. This honors the user's stated "all classes first" preference and avoids per-class promotion races that the other 4 plans implicitly invite.

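The three-part gate described above can be written as a single predicate, which makes the "all classes first" rule easy to test. The following is a hedged sketch under stated assumptions, not the Codex plan's implementation: the stage names come from the class state machine in the comparison table, while `ClassReadiness`, `days_at_shadow`, `live_capital_allowed`, and the shape of the `db_health` mapping (sub-check name to pass/fail) are hypothetical.

```python
from dataclasses import dataclass
from enum import IntEnum


class Stage(IntEnum):
    """Class state machine, ordered so stages can be compared."""
    BLOCKED = 0
    REHAB = 1
    OOS_READY = 2
    SHADOW = 3
    LIVE_ELIGIBLE = 4


REQUIRED_CLASSES = ("CRYPTO", "EQUITY", "ETF", "FOREX", "COMMODITY", "BOND")
MIN_SHADOW_DAYS = 14
EXPECTED_DB_CHECKS = 6


@dataclass
class ClassReadiness:
    stage: Stage
    days_at_shadow: int = 0  # hypothetical field: consecutive days at SHADOW or later


def live_capital_allowed(by_class, db_health, drift_alert):
    """Return (allowed, reasons); allowed is True only when no reason blocks."""
    reasons = []
    for cls in REQUIRED_CLASSES:
        record = by_class.get(cls)
        if record is None:
            reasons.append(f"{cls}: no readiness record")
        elif record.stage < Stage.SHADOW:
            reasons.append(f"{cls}: stage {record.stage.name} is below SHADOW")
        elif record.days_at_shadow < MIN_SHADOW_DAYS:
            reasons.append(
                f"{cls}: {record.days_at_shadow}d at SHADOW, need {MIN_SHADOW_DAYS}"
            )
    if len(db_health) < EXPECTED_DB_CHECKS:
        reasons.append(
            f"DB-health: only {len(db_health)} of {EXPECTED_DB_CHECKS} sub-checks reported"
        )
    failing = sorted(name for name, ok in db_health.items() if not ok)
    if failing:
        reasons.append("DB-health failing: " + ", ".join(failing))
    if drift_alert:
        reasons.append("drift_alert is true")
    return (not reasons, reasons)
```

Returning the list of blocking reasons alongside the boolean keeps the gate auditable: a dashboard card can show why capital is withheld rather than a bare NOT READY.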
`money-maker-ready` skill works. All 5 plans cite the same canonical inputs (`dashboard_data.json::performance.asset_class_health`, `walkforward.by_class`, `fwd_vs_bt_divergence.rows`, `hf_stats.concept_drift`) and reach overlapping verdicts. The skill is succeeding at its stated job: making `/audit` real-money-readiness analyzable.

- `audit_dashboard/data/dashboard_data.json` (canonical payload; all 5)
- `audit_trail/dashboard_generator.py` (writer; Cursor, Copilot, Claude Code)
- `audit_trail/quality_gates.py:1499` `BLOCKED_ASSET_STRATEGY_PAIRS` (Claude Code, Cursor, Kimi)
- `alpha_engine/config.py:216` `BLACKLISTED_STRATEGIES` (Claude Code, Kimi)
- `alpha_engine/outcome_resolver.py` (Copilot, Codex)
- `alpha_engine/real_money_tracker.py` (Codex, proposes retire)
- `alpha_engine/walkforward_validator.py` (Cursor)
- `audit_dashboard/template.html` (Claude Code, Codex)
- `audit_trail/mysql_client.py` (Cursor, Copilot)
- `.github/workflows/audit-dashboard.yml` (Cursor, Copilot)
- `docs/PERFORMANCE_CHARTER.md` (all 5)
- `docs/STRATEGY_INVESTIGATION_BEFORE_KILL.md` (Claude Code, Cursor)
- `docs/MUTATION_THREE_AXIS_PROTOCOL.md` (Claude Code, Cursor)
- `.claude/skills/money-maker-ready/SKILL.md` (all 5)
- `reports/baby_strats_overfit_quarantine_proposal_2026_05_10.md` (Claude Code, referenced by 3 others implicitly)

User to pick a posture:

- `money-maker-ready` skill weekly; gate promotion on ≥1 fabrication-flag-clean + populated payload + all 11 sections green.

Recommendation: Option C as the strategic posture (matches stated preference + Codex framework) + Option A's P0 cluster as the immediate tactical actions (1-hour fixes are cheap insurance regardless of strategy).