← Back to Updates

Money-Maker-Ready — 5-Agent Plan Comparison

RESEARCH SURFACE — NOT FINANCIAL ADVICE
Internal multi-agent planning output. No live-capital deployment from this page. Real-money sizing requires the gates listed below.
Generated 2026-05-11 · Sources: Claude Code (this session), Cursor real_money_edge_plan_ed80c0d8.plan.md, GitHub Copilot, Kimi IDE, ChatGPT Codex · Skill: .claude/skills/money-maker-ready/SKILL.md v1.0 (shipped 2026-05-11)

The 5 plans

PlanAuthor / ModelPage
Claude Code (Opus 4.7)This session — Plan Mode outputview
CursorCursor Plan Modeview
GitHub CopilotCopilot Chat (VS Code)view
Kimi IDEMoonshot Kimi (IDE agent)view
ChatGPT CodexOpenAI Codexview

Headline verdicts at a glance

PlanHeadline verdictStrongest classRollout aggressivenessReal-money gate
Claude Code NOT READY — 3 data-layer CRITICALs (n=0, walk-forward folds=0, dashboard 40.7h stale) block every downstream verdict Unclear — agent's per-class baseline read returned n=0 across all classes Block everything until §1/§2 fixed Skill freshness PASS + populated payload + ≥1 fabrication-flag
Cursor Fast-track EQUITY/COMMODITY/ETF; contain FOREX/CRYPTO/BOND COMMODITY (PF 3.92), EQUITY (PF 1.60), ETF (PF 1.48, near-T2) Moderate — 72h triage then phased scale ≥2 classes sustain Tier-2 on rolling window + drift cleared
GitHub Copilot Refresh-and-lock first; ETF + COMMODITY rollout first, CRYPTO curated sleeve parallel ETF (cleanest OOS), COMMODITY (PF edge) Conservative pilot — 2 consecutive weekly Tier-2 passes 2 consecutive weekly Tier-2 snapshots per class
Kimi IDE Start real-money pilot in EQUITY+COMMODITY NOW at 1% risk; FOREX hard-stop; quarantine 3 toxic systems COMMODITY (PF 2.08), EQUITY (PF 1.42) Aggressive — 1% risk pilot this week 7-check Go/No-Go gate (currently 0/7); items 1-4 are 1-hour fixes
ChatGPT Codex Block all classes until all six clean (per user's stated "all classes first" preference); fix truth layer before edge work EQUITY (strongest broad candidate) Most conservative — no class until ALL six SHADOW-ready Class state-machine: BLOCKED→REHAB→OOS_READY→SHADOW→LIVE_ELIGIBLE; 14-30d shadow per class

Per-class baseline reads — RERUN-TO-RERUN DISAGREEMENT

All 5 plans claim to source dashboard_data.json::performance.asset_class_health. The numbers diverge enough that at least one plan is mis-reading the JSON:

ClassClaude CodeCursorKimiCombined plan (peer 2026-05-11 20:00Z)
COMMODITYn=0 / WR 67.2 / PF 3.97n=408 / WR 67.4 / PF 3.92n=816 / WR 48.7 / PF 2.08n=408 / WR 67.4 / PF 3.92
EQUITYn=0 / WR 53.7 / PF 1.58n=443 / WR 54.0 / PF 1.60n=428 / WR 52.8 / PF 1.42n=443 / WR 54.0 / PF 1.60
CRYPTOn=0 / WR 48.0 / PF 1.40n=7875 / WR 47.4 / PF 1.39n=8166 / WR 44.8 / PF 1.26n=7875 / WR 47.4 / PF 1.39
FOREXn=0 / WR 41.7 / PF 0.27n=1825 / WR 41.8 / PF 0.28n=1249 / WR 45.6 / PF 0.28n=1825 / WR 41.8 / PF 0.28
ETFn=100 / WR 60.0 / PF 1.48n=88 / WR 53.4 / PF 1.20n=100 / WR 60.0 / PF 1.48
BONDn=0 / WR 54.5 / PF 0.66n=11 / WR 54.5 / PF 0.66n=18 / WR 55.6 / PF 1.72n=11 / WR 54.5 / PF 0.66

Diagnosis

Correction

The §1 baseline in the Claude Code plan should be replaced with the Cursor / combined-plan numbers. The downstream P0 "fix asset_class_health.n=0 bug" action is not real — the bug is in the agent's read, not the data. Replace P0 #1 + P0 #2 with: verify Cursor / Kimi divergence by running both reads side-by-side; if Cursor is canonical, drop the n=0 P0s.

Convergence (where all 5 plans agree)

  1. FOREX is broken. PF 0.27-0.28 across all reads. Every plan either kills, hard-caps, or rehabs-before-kill. No plan proposes scaling FOREX.
  2. Drift alert is a blocker. All 5 cite hf_stats.concept_drift.drift_alert=true as a hard gate. Codex makes it the #1 blocker ("fix truth layer first").
  3. baby_strats family is overfitted. 12 divergence rows. Claude Code, Cursor, Copilot, Kimi all flag for surgical quarantine. Kimi names the variants (crypto_soc_*).
  4. kimi_signal_tracking is a dragger. -954% PnL / PF 0.26. Named in Claude Code, Kimi, Codex. Cursor + Copilot wrap into broader "dragger quarantine" without name.
  5. EQUITY + COMMODITY are the two closest-to-ready classes. 4/5 plans say so (Codex defers but acknowledges EQUITY strongest qualitatively).
  6. ETF needs sample expansion. 4/5 plans want n≥100→150-200 before promotion.
  7. BOND is too thin. All 5 plans keep BOND paper-only.
  8. Walk-forward coverage is missing for COMMODITY (and BOND). All plans that touched walk-forward note this gap.
  9. No live capital today. Even Kimi's "aggressive" plan gates the pilot on 4 fixes-of-the-day clearing first.

Divergence (where the plans split)

Axis 1 — Rollout aggressiveness

Most aggressiveMost conservative
Kimi (1% risk pilot THIS WEEK on EQUITY+COMMODITY)Cursor (Phase 1 triage 72h, then phased scale) ≈ Copilot (2-week conservative gate)Codex (block ALL classes until all six SHADOW-ready)

Codex notes the user "chose all classes first" — that puts the conservative end as the user's stated preference.

Axis 2 — What to fix first

Axis 3 — Real-money gate design

Axis 4 — Net-new infrastructure proposed

Recommended convergence plan (synthesis)

Adopt user-stated preference (Codex's "all classes first") as the rollout posture. Mix in Kimi's 1-hour fixes for immediate P0 action and Codex's payload-contract extension for the structural fix. Use Cursor's measurable success criteria as the gate definitions.

P0 (next 24h — fastest fixes with biggest impact)

  1. Blacklist kimi_signal_tracking via alpha_engine/config.py:216 BLACKLISTED_STRATEGIES (Kimi). Memory feedback_gate_at_execution_not_generation: verify enforcement at exec gate, not just intake.
  2. Surgically quarantine baby_strats:crypto_soc_* family via per-strategy BLOCKED_ASSET_STRATEGY_PAIRS at audit_trail/quality_gates.py:1499 (Claude Code + Kimi + Cursor; existing proposal at reports/baby_strats_overfit_quarantine_proposal_2026_05_10.md).
  3. Hard-cap FOREX sizing at 0 until PF ≥ 0.8 — explicit per-class gate, not silent kill (Kimi + Cursor; respect mutate-before-kill protocol from docs/MUTATION_THREE_AXIS_PROTOCOL.md).
  4. Verify max-drawdown calculation uses capped PnL (Kimi flagged 680% MDD anomaly).
  5. Verify multi_asset_cot PF=19.19 via DB query against ejaguiar1_stocks (Claude Code) — data integrity smoke test.
  6. Resolve claude_gainer_st winner-vs-blacklist contradiction (Claude Code; system shows PF 6.12 / n=3472 in systems yet appears in BLACKLISTED_STRATEGIES at alpha_engine/config.py:216).
  7. Add walk-forward coverage for COMMODITY + BOND in alpha_engine/walkforward_validator.py (Cursor); surface in audit_trail/dashboard_generator.py.

P1 (week 1 — structural)

  1. Implement Codex's readiness.by_class payload block (class state-machine fields: stage, blockers, n_cumulative, oos_sharpe, oos_consistency, system_concentration, symbol_concentration, data_trust_ok).
  2. Drift detector — fix hf_stats.concept_drift.KS_D uncomputed-zero bug + refresh 19-day stale hf_stats. Wire drift→auto-pause sizing when D > 0.10.
  3. Reconcile /audit threshold text with docs/PERFORMANCE_CHARTER.md v1.0 (Codex).
  4. Add last_signal_date to systems payload (Claude Code) — currently absent for all top-6 winners.

P2 (week 2-4 — class rehabs in parallel)

Real-money gate (consensus)

Adopt the strictest of the 5: Codex's all-classes-first state machine. No class receives live capital until ALL six major classes (CRYPTO/EQUITY/ETF/FOREX/COMMODITY/BOND) reach SHADOW state for 14 consecutive days, AND DB-health is green on all 6 sub-checks, AND drift_alert is false. This honors the user's stated "all classes first" preference and avoids per-class promotion races that the other 4 plans implicitly invite.

Plan-level lessons learned

Files cited across all 5 plans

Next step

User to pick a posture:

Recommendation: Option C as the strategic posture (matches stated preference + Codex framework) + Option A's P0 cluster as the immediate tactical actions (1-hour fixes are cheap insurance regardless of strategy).