Unified Audit Dashboard v99.0

Birds-eye view of ALL picks, portfolios, and system performance - loading...
✨ Jump to Active Picks 📖 Research 📊 Edge Stability 🏆 AI Leaderboard 🔍 Pick Funnel 🐝 Swarm Paper Picks
⚠ TRUTH LAYER Dashboard vs DB-reality gap · Per-class tiles below show post-resolver-v2 filtered performance. RAW at_raw_picks (Kimi + 2026-05-12 audit): 11.13% WR / PF 0.46. Refreshed 2026-05-29 (n=73,817 closed): WR 25.28% / PF 0.61 (outlier-capped) — uncapped PF 2.5 is a handful-of-rows outlier artifact, do NOT cite it. Either read: RAW is net-unprofitable.
No asset class currently passes industry-standard real-money thresholds (DSR>0.95, PBO<0.05, WFE>60%, live-Sharpe>0.5, n≥100). The earlier DSR=1.0 candidate (cot_positioning + CT=F) was FALSIFIED on 2026-05-13 after dedup audit found 6.33× over-emission (38 raw signals from only 6 unique CFTC release weeks); pilot status is now SHADOW_INSUFFICIENT_N with dsr=null. See paper_pilot.html for live pilot JSON and reports/cot_paper_pilot_overemission_falsified_20260513.md for the autopsy. real_money.html still gates the 10-step Lopez de Prado readiness; NFA — no real-money sizing without explicit greenlight + all gates clear.
✅ DATA QUALITY IMPROVEMENTS (2026-05-31): Ghost rows: 0 (was 22,947). Status-PnL mismatches: 0. Non-canonical statuses (WON/CLOSED_SL/CLOSED_TP/FLAT): all 0. Backup tables: removed. at_raw_picks UNKNOWN: 3 (was thousands). FOREX: PR #6 policy-gate shipped (all blocked except cta_cross_asset_tsmom SHORT). BOND: now emitting (78 raw picks). See updates/2026-05-31-pr1-data-integrity-repair.md.
⚠ COMMODITY filter-survival gap (2026-05-17): dashboard shows WR=85.5% (n=228) vs raw WR=60.2% (n=354). The −126 difference is picks excluded by _is_valid_resolved_pick() (missing / corrupt pnl_pct — concentrated in losing cta_replicator and combined_confidence trades). The 85.5% reflects only picks with validated pnl data, not the full cohort. Raw 60.2% is the conservative estimate. FOREX: WR=57.8% but PF=0.85 — wins avg 0.62% vs losses avg 1.00% (tight-TP / wide-SL asymmetry); total PnL=−15.84% confirms it is net-negative despite majority wins.
📏 ML CALIBRATION INVERTED High confidence != high win rate. CRYPTO conf≥0.90 → 14.4% WR; conf 0.50-0.60 → 60.3% WR. · EQUITY 0.85-0.90 → 20% WR (worst band); >0.90 recovers. · FOREX peak 0.75-0.80; 0.70-0.75 = danger.
Score-booster now applies per-class penalty/reward at alpha_engine/score_booster._calibrate_confidence() (CRYPTO >0.85 = −12, <0.6 = +3; EQUITY 0.85-0.90 = −15; FOREX ≥0.85 = −10). Until upstream generators recalibrate, do not size off raw confidence — weight off post-booster score.
🎯 MAJOR GOAL Phenomenal performance across ALL asset classes — sustainable, hedge-fund-grade.
Sports betting tracked at /live-monitor/sports-betting.html (separate KPIs).
EQUITY — FAIL (PF 0.70, WR 37.4%, n=567 closed_picks; 14d=PF 5.56 WR 66.7% but 61.6% single-source smart_money + 6 dup-groups). Only proven sleeve: stocks_rsi2_pullback n=70 WR 62.9%. DISPUTED (INCIDENT #96, 2026-06-04): after deduping the 5.6x INCIDENT #91 NULL-opened_at row inflation, stocks_rsi2_pullback is n=140 WR 33.9% avg -0.007% (losing). The 62.9% was raw-row pre-dedup. CRYPTO — FAIL (pf_registry policy-clean PF 0.98 WR 39.7% n=229; money_ready PF 0.96 WR 31.3% DSR 0.0073). DISPUTED: Smart/VA/HC/ELITE surfaces show PF 9-18 / WR 78-88% but 91.7% concentrated in claude_gainer_st with EXPIRED→WON mislabels — raw DB equivalent is WR 33%. ETF — INSUFF-N (pf_registry policy-clean n=1, dashboard n=13 PF 0.34). Backtests PF 2.05–4.50 (etf_sector_rotation, etf_rotation_vix_regime) are 100% orphan — no production wire-up. COMMODITY — FAIL+INSUFF-N (policy-clean n=3 only; FUTURES proxy n=11 WR 9.09% PF 0.475). CT=F DSR=1.0 was FALSIFIED 2026-05-13 (6.33× over-emission); pilot now SHADOW_INSUFFICIENT_N dsr=null. FOREX — BLOCKED (PR #6 consolidation 2026-05-31: all FOREX strategies blocked except cta_cross_asset_tsmom SHORT). Policy-clean net PF 0.29 / WR 46.1% / PnL -1026% (raw resolved cohort PF 9.67 is outlier-skewed; 7.6% WR on 12,673 resolved). Hard disable default ON in non_crypto_policy.py. forex_carry added to allowlist on probation. recent_exits.json cleaned of stale entries. BOND — ACCUMULATING (at_raw_picks: 78 rows, up from 2). Backtests PF 1.29–1.62 (bond_tlt_ief_v3, bond_hyg_lqd_v1) are orphan; universe limited to 5 symbols. First production emissions detected 2026-05-31. Per-class status below is static (historical snapshot); JS auto-updates live metrics on page load from money_ready_verdict.json and pf_registry.by_asset_class_policy_clean_net. DB health: ghost rows 0, status mismatch 0, non-canonical statuses 0. PR #6 FOREX consolidation shipped.
Tier definitions: T1 PF>2/WR>55/MDD<10 (Renaissance). T2 PF>1.5/WR>50/MDD<20 (Institutional). T3 PF>1.2/WR>48/MDD<30 (Retail-OK). Source: asset_class_health in audit_dashboard/data/dashboard_data.json (post-resolver-v2 noise filter, generated 2026-05-05T01:37Z). Resolver v2 shipped 2026-04-28; v2.1 bug bundle 2026-05-02. Historical re-resolve (re_resolve_historical_v2.py) pending — pre-fix labels still in closed_picks for picks closed before 2026-04-28.
⚠️ Two PF/WR figures may appear per class on this page. The card above pulls asset_class_health (full-history closed-trade aggregate, post-resolver-v2 noise filter). The lower-down hf_stats.by_asset_class panel uses a recent-subset window (typically last 60-90d closed picks). Recent figures often diverge from headline — CRYPTO recent PF 0.89 vs headline 1.25 (n=1650 vs n=8067), COMMODITY recent 1.09 vs 1.78. For deploy/sizing decisions, weigh the recent panel. For long-horizon strategy validation, weigh the headline. Source: dashboard_data.json::hf_stats.by_asset_class.
📊 Per-asset-class walk-forward (OOS) Out-of-sample metrics from walk_forward_by_class() — Sharpe colored: green > 0.5, yellow 0–0.5, red < 0.
Loading…
📊 Strategy IC Analysis Spearman rank correlation between elite_score and PnL outcome — does the scoring model actually rank winners above losers?
Loading…

Crypto + Non-Crypto Performance

1x - No leverage
Portfolios 🧪 Anti-Overfit (DSR) 💰 Paper Pilot (COT) 🏦 Real Money
How the feeds stack up:
Verified Alpha — PROVEN trust tier from vetted source systems. Bimodal: sub-cohort quality varies; check per-strategy stats before sizing.
Smart Picks — passes the strictest per-asset gates (min score, RR, forward WR, regime alignment). Tightest filter currently live.
High Conviction — forward-validated tier: FWD WR ≥55% (≥70% crypto/equity/forex), score ≥ asset floor, ≥5 forward trades, trust tier clean. Applies hc_filter.js gates + per-asset S/A/B contract.
Active Picks — all live picks that pass hygiene (dead-symbol guard, blocklist, status checks). Broadest feed.
Risk-adjusted metrics (Sharpe, max DD, net-of-cost PF, regime decomposition) pending — no headline point-estimate claims until they land. See docs/REMAINING_ENHANCEMENT_PROPOSALS_V3_2026_04_20.md.

Claude Top Picks Curated by Claude Opus 4.6

3 highest-conviction picks selected via multi-factor analysis: RSI positioning, volume liquidity, momentum quality, Genesis score, and sector narrative. Updated each scan cycle.

Entry/TP/SL levels auto-tracked against live Binance prices. Historical performance builds over time as rounds complete.

Loading...

5-AI Top Picks Battle Claude vs Antigravity vs Grok vs KIMI vs Mercury

Live paper trading competition. 5 AI agents selected their top picks independently. Prices auto-refresh from Binance. Who will win?

Loading...

Score Tracker - What-If Performance

Every 15 minutes, the dashboard snapshots the top-scored picks. This tracker shows how those picks performed over time - answering "what if you traded the top picks by score?"

Snapshot History (newest first)

🧠 Smart Picks ? Loading...

ℹ How Smart Picks Work: Every scan cycle, our engine scores all active picks on 6 dimensions: Direction matches regime (25%), Quality score (35%), Freshness (15%), TP upside remaining (15%), Higher-timeframe alignment (10%), Proven Winner Boost (+8-15pts).
NEW GATES: Crypto shorts blocked (15.3% WR live), Confidence floor at 0.70 (below = 23% WR), 14 banned systems auto-excluded. NEW SOURCE: Gainer Momentum Interceptor — intercepts top 24h gainers with confirmed momentum continuation. ? Glossary
ASSET CLASS:

🎯 What Makes a Winner - Decile Test Results 1,927 closed picks

▶ click to expand
📈 US Equity Picks (UEPS) Building track record · n=0/100
Long-term value (1y–3y+) and swing (1w–1m) US equity picks. Composite scoring per findings/SYNTHESIS.md §3: 0.55 × ValueComposite + 0.45 × QualityComposite × SafetyGate where ValueComposite blends Magic Formula + Acquirer's Multiple + FCF Yield, and SafetyGate requires Altman Z'' ≥ 1.10 AND Beneish M ≤ -1.78.
Tier classification deferred until n≥100 closed picks per docs/PERFORMANCE_CHARTER.md §10. Resolver (long-term) never closes on price drawdown alone — thesis-break + IV-attainment + horizon expiry only.

US Equity Picks

Building track record for Long-Term Value Holds. No picks emitted yet - the screener has not yet produced n≥100 closed trades.

This tab will populate once the weekly/daily screener writes its first batch of picks. See updates/long_term_value_project_2026-04-27/PROJECT.md for status.

Building track record · n=0/100
First weekly value-screener run scheduled. Picks will populate here once the runner emits.
See .github/workflows/value_screener_weekly.yml · Charter: docs/PERFORMANCE_CHARTER.md
🐝 SWARM PICK TRACKING — TRADINGVIEW Loading…
Per consensus tier
Per asset class
Per underlying model (sorted by WR▼)
Picks (filter by account / asset class / tier)