ADE-Bench

Benchmarks

SignalPilot holds the highest score ever on dbt Labs' ADE-Bench — resolving 62 of 64 tasks on the analytics-engineering benchmark.

Full task-by-task evaluation results below.

Read the full ADE-Bench report
96.9% pass rate·62 passed·2 failed·64 total·claude-sonnet-4-6

Run ID: ade-full-v2 · Suite: ade-bench · ADE-Bench GitHub